Statistics of Optimal Transport and Genetic Model

Back to: Input MOC

Source:

2024-06-04 Course Notes

This note recorded the essential information of the data descriptive science summer school (?).

Notes

Introduction

ML and AI techs

How to control probability

What is Statistics

In this course: Analysis on the probability-features with limited observations and inferences

Introduction to Optimal Transport

Consider OT in distribution way

Definitions

Monge's Question

Kantorovichi's problem

Advantages of Wasserstein distance

Statistics of OT

Core: with observations X1...Xn of P, how to infer the W(P,Q), in the case of P is unknown, while Q is known.
Definitions

Curse of Dimensionality

Approach avoidance on curse of dimensionality

Method 1: Slicing, which projects data into a straight line. SWP(P,Q)

Method 2: force on the normalization of entropy

why we need to know error distribution

one-variable or discrete variable

It is difficult to infer multiple variable

Inferred error of OT projection

Apply it to the neural network

On Genetic Model

References

Weed, J., & Bach, F. (2019). Sharp asymptotic and finite-sample rates of convergence of empirical measures in Wasserstein distance. Bernoulli, 25(4 A), 2620-2648.