Sitemap

A list of all the posts and pages found on the site. For you robots out there, there is an XML version available for digesting as well.

Pages

Posts

Transformer

4 minute read

Published:

In this post, I will introduce the Transformer architecture, which is the foundation of modern large language models. To illustrate it, I will use a translation task as an example.

Embedding

3 minute read

Published:

In the world of machine learning, we use embeddings (mathematically speaking, vectors) to represent everything. In this post, I will introduce how to encode natural language into embeddings.

Attention

2 minute read

Published:

In this post, we will focus on the attention mechanism, which is a key component of the Transformer architecture.

Training Flow and Diffusion Models: Flow Matching and Score Matching

3 minute read

Published:

In our previous post, we defined the theoretical targets for Flow and Diffusion models. However, calculating these global targets directly is often mathematically intractable. In this post, we bridge the gap between theory and practice. We will explore:

From Noise to Data: ODEs, SDEs, and Generative Flows

4 minute read

Published:

This post is the first entry in a series introducing Flow Matching and Diffusion Models. The series may grow to cover a broad range of related topics as I expand on areas of personal and research interest.
Special credits to MIT.

Publications