<?xml version="1.0" encoding="utf-8"?><feed xmlns="http://www.w3.org/2005/Atom" ><generator uri="https://jekyllrb.com/" version="3.10.0">Jekyll</generator><link href="/feed.xml" rel="self" type="application/atom+xml" /><link href="/" rel="alternate" type="text/html" /><updated>2026-04-26T21:46:38+00:00</updated><id>/feed.xml</id><title type="html">Prominent Johnson’s Blog</title><subtitle>Pushing robotic limits, empowering everyday life.</subtitle><entry><title type="html">What Is a Diffusion Model?</title><link href="/ai-concepts/2026/04/26/what-is-diffusion-model.html" rel="alternate" type="text/html" title="What Is a Diffusion Model?" /><published>2026-04-26T09:00:00+00:00</published><updated>2026-04-26T09:00:00+00:00</updated><id>/ai-concepts/2026/04/26/what-is-diffusion-model</id><content type="html" xml:base="/ai-concepts/2026/04/26/what-is-diffusion-model.html"><![CDATA[<section data-lang="en">

  <h2 id="one-sentence-summary">One-Sentence Summary</h2>

  <p>A diffusion model is a generative model that learns how to create data by reversing a gradual noising process.</p>

  <h2 id="why-it-matters">Why It Matters</h2>

  <p>Diffusion models are widely used for image, audio, video, and 3D generation because they can produce high-quality samples while giving the model a stable learning objective.</p>

  <h2 id="core-ideas">Core Ideas</h2>

  <ul>
    <li><strong>Forward process:</strong> gradually add noise to clean data.</li>
    <li><strong>Reverse process:</strong> train a model to remove noise step by step.</li>
    <li><strong>Conditioning:</strong> guide generation with text, images, labels, or other signals.</li>
    <li><strong>Sampling:</strong> start from noise and repeatedly denoise until a sample appears.</li>
  </ul>

  <h2 id="placeholder-example">Placeholder Example</h2>

  <p>For image generation, the model starts with random noise and gradually turns it into a coherent image according to the prompt or condition.</p>

  <h2 id="notes-to-expand-later">Notes to Expand Later</h2>

  <ul>
    <li>Add a simple noise-to-image diagram.</li>
    <li>Explain the difference between DDPM and latent diffusion.</li>
    <li>Add a short section on why denoising is easier than direct generation.</li>
  </ul>

</section>

<section data-lang="zh">

  <h2 id="section">一句话总结</h2>

  <p>Diffusion model 是一种生成模型，它通过学习“如何反向去噪”来生成数据。</p>

  <h2 id="section-1">为什么重要</h2>

  <p>Diffusion model 常用于图像、音频、视频和 3D 生成，因为它可以生成高质量样本，同时训练目标相对稳定。</p>

  <h2 id="section-2">核心概念</h2>

  <ul>
    <li><strong>Forward process:</strong> 从干净数据开始，逐步加入噪声。</li>
    <li><strong>Reverse process:</strong> 训练模型一步一步去除噪声。</li>
    <li><strong>Conditioning:</strong> 用文本、图像、标签或其他信息引导生成过程。</li>
    <li><strong>Sampling:</strong> 从随机噪声开始，反复去噪，直到得到最终样本。</li>
  </ul>

  <h2 id="section-3">占位例子</h2>

  <p>在图像生成中，模型一开始面对的是随机噪声，然后根据 prompt 或其他条件逐步把噪声变成一张有结构的图像。</p>

  <h2 id="section-4">之后可以扩展</h2>

  <ul>
    <li>加一个从噪声到图像的简单示意图。</li>
    <li>解释 DDPM 和 latent diffusion 的区别。</li>
    <li>写一小节说明为什么“去噪”比直接生成更容易建模。</li>
  </ul>

</section>]]></content><author><name></name></author><category term="ai-concepts" /><category term="diffusion-model" /><category term="generative-ai" /><category term="denoising" /><summary type="html"><![CDATA[A bilingual template post explaining the intuition behind diffusion models.]]></summary></entry><entry><title type="html">What Is a Transformer?</title><link href="/ai-concepts/2026/04/26/what-is-transformer.html" rel="alternate" type="text/html" title="What Is a Transformer?" /><published>2026-04-26T08:00:00+00:00</published><updated>2026-04-26T08:00:00+00:00</updated><id>/ai-concepts/2026/04/26/what-is-transformer</id><content type="html" xml:base="/ai-concepts/2026/04/26/what-is-transformer.html"><![CDATA[<section data-lang="en">

  <h2 id="one-sentence-summary">One-Sentence Summary</h2>

  <p>A Transformer is a neural network architecture that uses attention to decide which parts of the input are most relevant to each other.</p>

  <h2 id="why-it-matters">Why It Matters</h2>

  <p>Before Transformers, many sequence models processed text step by step. Transformers made it easier to compare all tokens in a sequence at once, which helped models learn long-range relationships more effectively.</p>

  <h2 id="core-ideas">Core Ideas</h2>

  <ul>
    <li><strong>Tokenization:</strong> split text or other data into small units.</li>
    <li><strong>Embedding:</strong> turn each token into a vector.</li>
    <li><strong>Self-attention:</strong> let each token look at other tokens and decide what matters.</li>
    <li><strong>Feed-forward layers:</strong> transform the attended information into richer features.</li>
  </ul>

  <h2 id="placeholder-example">Placeholder Example</h2>

  <p>In the sentence “the robot picked up the cup because it was light,” attention helps the model connect “it” with the likely object being discussed.</p>

  <h2 id="notes-to-expand-later">Notes to Expand Later</h2>

  <ul>
    <li>Add diagrams for query, key, and value.</li>
    <li>Explain positional encoding.</li>
    <li>Compare encoder-only, decoder-only, and encoder-decoder Transformers.</li>
  </ul>

</section>

<section data-lang="zh">

  <h2 id="section">一句话总结</h2>

  <p>Transformer 是一种神经网络结构，它通过 attention 机制判断输入中哪些部分彼此最相关。</p>

  <h2 id="section-1">为什么重要</h2>

  <p>在 Transformer 出现之前，很多序列模型会按顺序一步一步处理文本。Transformer 可以让序列中的所有 token 同时互相比较，因此更容易学习长距离依赖关系。</p>

  <h2 id="section-2">核心概念</h2>

  <ul>
    <li><strong>Tokenization:</strong> 把文本或其他数据拆成小单位。</li>
    <li><strong>Embedding:</strong> 把每个 token 转换成向量。</li>
    <li><strong>Self-attention:</strong> 让每个 token 观察其他 token，并判断哪些信息重要。</li>
    <li><strong>Feed-forward layers:</strong> 对 attention 后的信息做进一步变换。</li>
  </ul>

  <h2 id="section-3">占位例子</h2>

  <p>在句子 “the robot picked up the cup because it was light” 中，attention 可以帮助模型判断 “it” 更可能指代哪个对象。</p>

  <h2 id="section-4">之后可以扩展</h2>

  <ul>
    <li>加 query、key、value 的示意图。</li>
    <li>解释 positional encoding。</li>
    <li>比较 encoder-only、decoder-only 和 encoder-decoder Transformer。</li>
  </ul>

</section>]]></content><author><name></name></author><category term="ai-concepts" /><category term="transformer" /><category term="attention" /><category term="deep-learning" /><summary type="html"><![CDATA[A bilingual template post explaining the basic idea behind Transformers.]]></summary></entry></feed>