<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:podcast="https://podcastindex.org/namespace/1.0" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:media="http://search.yahoo.com/mrss/" xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:atom="http://www.w3.org/2005/Atom">
    <channel>
        <title>AI4LUV</title>
        <link>https://videos.luvina.net/c/ai4luv/videos</link>
        <description></description>
        <lastBuildDate>Thu, 09 Apr 2026 08:20:10 GMT</lastBuildDate>
        <docs>https://validator.w3.org/feed/docs/rss2.html</docs>
        <generator>PeerTube - https://videos.luvina.net</generator>
        <image>
            <title>AI4LUV</title>
            <url>https://videos.luvina.net/client/assets/images/icons/icon-512x512.png</url>
            <link>https://videos.luvina.net/c/ai4luv/videos</link>
        </image>
        <copyright>All rights reserved, unless otherwise specified in the terms specified at https://videos.luvina.net/about and potential licenses granted by each content's rightholder.</copyright>
        <atom:link href="https://videos.luvina.net/feeds/videos.xml?videoChannelId=1001" rel="self" type="application/rss+xml"/>
        <podcast:txt purpose="p20url">https://videos.luvina.net/feeds/podcast/videos.xml?videoChannelId=1001</podcast:txt>
        <item>
            <title><![CDATA[Stanford CS229 I Machine Learning I Building Large Language Models (LLMs)]]></title>
            <link>https://videos.luvina.net/w/hpJ9A916YAscuggebzaQTX</link>
            <guid>https://videos.luvina.net/w/hpJ9A916YAscuggebzaQTX</guid>
            <pubDate>Sun, 30 Nov 2025 02:45:55 GMT</pubDate>
            <description><![CDATA[For more information about Stanford's Artificial Intelligence programs visit: https://stanford.io/ai This lecture provides a concise overview of building a ChatGPT-like model, covering both pretraining (language modeling) and post-training (SFT...]]></description>
            <content:encoded><![CDATA[<p>For more information about Stanford's Artificial Intelligence programs visit: <a href="https://stanford.io/ai" target="_blank" rel="noopener noreferrer">https://stanford.io/ai</a></p>
<p>This lecture provides a concise overview of building a ChatGPT-like model, covering both pretraining (language modeling) and post-training (SFT/RLHF). For each component, it explores common practices in data collection, algorithms, and evaluation methods. This guest lecture was delivered by Yann Dubois in Stanford’s CS229: Machine Learning course, in Summer 2024.</p>
<p>Yann Dubois<br />
PhD Student at Stanford<br />
<a href="https://yanndubs.github.io/" target="_blank" rel="noopener noreferrer">https://yanndubs.github.io/</a></p>
<p>About the speaker: Yann Dubois is a fourth-year CS PhD student advised by Percy Liang and Tatsu Hashimoto. His research focuses on improving the effectiveness of AI when resources are scarce. Most recently, he has been part of the Alpaca team, working on training and evaluating language models more efficiently using other LLMs.</p>
<p>To view all online courses and programs offered by Stanford, visit: <a href="http://online.stanford.edu" target="_blank" rel="noopener noreferrer">http://online.stanford.edu</a></p>
<p>Chapters:<br />
00:00 - Introduction<br />
00:10 - Recap on LLMs<br />
00:16 - Definition of LLMs<br />
00:19 - Examples of LLMs<br />
01:16 - Importance of Data<br />
01:20 - Evaluation Metrics<br />
01:33 - Systems Component<br />
01:41 - Importance of Systems<br />
01:47 - LLMs Based on Transformers<br />
01:57 - Focus on Key Topics<br />
02:00 - Transition to Pretraining<br />
03:02 - Overview of Language Modeling<br />
04:17 - Generative Models Explained<br />
05:15 - Autoregressive Models Definition<br />
06:36 - Autoregressive Task Explanation<br />
07:49 - Training Overview<br />
08:48 - Tokenization Importance<br />
10:50 - Tokenization Process<br />
13:30 - Example of Tokenization<br />
16:00 - Evaluation with Perplexity<br />
20:50 - Current Evaluation Methods<br />
24:30 - Academic Benchmark: MMLU</p>
]]></content:encoded>
            <dc:creator>AI4LUV</dc:creator>
            <category>Education</category>
            <enclosure length="597327339" type="video/mp4" url="https://videos.luvina.net/download/videos/generate/84e24cc9-4a3f-40fc-a45d-a77f799ae6fd?videoFileIds=2233"/>
            <media:community>
                <media:statistics views="8"/>
            </media:community>
            <media:embed url="https://videos.luvina.net/videos/embed/hpJ9A916YAscuggebzaQTX"/>
            <media:player url="https://videos.luvina.net/w/hpJ9A916YAscuggebzaQTX"/>
            <media:group>
                <media:peerLink type="application/x-bittorrent" href="https://videos.luvina.net/lazy-static/torrents/975cb6c6-41dd-4f24-9a16-e117f3fab05b-1080.torrent" isDefault="false"/>
                <media:content type="video/mp4" medium="video" height="1080" fileSize="597327339" url="https://videos.luvina.net/static/web-videos/b5d3e197-7fa9-4327-bdff-510b4bd8804d-1080.mp4" framerate="30" duration="6271" isDefault="true"/>
            </media:group>
            <media:thumbnail url="https://videos.luvina.net/lazy-static/previews/77ba8d3e-7559-488d-a7d8-98f6a081e1fa.jpg"/>
            <media:thumbnail url="https://videos.luvina.net/lazy-static/thumbnails/3c660320-91af-4f84-80ad-7a349cc6202f.jpg"/>
            <media:rating>nonadult</media:rating>
            <media:title type="plain">Stanford CS229 I Machine Learning I Building Large Language Models (LLMs)</media:title>
            <media:description type="plain">For more information about Stanford's Artificial Intelligence programs visit: https://stanford.io/ai This lecture provides a concise overview of building a ChatGPT-like model, covering both pretraining (language modeling) and post-training (SFT...</media:description>
        </item>
    </channel>
</rss>