CS25 I Stanford Seminar - Transformers in Language: The development of GPT Models including GPT3

Brought by: YouTube

Overview

This course introduces unsupervised learning as an approach to natural language processing. It surveys methods for building generative models of text, from 3-gram models and recurrent neural nets through the big LSTM and the Transformer to GPT-2 and GPT-3, and looks at what such models learn about language, including the Unsupervised Sentiment Neuron and zero-shot reading comprehension. It then asks whether GPT-style models can be applied to images (IGPT) and to code, covering the HumanEval dataset and the pass@k metric, which measures how often at least one of k generated code samples solves a problem. Finally, it covers approximating sampling against an oracle and closes with the limitations of the approach.
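As a rough illustration of the simplest model named in the overview, a 3-gram model predicts the next token from the two tokens before it. The sketch below is a minimal version written for this listing, not code from the seminar; the function names `train_trigram` and `generate` and the toy corpus are assumptions chosen for illustration.

```python
import random
from collections import Counter, defaultdict


def train_trigram(tokens):
    """Count how often each token follows each pair of preceding tokens."""
    counts = defaultdict(Counter)
    for a, b, c in zip(tokens, tokens[1:], tokens[2:]):
        counts[(a, b)][c] += 1
    return counts


def generate(counts, seed, length, rng):
    """Extend a two-token seed by sampling up to `length` more tokens."""
    out = list(seed)
    for _ in range(length):
        followers = counts.get((out[-2], out[-1]))
        if not followers:
            break  # unseen context: this sketch has no smoothing or backoff
        words, weights = zip(*followers.items())
        out.append(rng.choices(words, weights=weights)[0])
    return out


# Hypothetical toy corpus, used only to show the call pattern.
corpus = "the cat sat on the mat".split()
model = train_trigram(corpus)
sample = generate(model, ("the", "cat"), 3, random.Random(0))
```

Every two-token context in this toy corpus has exactly one continuation, so generation here is deterministic; on real text the sampling step is what makes the model generative.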

Syllabus

Introduction.
3-Gram Model (Shannon 1951).
Recurrent Neural Nets (Sutskever et al 2011).
Big LSTM (Jozefowicz et al 2016).
Transformer (Liu and Saleh et al 2018).
GPT-2: Big Transformer (Radford et al 2019).
GPT-3: Very Big Transformer (Brown et al 2020).
GPT-3: Can Humans Detect Generated News Articles?
Why Unsupervised Learning?
Is there a Big Trove of Unlabeled Data?
Why Use Autoregressive Generative Models for Unsupervised Learning?
Unsupervised Sentiment Neuron (Radford et al 2017).
(Radford et al 2018).
Zero-Shot Reading Comprehension.
GPT-2: Zero-Shot Translation.
Language Model Metalearning.
GPT-3: Few Shot Arithmetic.
GPT-3: Few Shot Word Unscrambling.
GPT-3: General Few Shot Learning.
IGPT (Chen et al 2020): Can we apply GPT to images?
IGPT: Completions.
IGPT: Feature Learning.
Isn't Code Just Another Modality?
The HumanEval Dataset.
The pass@k Metric.
Codex: Training Details.
An Easy HumanEval Problem (pass@1 ≈ 0.9).
A Medium HumanEval Problem (pass@1 ≈ 0.17).
A Hard HumanEval Problem (pass@1 ≈ 0.005).
Calibrating Sampling Temperature for Pass@k.
The Unreasonable Effectiveness of Sampling.
Can We Approximate Sampling Against an Oracle?
Main Figure.
Limitations.
Conclusion.
Acknowledgements.
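
The pass@k entries in the syllabus can be made concrete with a short sketch. This listing does not give a formula, so the code below assumes the unbiased estimator from the Codex paper (Chen et al 2021): generate n samples per problem, count the c samples that pass the unit tests, and estimate pass@k as 1 - C(n-c, k) / C(n, k). The function name `pass_at_k` and the example numbers are illustrative assumptions, not values from the seminar.

```python
from math import comb


def pass_at_k(n: int, c: int, k: int) -> float:
    """Estimate pass@k from n generated samples, c of which pass the tests.

    Assumes the estimator 1 - C(n - c, k) / C(n, k): the probability that
    a random subset of k of the n samples contains at least one that passes.
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    if n - c < k:
        return 1.0  # too few failing samples to fill a subset of size k
    return 1.0 - comb(n - c, k) / comb(n, k)


# Hypothetical counts: 10 samples for one problem, 9 of them correct.
easy = pass_at_k(n=10, c=9, k=1)
```

With k = 1 the estimate reduces to the fraction of correct samples, c / n. Larger k rewards a model that solves a problem only occasionally, which is why sampling many candidates can raise the score so much.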

Taught by

Stanford Online

  • YouTube
  • Free
  • English
  • Certificate Not Available
  • Available at any time
  • All
  • N/A