Introduction.
3-Gram Model (Shannon 1951).
Recurrent Neural Nets (Sutskever et al 2011).
Big LSTM (Jozefowicz et al 2016).
Transformer (Liu and Saleh et al 2018).
GPT-2: Big Transformer (Radford et al 2019).
GPT-3: Very Big Transformer (Brown et al 2020).
GPT-3: Can Humans Detect Generated News Articles?
Why Unsupervised Learning?
Is There a Big Trove of Unlabeled Data?
Why Use Autoregressive Generative Models for Unsupervised Learning?
Unsupervised Sentiment Neuron (Radford et al 2017).
GPT-1 (Radford et al 2018).
Zero-Shot Reading Comprehension.
GPT-2: Zero-Shot Translation.
Language Model Metalearning.
GPT-3: Few Shot Arithmetic.
GPT-3: Few Shot Word Unscrambling.
GPT-3: General Few Shot Learning.
iGPT (Chen et al 2020): Can We Apply GPT to Images?
iGPT: Completions.
iGPT: Feature Learning.
Isn't Code Just Another Modality?
The HumanEval Dataset.
The Pass@k Metric (see the estimator sketch after this outline).
Codex: Training Details.
An Easy HumanEval Problem (pass@1 ≈ 0.9).
A Medium HumanEval Problem (pass@1 ≈ 0.17).
A Hard HumanEval Problem (pass@1 ≈ 0.005).
Calibrating Sampling Temperature for Pass@k (see the temperature sketch after this outline).
The Unreasonable Effectiveness of Sampling.
Can We Approximate Sampling Against an Oracle?
Main Figure.
Limitations.
Conclusion.
Acknowledgements.
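
For reference, the pass@k metric named in the outline works as follows: generate n ≥ k samples per problem, count the number c that pass the problem's unit tests, and report the probability that at least one of k samples drawn without replacement from the n is correct, averaged over problems. Below is a minimal sketch of the standard unbiased estimator, assuming NumPy is available; the function name pass_at_k is illustrative:

```python
import numpy as np

def pass_at_k(n: int, c: int, k: int) -> float:
    """Unbiased pass@k estimate: 1 - C(n-c, k) / C(n, k), i.e. the
    probability that a random size-k subset of the n samples contains
    at least one of the c correct ones. Computed as a running product
    to avoid overflow in the binomial coefficients.
    """
    if n - c < k:
        return 1.0  # too few incorrect samples to fill a size-k subset
    return 1.0 - np.prod(1.0 - k / np.arange(n - c + 1, n + 1))

# Example: 200 samples per problem with 13 passing.
print(pass_at_k(200, 13, 1))    # 0.065, i.e. exactly c/n for k = 1
print(pass_at_k(200, 13, 100))  # ~1.0: a correct sample almost surely appears
```

Averaging this quantity over all HumanEval problems gives the headline pass@k number; estimating from n > k samples rather than exactly k samples reduces the variance of the estimate.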
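
"Calibrating Sampling Temperature for Pass@k" refers to a practical trade-off: low temperatures concentrate probability on the model's single best completion and favor pass@1, while higher temperatures produce more diverse samples and tend to help pass@k at large k. A minimal sketch of temperature sampling over a logit vector, assuming NumPy; the names are illustrative:

```python
import numpy as np

def sample_with_temperature(logits: np.ndarray, temperature: float,
                            rng: np.random.Generator) -> int:
    """Draw one token index from softmax(logits / temperature).

    As temperature -> 0 this approaches greedy argmax (best for pass@1);
    larger temperatures flatten the distribution, trading per-sample
    accuracy for the diversity that helps pass@k at large k.
    """
    scaled = logits / temperature
    scaled -= scaled.max()          # subtract the max for a stable softmax
    probs = np.exp(scaled)
    probs /= probs.sum()
    return int(rng.choice(len(probs), p=probs))

rng = np.random.default_rng(0)
logits = np.array([2.0, 1.0, 0.5, -1.0])
print(sample_with_temperature(logits, 0.2, rng))  # nearly always token 0
print(sample_with_temperature(logits, 1.5, rng))  # noticeably more varied
```

One way to calibrate is simply to sweep a few temperatures for each k of interest and keep whichever maximizes the estimated pass@k on held-out problems.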