Stanford Seminar - Toward Scalable Autonomy - Aleksandra Faust

By: YouTube

Overview

Reinforcement learning is a promising technique for training autonomous systems that perform complex tasks in the real world. However, training reinforcement learning agents is a tedious, human-in-the-loop process that requires heavy engineering and often yields suboptimal results. In this talk we explore two main directions toward scalable reinforcement learning. First, we discuss several methods for zero-shot sim2real transfer for mobile and aerial navigation, including visual navigation and fully autonomous navigation on a severely resource-constrained nano UAV. Second, we observe that the interaction between the human engineer and the agent under training is itself a decision-making process performed by the human, and we consequently automate training by learning a decision-making policy. With that insight, we focus on zero-shot generalization and discuss learning RL loss functions and a compositional task curriculum that generalize to unseen tasks of evolving complexity. We show that, across different applications, learning-to-learn methods improve the generalization and performance of reinforcement learning agents, and raise questions about nurture vs. nature in training autonomous systems.

Aleksandra Faust is a Senior Staff Research Scientist and Reinforcement Learning research team co-founder at Google Brain Research.

Syllabus

Introduction.
How to train goal-reaching policies?
What about resource-constrained robots?
Training RL Agents is a (Sequential) Decision Making Problem.
Learning Loss Functions.
Evolving RL Algorithms.
Generative Curriculum for Compositional Tasks.
Autonomous web navigation.
Learning to Navigate Web.
Compositional Design of Environments (CODE).
Does it work in reality? RealED -- CODE w/ real primitives.
Better Generalization.
Autonomous Scalable Systems Future.

Taught by

Stanford Online

  • YouTube
  • Free
  • English
  • Certificate available
  • Available anytime
  • All levels
  • N/A