Stanford Seminar - Toward Scalable Autonomy - Aleksandra Faust

Brought by: YouTube

Overview

Reinforcement learning is a promising technique for training autonomous systems that perform complex tasks in the real world. However, training reinforcement learning agents is a tedious, human-in-the-loop process that requires heavy engineering and often yields suboptimal results. In this talk we explore two main directions toward scalable reinforcement learning. First, we discuss several methods for zero-shot sim2real transfer for mobile and aerial navigation, including visual navigation and fully autonomous navigation on a severely resource-constrained nano UAV. Second, we observe that the interaction between the human engineer and the agent under training is a decision-making process that the human agent performs, and we consequently automate the training by learning a decision-making policy. With that insight, we focus on zero-shot generalization and discuss learning RL loss functions and a compositional task curriculum that generalize to unseen tasks of evolving complexity. We show that, across different applications, learning-to-learn methods improve reinforcement learning agents' generalization and performance, and raise questions about nurture vs. nature in training autonomous systems.

Aleksandra Faust is a Senior Staff Research Scientist and Reinforcement Learning research team co-founder at Google Brain Research.

Syllabus

Introduction.
How to train goal reaching policies?
What about resource-constrained robots?
Training RL Agents is a (Sequential) Decision Making Problem.
Learning Loss Functions.
Evolving RL Algorithms.
Generative Curriculum for Compositional Tasks.
Autonomous web navigation.
Learning to Navigate Web.
Compositional Design of Environments (CODE).
Does it work in reality? RealED -- CODE w/ real primitives.
Better Generalization.
Autonomous Scalable Systems Future.

Taught by

Stanford Online

  • YouTube
  • Free
  • English
  • Certificate Not Available
  • Available at any time
  • All
  • N/A