Reinforcement Learning Explained

Brought by: edX

Overview

Reinforcement Learning (RL) is an area of machine learning, where an agent learns by interacting with its environment to achieve a goal.

In this course, you will be introduced to the world of reinforcement learning. You will learn how to frame reinforcement learning problems and start tackling classic examples like news recommendation, learning to navigate in a grid-world, and balancing a cart-pole.

You will explore the basic algorithms from multi-armed bandits, dynamic programming, TD (temporal difference) learning, and progress towards larger state space using function approximation, in particular using deep learning. You will also learn about algorithms that focus on searching the best policy with policy gradient and actor critic methods. Along the way, you will get introduced to Project Malmo, a platform for Artificial Intelligence experimentation and research built on top of the Minecraft game.

edX offers financial assistance for learners who want to earn Verified Certificates but who may not be able to pay the fee. To apply for financial assistance, enroll in the course, then follow this link to complete an application for assistance.

Note: These courses will retire in June. Please enroll only if you are able to finish your coursework in time.

Taught by

Jonathan Sanito, Roland Fernandez, Matthew Hausknecht, Katja Hofmann, Kenneth Tran and Adith Swaminathan

Reinforcement Learning Explained
Go to course

Reinforcement Learning Explained

Brought by: edX

  • edX
  • Free
  • English
  • Certificate Not Available
  • Certain days
  • All
  • N/A
8.1.2PHP Version312msRequest Duration2MBMemory UsageGET en/courses/{slug}Route
    • Booting (202ms)
    • Application (109ms)
    • 1 x Booting (64.84%)
      202.29ms
      1 x Application (34.93%)
      108.99ms
      14 templates were rendered
      • public.courses.show (resources/views/public/courses/show.blade.php)3bladefile
        Params
        0
        course
        1
        links
        2
        config
      • public.courses.partials.breadcrumbs (resources/views/public/courses/partials/breadcrumbs.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.heading (resources/views/public/courses/partials/heading.blade.php)7bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        classes
      • public.courses.partials.details (resources/views/public/courses/partials/details.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.breadcrumbs (resources/views/public/courses/partials/breadcrumbs.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.courses.partials.heading (resources/views/public/courses/partials/heading.blade.php)7bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        classes
      • public.layouts.main (resources/views/public/layouts/main.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.layouts.partials.meta (resources/views/public/layouts/partials/meta.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.layouts.partials.navbar (resources/views/public/layouts/partials/navbar.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.auth.profile.partials.links (resources/views/public/auth/profile/partials/links.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.auth.profile.partials.link (resources/views/public/auth/profile/partials/link.blade.php)8bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
        6
        route
        7
        title
      • public.layouts.partials.flash-session (resources/views/public/layouts/partials/flash-session.blade.php)6bladefile
        Params
        0
        __env
        1
        app
        2
        errors
        3
        course
        4
        links
        5
        config
      uri
      GET en/courses/{slug}
      middleware
      web, localize:en
      controller
      App\Http\Controllers\CourseController@show
      as
      en.courses.show
      namespace
      prefix
      /en
      where
      file
      app/Http/Controllers/CourseController.php:17-35
      7 statements were executed7.34ms
      • select * from `courses` where `slug_en` = 'reinforcement-learning-explained' limit 1
        5.78ms/app/Http/Controllers/CourseController.php:20corspedia
        Metadata
        Bindings
        • 0. reinforcement-learning-explained
        Backtrace
        • 17. /app/Http/Controllers/CourseController.php:20
        • 18. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 19. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 20. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • update `courses` set `visitors` = `visitors` + 1, `courses`.`updated_at` = '2025-04-12 19:33:23' where `id` = 2503
        590μs/app/Http/Controllers/CourseController.php:21corspedia
        Metadata
        Bindings
        • 0. 2025-04-12 19:33:23
        • 1. 2503
        Backtrace
        • 17. /app/Http/Controllers/CourseController.php:21
        • 18. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 19. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 20. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select `id`, `name_en`, `name_ar`, `topic_id`, `slug_en`, `slug_ar` from `subjects` where `subjects`.`id` in (4)
        170μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 20. /app/Http/Controllers/CourseController.php:23
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 22. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 23. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 24. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select `id`, `name_en`, `name_ar`, `slug_en`, `slug_ar` from `topics` where `topics`.`id` in (1)
        140μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 25. /app/Http/Controllers/CourseController.php:23
        • 26. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 27. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 28. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 29. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select * from `institutions` where `institutions`.`id` in (62) and `institutions`.`deleted_at` is null
        160μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 20. /app/Http/Controllers/CourseController.php:23
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 22. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 23. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 24. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select * from `providers` where `providers`.`id` in (1) and `providers`.`deleted_at` is null
        170μs/app/Http/Controllers/CourseController.php:23corspedia
        Metadata
        Backtrace
        • 20. /app/Http/Controllers/CourseController.php:23
        • 21. /vendor/laravel/framework/src/Illuminate/Routing/Controller.php:54
        • 22. /vendor/laravel/framework/src/Illuminate/Routing/ControllerDispatcher.php:43
        • 23. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:260
        • 24. /vendor/laravel/framework/src/Illuminate/Routing/Route.php:205
      • select * from `html_files` where `html_files`.`id` = 2494 limit 1
        330μs/app/Models/Course.php:84corspedia
        Metadata
        Bindings
        • 0. 2494
        Backtrace
        • 21. /app/Models/Course.php:84
        • 28. view::public.courses.show:29
        • 30. /vendor/laravel/framework/src/Illuminate/Filesystem/Filesystem.php:125
        • 31. /vendor/laravel/framework/src/Illuminate/View/Engines/PhpEngine.php:58
        • 32. /vendor/laravel/framework/src/Illuminate/View/Engines/CompilerEngine.php:72
      App\Models\HtmlFile
      1
      App\Models\Provider
      1
      App\Models\Institution
      1
      App\Models\Topic
      1
      App\Models\Subject
      1
      App\Models\Course
      1
        _token
        56L2y0JxFXQciWY2hflSWsa9n87UfTV5dpdqo9g4
        locale
        en
        _previous
        array:1 [ "url" => "https://www.corspedia.com/en/courses/reinforcement-learning-explained" ]
        _flash
        array:2 [ "old" => [] "new" => [] ]
        PHPDEBUGBAR_STACK_DATA
        []
        path_info
        /en/courses/reinforcement-learning-explained
        status_code
        200
        
        status_text
        OK
        format
        html
        content_type
        text/html; charset=UTF-8
        request_query
        []
        
        request_request
        []
        
        request_headers
        0 of 0
        array:24 [ "sec-ch-ua-mobile" => array:1 [ 0 => "?0" ] "sec-ch-ua" => array:1 [ 0 => ""HeadlessChrome";v="129", "Not=A?Brand";v="8", "Chromium";v="129"" ] "cache-control" => array:1 [ 0 => "no-cache" ] "pragma" => array:1 [ 0 => "no-cache" ] "upgrade-insecure-requests" => array:1 [ 0 => "1" ] "priority" => array:1 [ 0 => "u=0, i" ] "user-agent" => array:1 [ 0 => "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)" ] "cf-ipcountry" => array:1 [ 0 => "US" ] "cf-connecting-ip" => array:1 [ 0 => "3.144.119.207" ] "accept" => array:1 [ 0 => "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7" ] "sec-fetch-site" => array:1 [ 0 => "none" ] "cf-visitor" => array:1 [ 0 => "{"scheme":"https"}" ] "sec-fetch-mode" => array:1 [ 0 => "navigate" ] "sec-fetch-user" => array:1 [ 0 => "?1" ] "x-forwarded-proto" => array:1 [ 0 => "https" ] "cdn-loop" => array:1 [ 0 => "cloudflare; loops=1" ] "accept-encoding" => array:1 [ 0 => "gzip, br" ] "sec-fetch-dest" => array:1 [ 0 => "document" ] "sec-ch-ua-platform" => array:1 [ 0 => ""Windows"" ] "x-forwarded-for" => array:1 [ 0 => "3.144.119.207" ] "cf-ray" => array:1 [ 0 => "92f527b41ff22310-ORD" ] "host" => array:1 [ 0 => "www.corspedia.com" ] "content-length" => array:1 [ 0 => "" ] "content-type" => array:1 [ 0 => "" ] ]
        request_server
        0 of 0
        array:50 [ "USER" => "www-data" "HOME" => "/var/www" "HTTP_SEC_CH_UA_MOBILE" => "?0" "HTTP_SEC_CH_UA" => ""HeadlessChrome";v="129", "Not=A?Brand";v="8", "Chromium";v="129"" "HTTP_CACHE_CONTROL" => "no-cache" "HTTP_PRAGMA" => "no-cache" "HTTP_UPGRADE_INSECURE_REQUESTS" => "1" "HTTP_PRIORITY" => "u=0, i" "HTTP_USER_AGENT" => "Mozilla/5.0 AppleWebKit/537.36 (KHTML, like Gecko; compatible; ClaudeBot/1.0; +claudebot@anthropic.com)" "HTTP_CF_IPCOUNTRY" => "US" "HTTP_CF_CONNECTING_IP" => "3.144.119.207" "HTTP_ACCEPT" => "text/html,application/xhtml+xml,application/xml;q=0.9,image/avif,image/webp,image/apng,*/*;q=0.8,application/signed-exchange;v=b3;q=0.7" "HTTP_SEC_FETCH_SITE" => "none" "HTTP_CF_VISITOR" => "{"scheme":"https"}" "HTTP_SEC_FETCH_MODE" => "navigate" "HTTP_SEC_FETCH_USER" => "?1" "HTTP_X_FORWARDED_PROTO" => "https" "HTTP_CDN_LOOP" => "cloudflare; loops=1" "HTTP_ACCEPT_ENCODING" => "gzip, br" "HTTP_SEC_FETCH_DEST" => "document" "HTTP_SEC_CH_UA_PLATFORM" => ""Windows"" "HTTP_X_FORWARDED_FOR" => "3.144.119.207" "HTTP_CF_RAY" => "92f527b41ff22310-ORD" "HTTP_HOST" => "www.corspedia.com" "REDIRECT_STATUS" => "200" "SERVER_NAME" => "corspedia.com" "SERVER_PORT" => "443" "SERVER_ADDR" => "141.95.147.152" "REMOTE_USER" => "" "REMOTE_PORT" => "33396" "REMOTE_ADDR" => "172.71.254.143" "SERVER_SOFTWARE" => "nginx/1.18.0" "GATEWAY_INTERFACE" => "CGI/1.1" "HTTPS" => "on" "REQUEST_SCHEME" => "https" "SERVER_PROTOCOL" => "HTTP/2.0" "DOCUMENT_ROOT" => "/var/www/corspedia/public" "DOCUMENT_URI" => "/index.php" "REQUEST_URI" => "/en/courses/reinforcement-learning-explained" "SCRIPT_NAME" => "/index.php" "CONTENT_LENGTH" => "" "CONTENT_TYPE" => "" "REQUEST_METHOD" => "GET" "QUERY_STRING" => "" "SCRIPT_FILENAME" => "/var/www/corspedia/public/index.php" "PATH_INFO" => "" "FCGI_ROLE" => "RESPONDER" "PHP_SELF" => "/index.php" "REQUEST_TIME_FLOAT" => 1744486403.4723 "REQUEST_TIME" => 1744486403 ]
        request_cookies
        []
        
        response_headers
        0 of 0
        array:5 [ "content-type" => array:1 [ 0 => "text/html; charset=UTF-8" ] "cache-control" => array:1 [ 0 => "no-cache, private" ] "date" => array:1 [ 0 => "Sat, 12 Apr 2025 19:33:23 GMT" ] "set-cookie" => array:2 [ 0 => "XSRF-TOKEN=eyJpdiI6ImZ3NW1Ia05yTGV3Y2xMa0lSOHg5M2c9PSIsInZhbHVlIjoiWC9wMXVNTDdTQVROOG1NS3A4cEp2UUU0clhSK1ZuTktmQmNLejd6QWxDMWtUQUpWcEM0TnhHN3FVbTVYSVYxWURwZGhFcGJnNjljb0IvTUM4TzVvRURzdnFQN3QzQmIrZ0o4dVdaenBiYk5JSmtJNlpOZXRVTGJFK0xvem5pSEMiLCJtYWMiOiJiNTYzY2UxZWM5NWU1YTUyMDZmODAyMjRkZWU0OWFmNzQxMjQ1YWUzNTUyM2VlZmZmYmY2NmRhMmE4NTRiYzE5IiwidGFnIjoiIn0%3D; expires=Sat, 12 Apr 2025 21:33:23 GMT; Max-Age=7200; path=/; samesite=laxXSRF-TOKEN=eyJpdiI6ImZ3NW1Ia05yTGV3Y2xMa0lSOHg5M2c9PSIsInZhbHVlIjoiWC9wMXVNTDdTQVROOG1NS3A4cEp2UUU0clhSK1ZuTktmQmNLejd6QWxDMWtUQUpWcEM0TnhHN3FVbTVYSVYxWURwZGhFc" 1 => "laravel_session=eyJpdiI6IlNhaHNmYjl6UXFrVHBUNkUxc25uQmc9PSIsInZhbHVlIjoiSXhXaldBU291SEJxUDZNck9Ud3hhWFZkMWhvZk9nczB2aWlFRE5zQXhHTmhDMXQrbEx4L0JWTjJSVmRCR0ErMVhsOHFkSTRIbTNYVVVCekxkR0t3S1E2aytKWUxKbnlqenZRUGxKMkVxeUFIZDdvMnNsQTcwdUlscVZyNGY1M0YiLCJtYWMiOiIyZTkwYjI2MWMwNDViOTQxMmFlYzYxMzg0M2EzZTM2OTIxYjBkMzk4Y2I4NTI0ZDg3ZmYzMGE3ZmUzNTJlNTA2IiwidGFnIjoiIn0%3D; expires=Sat, 12 Apr 2025 21:33:23 GMT; Max-Age=7200; path=/; httponly; samesite=laxlaravel_session=eyJpdiI6IlNhaHNmYjl6UXFrVHBUNkUxc25uQmc9PSIsInZhbHVlIjoiSXhXaldBU291SEJxUDZNck9Ud3hhWFZkMWhvZk9nczB2aWlFRE5zQXhHTmhDMXQrbEx4L0JWTjJSVmRCR0ErMVhs" ] "Set-Cookie" => array:2 [ 0 => "XSRF-TOKEN=eyJpdiI6ImZ3NW1Ia05yTGV3Y2xMa0lSOHg5M2c9PSIsInZhbHVlIjoiWC9wMXVNTDdTQVROOG1NS3A4cEp2UUU0clhSK1ZuTktmQmNLejd6QWxDMWtUQUpWcEM0TnhHN3FVbTVYSVYxWURwZGhFcGJnNjljb0IvTUM4TzVvRURzdnFQN3QzQmIrZ0o4dVdaenBiYk5JSmtJNlpOZXRVTGJFK0xvem5pSEMiLCJtYWMiOiJiNTYzY2UxZWM5NWU1YTUyMDZmODAyMjRkZWU0OWFmNzQxMjQ1YWUzNTUyM2VlZmZmYmY2NmRhMmE4NTRiYzE5IiwidGFnIjoiIn0%3D; expires=Sat, 12-Apr-2025 21:33:23 GMT; path=/XSRF-TOKEN=eyJpdiI6ImZ3NW1Ia05yTGV3Y2xMa0lSOHg5M2c9PSIsInZhbHVlIjoiWC9wMXVNTDdTQVROOG1NS3A4cEp2UUU0clhSK1ZuTktmQmNLejd6QWxDMWtUQUpWcEM0TnhHN3FVbTVYSVYxWURwZGhFc" 1 => "laravel_session=eyJpdiI6IlNhaHNmYjl6UXFrVHBUNkUxc25uQmc9PSIsInZhbHVlIjoiSXhXaldBU291SEJxUDZNck9Ud3hhWFZkMWhvZk9nczB2aWlFRE5zQXhHTmhDMXQrbEx4L0JWTjJSVmRCR0ErMVhsOHFkSTRIbTNYVVVCekxkR0t3S1E2aytKWUxKbnlqenZRUGxKMkVxeUFIZDdvMnNsQTcwdUlscVZyNGY1M0YiLCJtYWMiOiIyZTkwYjI2MWMwNDViOTQxMmFlYzYxMzg0M2EzZTM2OTIxYjBkMzk4Y2I4NTI0ZDg3ZmYzMGE3ZmUzNTJlNTA2IiwidGFnIjoiIn0%3D; expires=Sat, 12-Apr-2025 21:33:23 GMT; path=/; httponlylaravel_session=eyJpdiI6IlNhaHNmYjl6UXFrVHBUNkUxc25uQmc9PSIsInZhbHVlIjoiSXhXaldBU291SEJxUDZNck9Ud3hhWFZkMWhvZk9nczB2aWlFRE5zQXhHTmhDMXQrbEx4L0JWTjJSVmRCR0ErMVhs" ] ]
        session_attributes
        0 of 0
        array:5 [ "_token" => "56L2y0JxFXQciWY2hflSWsa9n87UfTV5dpdqo9g4" "locale" => "en" "_previous" => array:1 [ "url" => "https://www.corspedia.com/en/courses/reinforcement-learning-explained" ] "_flash" => array:2 [ "old" => [] "new" => [] ] "PHPDEBUGBAR_STACK_DATA" => [] ]