Introduction to Reinforcement Learning

By: Coursera

Overview

The goal of «Intro to Reinforcement Learning» is in its name: to introduce students to reinforcement learning, a prominent area of modern research in artificial intelligence. Reinforcement learning differs substantially from both supervised and unsupervised learning and is closer to how humans actually learn.
Students will learn both the theoretical core of reinforcement learning (RL) and recent practical RL methods. Most importantly, they will learn how to apply these methods to practical problems. Over six weeks, students will be guided through the basics of RL: the essential theory, value-based methods (such as SARSA and Q-learning), policy-based algorithms, and methods designed to solve the optimal exploration problem. In addition to algorithms and theory, the course presents practical tips and tricks for stabilizing learning and shows how to apply the methods to large-scale problems with deep neural networks.

Taught by

Pavel Shvechikov and Alexander Panin

  • Coursera
  • Free
  • English
  • Certificate available
  • Specific days
  • Everyone
  • N/A