Sadegh Talebi
Tenure Track Assistant Professor
Machine Learning
Universitetsparken 1
2100 København Ø
Most downloads
- 57 downloads (Published)
Adversarial Bandits with Corruptions
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
- 46 downloads (Published)
Tightening Exploration in Upper Confidence Reinforcement Learning
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review
- 30 downloads (Published)
Scaling Up Q-Learning via Exploiting State–Action Equivalence
Research output: Contribution to journal › Journal article › Research › peer-review
- 22 downloads (Published)
Bandit-Based Power Control in Full-Duplex Cooperative Relay Networks with Strict-Sense Stationary and Non-Stationary Wireless Communication Channels
Research output: Contribution to journal › Journal article › Research › peer-review
- 20 downloads (Published)
Improved Exploration in Factored Average-Reward MDPs
Research output: Chapter in Book/Report/Conference proceeding › Article in proceedings › Research › peer-review