Convex Optimization and Reinforcement Learning

News

A new paradigm for AI: How ‘thinking as optimization’ leads to better general-purpose models

A new AI model learns to "think" longer on hard problems, achieving more robust reasoning and better generalization to novel, unseen tasks.

IEEE3d

Energy-Efficient Design for a NOMA Assisted STAR-RIS Network With Deep ...

In this article, an energy efficiency (EE) optimization problem for non-orthogonal multiple access (NOMA) assisted STAR-RIS downlink network is investigated. Due to the fractional form of the ...

IEEE2d

TVDO: Tchebycheff Value-Decomposition Optimization for Multiagent ...

In cooperative multiagent reinforcement learning (MARL), centralized training with decentralized execution (CTDE) has recently attracted more attention due to the physical demand. However, the most ...

GitHub4d

chauncygu/Safe-Reinforcement-Learning-Baselines - GitHub

Reinforcement learning with convex constraints, Paper, Code (Accepted by NeurIPS 2019) Reward constrained policy optimization, Paper, Not Find Code (Accepted by ICLR 2019) ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results