#reinforcementLearning hashtag - Bluesky

EkasCloud – Personalized Training Platform

@ekascloud.bsky.social

11 hours ago

Post from EkasCloud Online Courses - YouTube Reinforcement Learning in Everyday Apps — The Silent Revolution #ReinforcementLearning #MachineLearning #ArtificialIntelligence #AIApps #SmartTechnology #Tec...

Reinforcement Learning in Everyday Apps — The Silent Revolution
www.youtube.com/post/UgkxMzf...
#ReinforcementLearning #MachineLearning #ArtificialIntelligence #AIApps #SmartTechnology #TechInnovation #FutureTech #DataScience

0 0 0 0

Daniel Bösch

@encrux.bsky.social

4 days ago

Just set up my own blog and wrote the first post about teaching AI to play my favorite childhood video game: boesch.dev/posts/ddnet-... #machinelearning #AI #reinforcementlearning

1 0 0 0

Rationality & Society

@rss-journal.bsky.social

4 days ago

Sven Banisch, @eckolb.bsky.social et al. built an #ABM where #ReinforcementLearning users choose #SocialMedia platforms based on social approval and #diversity. They show a #SocialDilemma: #EchoChambers can arise even when users prefer diversity, making everyone worse off.

🔗 doi.org/10.1177/1043...

2 2 0 0

ByteJournal

@bytejournal.bsky.social

4 days ago

"Unlock the secrets of autonomous systems with Explainable RL 🤖! Boost performance, trust, and accou

"Unlock the secrets of autonomous systems with Explainable RL 🤖! Boost performance, trust, and accountability. Learn more! #ExplainableAI #ReinforcementLearning #AI"

🔗 bytejournal.online/blog/explainable-reinfor...

0 0 0 0

SiegeLord

@siegelordex.bsky.social

1 week ago

Got the robot to lift its feet in sim. Was going to try it on the real robot, but it stopped detecting my IMU after I removed an unused display: turns out that display was the only thing with a pull-up resistor to make the I2C bus work.

#robotics #reinforcementlearning

0 0 0 0

Gerrit Eicker

@eicker.bsky.social

1 week ago

Chinese #AIstartup #MiniMax has released its new proprietary LLM, M2.7, which is designed to power #AIagents and third-party tools. The model is notable for its #selfevolving capabilities, handling 30-50% of its own #reinforcementlearning workflow.…

0 0 0 0

Transactions on Artificial Intelligence

@transaai.bsky.social

1 week ago

#newpaper #TAI #ReinforcementLearning Combining constraint-aware offline learning with runtime safety filtering provides a practical pathway toward safe and effective RL-based clinical decision support systems. Please view the full article at www.sciltp.com/journals/tai...

0 0 0 0

AI Daily Post

@aidailypost.com

1 week ago

MiniMax M2.7 just turned RL research into a co‑pilot—self‑evolving, polyglot code, and handling 30‑50% of the workflow. Could this be the first step toward GPT‑5‑level automation? Dive in to see how it reshapes ML engineering. #MiniMaxM27 #SelfEvolvingAI #ReinforcementLearning

🔗

2 0 0 0

Shawn Hymel

@shawnhymel.bsky.social

1 week ago

I’m considering making a video series that teaches #ReinforcementLearning using a 2-wheel balancing bot. Would you be interested in learning that? If you've done RL, what frameworks do you recommend?
👇
shawnhymel.com/3219/an-idea...

#edgeAI #AI #embedded #robotics #education

8 1 2 0

Raphaël Fonteneau

@raphfont.bsky.social

1 week ago

Today, we welcome Dr. Ir. @adrienbolland.bsky.social in the context of the Reinforcement Learning class for a lesson about policy gradient methods. Many thanks to Adrien for sharing his knowledge about these methods that have enabled many successful implementations! #ReinforcementLearning

3 1 0 0

Markus Wulfmeier

@mwulfmeier.bsky.social

2 weeks ago

Review request:
As usual for the time of year, I'll be looking for #IROS2026 reviewers. Highly interesting stack of papers on #ReinforcementLearning #Sim2Real #RewardLearning #LLMs #DataEfficiency #RoboticManipulation

Reach out with your ID or papercept registered mail address and background.

1 0 0 0

Claas Voelcker

@cvoelcker.bsky.social

2 weeks ago

cookie monster is sitting at a table with a tray of food and the words choices written on it Alt: cookie monster is sitting at a table with a tray of food and the words choices written on it

Following advice by the always-wise @eugenevinitsky.bsky.social , I am trying to get back into the habit of blogging (again) ✏️!

Featuring today's post: How to pick an RL algorithm for your problem cvoelcker.de/blog/2026/ch... Please share and give feedback!

#reinforcementlearning

30 4 2 2

allPhoto Bangkok / Advanced Ventures

@allphotobangkok.bsky.social

2 weeks ago

These aren’t AI firms, they’re defense contractors. We can’t let them hide behind their models From Gaza to Iran, the pattern is the same: precision weapons, chosen blindness, and dead children. The cost of failing to regulate AI warfare is already too high

AI warfare's cost is high with precision weapons, chosen blindness, and civilian casualties. The 'fog procedure' exemplifies this dangerous trend.
www.theguardian.com/us-news/ng-interactive/2...
#AI #AIethics #MachineLearning #ReinforcementLearning #AIS...

2 0 0 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

2 weeks ago

✨Two single author papers accepted to ICLR 2026!✨

Truly excited to present these results at #ICLR2026 !
@iclr-conf.bsky.social #ICLR26 #ReinforcementLearning

0 1 0 0

FierceMind

@ostroumni.bsky.social

2 weeks ago

🚀 Google discovered:

AI agents learn to COOPERATE on their own when trained against diverse and unpredictable opponents!

#AI #GoogleAI #MultiAgent #ReinforcementLearning #LLM #AISystems

1 0 0 0

FierceMind

@ostroumni.bsky.social

2 weeks ago

🚀 Google discovered:

AI agents learn to COOPERATE on their own when trained against diverse and unpredictable opponents!

#AI #GoogleAI #MultiAgent #ReinforcementLearning #LLM #AISystems

1 0 0 0

AI Daily Post

@aidailypost.com

2 weeks ago

Google’s new research shows AI agents can team up and outsmart unpredictable opponents using standard RL and decentralized training. Curious how GRPO drives cooperative strategies? Dive in! #AIAgents #ReinforcementLearning #MultiAgentLearning

🔗 aidailypost.com/news/google-...

0 0 0 0

Awesome Agents

@awesomeagents.bsky.social

2 weeks ago

16 Open-Source RL Libraries, One Shared GPU Bottleneck A Hugging Face survey of 16 open-source reinforcement learning libraries finds the entire ecosystem has converged on async disaggregated training to fix a single brutal bottleneck: GPU idle time during long rollouts.

16 Open-Source RL Libraries, One Shared GPU Bottleneck

awesomeagents.ai/news/huggingface-async-r...

#HuggingFace #ReinforcementLearning #OpenSource

1 0 0 0

Alexis Kirke

@alexiskirke.bsky.social

2 weeks ago

Image

I discovered this thought-provoking paper about RoboPocket - a new way to boost robot learning with real-time feedback from your phone. No fancy gear needed! See link below. #robotics #reinforcementlearning #humantech
https://arxiv.org/abs/2603.05504

0 0 0 0

David @ InnoVirtuoso

@innovirtuoso.bsky.social

3 weeks ago

🚀 Check out "The AI That Learned to Play with Itself" — researchers let a neural network play a game against copies of itself! 🤖💥 It discovered strategies humans hadn’t thought of! Talk about self-improvement! 🔄 #AI #ReinforcementLearning #MindBlown

3 0 0 0

Winbuzzer

@winbuzzer.com

3 weeks ago

winbuzzer.com/2026/03/05/d...

New Databricks KARL RAG Agent Promises 33% Cost Reduction vs. Claude Opus 4.6

#AI #Databricks #DatabricksKARL #Anthropic #Claude #GenerativeAI #MachineLearning #AIAgents #EnterpriseAI #RAG #KARL #ReinforcementLearning

0 0 0 0

Winbuzzer

@winbuzzer.com

3 weeks ago

OpenAI VP Joins Anthropic After Pentagon Deal Backlash OpenAI VP Max Schwarzer has joined Anthropic, citing trusted colleagues and shared values, hours after backlash over OpenAI's Pentagon military AI deal.

winbuzzer.com/2026/03/06/o...

OpenAI's Post Training Lead Max Schwarzer Joins Anthropic After Pentagon Deal Backlash

#AI #ChatGPT #Anthropic #Claude #OpenAI #MaxSchwarzer #Pentagon #ReinforcementLearning

0 0 0 0

Wahnsinnwissen.de

@wahnsinnwissen.bsky.social

1 month ago

Richard S. #KünstlicheIntelligenz #LernenausErfahrung #ReinforcementLearning #RichardSutton #Sprachmodelle
wahnsinnwissen.de/?p=1124

0 0 0 0

Awesome Agents

@awesomeagents.bsky.social

1 month ago

OpenClaw-RL Lets You Train a Personal AI Agent Just by Talking to It Gen-Verse's new open-source framework uses asynchronous reinforcement learning to personalize LLMs through natural conversation - no labeling, no datasets, just feedback.

OpenClaw-RL Lets You Train a Personal AI Agent Just by Talking to It

awesomeagents.ai/news/openclaw-rl-persona...

#Openclaw #ReinforcementLearning #OpenSource

2 0 2 0

Ezgi Korkmaz

@ezgikorkmaz.bsky.social

1 month ago

✨Two single author papers accepted to ICLR 2026!✨

Truly excited to present these results at #ICLR2026 !

@iclr-conf.bsky.social #ICLR26 #DeepRL #ICLR #ReinforcementLearning

0 0 0 0

Why We Do What We Do podcast

@wwdwwdpodcast.bsky.social

1 month ago

Mini: Rock, Paper, Scissors Rock, Paper, Scissors, Shoot! This 5-second two-player game to settle disputes began in ancient China and quickly spread throughout the world. Some research has also attempted to use game theory to understand decisions in this game and were surprised by the results, but we weren't! You'll see why. Join our supporters' club: www.patreon.com/wwdwwpodcast Links and References: - https://www.annarahmanan.com/the-history-of-rock-paper-scissors-game - https://www.playworks.org/game-library/ro-sham-bo-or-rock-paper-scissors/ - https://www.tandfonline.com/doi/abs/10.1080/00107514.2015.1026556

📣 New Podcast! "Mini: Rock, Paper, Scissors" on @Spreaker #ancientchina #cyclicalcompetition #dei #gametheory #learningtheory #operantlearning #psychology #reinforcementlearning #rockpaperscissors #roshambo #rps #science #shoushling #skepticism #whywedowhatwedo #wwdwwdpodcast

0 0 0 0

SiegeLord

@siegelordex.bsky.social

1 month ago

More improvements to my AI locomotion. This time I trained it using a randomly bumpy terrain, random variation on the robot weight etc. The next step is, testing it on the real robot!

#robot #machinelearning #reinforcementlearning

1 0 0 0

nothing to see here folks

@mcmahon.bsky.social

1 month ago

Spent the weekend trying to learn about #ReinforcementLearning by training an agent to play Xs & Os / Tic-Tac-Toe.
An unexpected side effect is that by playing dozens of games of Xs & Os over a 48-hour period, I have Stockholm syndromed myself into believing that it is the greatest game of all time

1 0 1 1

Agerico M. De Villa

@propjerry.bsky.social

1 month ago

Boundary and handshake between Philosophy of Science, on one hand, and Science and Engineering (Geometric Manifold Rectification), on the other hand: Testing Bridge360 Metatheory Model v20.4 Handshake Version

agericomontecillodevilla.substack.com/p/boundary-a...

#ReinforcementLearning

0 0 0 0