#webagents hashtag - Bluesky

@veripura.bsky.social

3 weeks ago

Plan active. Agents deployed. 🐟

Thanks #TinyFish for the boost! We’re using the Web Agent API to power VeriPura with human-like navigation and anti-bot protection.

Next stop: Recording our demo and shipping for the #TinyFishAccelerator. Let’s build. 🛠️

#VeriPura #WebAgents

Naoya

@naoyacreates.bsky.social

4 months ago

AgentFold Solves Web Agent Memory Issues? | AI News AgentFold tackles the web agent memory bottleneck. Explore how it's changing the game in AI.

AIMindUpdate News!
Memory is the wall for web agents. AgentFold may have the solution. Learn more! #AgentFold #WebAgents #AI

Click here↓↓↓
aimindupdate.com/2025/11/16/a...

GetNews.me

@getnews-me.bsky.social

5 months ago

PolicyGuardBench Introduces Guardrails for Web Agent Policy Compliance

PolicyGuardBench releases a dataset of about 60,000 labeled web-agent trajectories and a lightweight guardrail model, PolicyGuard-4B, that detects policy violations with fast inference. getnews.me/policyguardbench-introdu... #policyguard #webagents

GetNews.me

@getnews-me.bsky.social

5 months ago

Evaluating Web Agent Reliability: Introducing the WAREX Benchmark

The new WAREX benchmark adds network jitter, TLS errors and security threats to suites like WebArena, WebVoyager and REAL. Under moderate jitter agents’ success rates fell below 50%. Read more: getnews.me/evaluating-web-agent-rel... #warex #webagents

GetNews.me

@getnews-me.bsky.social

5 months ago

FocusAgent Boosts Web Agent Efficiency by Trimming Large Page Contexts

FocusAgent trims web-agent observations by over 50% while keeping task success comparable to full-page baselines; its security variant reduces prompt-injection attacks. Read more: getnews.me/focusagent-boosts-web-ag... #focusagent #webagents

GetNews.me

@getnews-me.bsky.social

5 months ago

WAInjectBench: Benchmark for Detecting Prompt Injection in Web Agents

WAInjectBench, a benchmark for detecting prompt injection in web agents, provides a dataset with malicious text snippets and images, and benign controls. Code and data are on GitHub. getnews.me/wainjectbench-benchmark-... #promptinj #webagents

GetNews.me

@getnews-me.bsky.social

6 months ago

Fine-Grained Evaluation Framework Improves Reliability of AI Web Agents

Evaluation framework splits AI agents into perception, decision‑making, execution, verification stages to spot errors. Tested on SeeAct and Mind2Web, paper posted 17 September 2025. Read more: getnews.me/fine-grained-evaluation-... #ai #webagents

GetNews.me

@getnews-me.bsky.social

6 months ago

ReSum: Enhancing Long‑Horizon Web Search with Context Summarization

ReSum condenses LLM web‑agent dialogues into reasoning states, letting agents search. It yields a 4.5% boost over ReAct, rising to 8.2% with ReSum‑GRPO finetuning. Read more: getnews.me/resum-enhancing-long-hor... #webagents #summarization

Micha the DevOp

@michabbb.bsky.social

7 months ago

#MagenticUI by #Microsoft
Human-centered web automation with a multi-agent system 🤖
#AI #automation #webagents #opensource #research #Python #Docker #AutoGen

🤝 Co-planning
Collaboratively create step-by-step plans using chat and a plan editor for transparent task execution.

🧵 👇

Bartosz Sokoliński

@bortys.bsky.social

11 months ago

Building AI Browser Agents Build agents that navigate and interact with websites, and learn how to make them more reliable.

🔎 The new DeepLearningAI course is like a "School for AI Agents", only instead of martinis you get Monte Carlo Tree Search, self-critique, and DPO.
AgentQ teaches bots to handle a browser better than I handle my bookmarks.
➡️ deeplearning.ai/short-courses/building-ai-browser-agents
#AI #WebAgents

Thanasis Mandaltsis

@trippinjaguar.bsky.social

1 year ago

This is neat 🔥 I added my web agent to my bluesky profile and just passed a copy of my private key so it’s stored within the app and can trust it’s me.

I can load my P2P apps or I can get auto logged-in to my website’s WP admin.

Zero integration needed 🤓

#Agents #WebAgents
