AI safety benchmarks built on Western data miss how risk manifests across cultures.
MLCommons is fixing that: 7,000+ multimodal prompts from APAC, built with regional experts from Singapore, India, and Korea.
mlcommons.org/2026/03/airr...
#MLCommons #AILuminate #MultimodalAI
AI risk assessment now has a global standard. The MLCommons AILuminate Global Assurance Program gives organizations a structured, independent path to evaluate AI reliability. 🔗mlcommons.org/2026/02/ailuminate-globa...
#AIGovernance #AILuminate
How do you make jailbreak benchmarks defensible to auditors? MLCommons just published a mechanism-first taxonomy for reproducible, governance-aligned LLM robustness evaluation. Not a leaderboard — the foundation for trustworthy ones. 🔗 https://bit.ly/3ZCHqlZ
#AILuminate
Last week, Rebecca Weiss announced the AILuminate Global Assurance Program from the stage of a global AI standards panel in New Delhi. Read about the program and watch the panel below.
📖 https://bit.ly/4kIS18x
▶️ https://bit.ly/46nDtp3
#AIImpactSummit #MLCommons #AILuminate
#MLCommons just released two new French prompt #datasets for #AILuminate:
🔹Demo set: 1,200+ prompts, free for AI safety testing
🔹Practice set: 12,000 prompts for deeper evaluation (on request)
Both were created by native speakers and are ready for #ModelBench. Details: mlcommons.org/2025/04/ailu...
#AI #AIRR
MLCommons, in partnership with the AI Verify Foundation, released AILuminate v1.1, incorporating new French-language capabilities into its first-of-its-kind AI safety benchmark.
Learn more: mlcommons.org/2025/02/ailu...
#ailuminate #parisaiactionsummit #aiverifyfoundation
MLCommons has launched AILuminate, a new benchmark focused on evaluating safety risks in large language models. #ai #llms #aibenchmarks #aisafety #mlcommons #AILuminate @mlcommons
winbuzzer.com/2024/12/07/m...