2025 BERT is NeoBERT! We have fully pre-trained a next-generation encoder for 2.1T tokens with the latest advances in data, training, and architecture. This is a heroic effort from my PhD student, Lola Le Breton, in collaboration with Quentin Fournier and Mariam El Mezouar (1/n)
Posts by Chris Pal
π Excited to introduce BigDocs!
An open, transparent multimodal dataset designed for:
π Documents
π Web content
π₯οΈ GUI understanding
π¨βπ» Code generation from images
Weβre also launching BigDocs-Bench:
β‘οΈ Document, Web, GUI Visual reasoning
β‘οΈ Converting images into JSON, Markdown, LaTeX, SVG, and more!
I think it is the AC's job to detect when a reviewer's criticisms have been addressed, but the reviewer has checked out, and then throw out that reviewer's score. I think this should be made an official policy of ICLR. Reviewers who do this are being far worse that just rude.
This could easily be a photo of exactly what you describe.
LLMs have a lot of potential for science, but scientists can be particularly sensitive to factuality, nuances, and hallucinations. The new ScholarQABench benchmark in this paper looks pretty useful for the community to monitor progress on LLMs for science. arxiv.org/html/2411.14199