Ai2

Enterprise

non-profit

Verified

https://allenai.org/

allen_ai

allenai

AI & ML interests

Building breakthrough AI to solve the world's biggest problems.

Recent Activity

ljvmiranda921 authored a paper 2 days ago

MMTEB: Massive Multilingual Text Embedding Benchmark

ljvmiranda921 authored a paper 2 days ago

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

faezeb authored a paper 10 days ago

Large-Scale Data Selection for Instruction Tuning

Articles

Introducing the Open Chain of Thought Leaderboard

allenai's activity

shannons

authored a paper 1 day ago

SelfCite: Self-Supervised Alignment for Context Attribution in Large Language Models

Paper • 2502.09604 • Published 29 days ago • 33

ljvmiranda921

authored 2 papers 2 days ago

MMTEB: Massive Multilingual Text Embedding Benchmark

Paper • 2502.13595 • Published 23 days ago • 32

Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia

Paper • 2503.07920 • Published 4 days ago • 89

yakazimir

authored a paper 15 days ago

ZebraLogic: On the Scaling Limits of LLMs for Logical Reasoning

Paper • 2502.01100 • Published Feb 3 • 17

yyupenn

authored 3 papers 21 days ago

Scaling Text-Rich Image Understanding via Code-Guided Synthetic Multimodal Data Generation

Paper • 2502.14846 • Published 22 days ago • 13

Articulate-Anything: Automatic Modeling of Articulated Objects via a Vision-Language Foundation Model

Paper • 2410.13882 • Published Oct 3, 2024

MiRAGeNews: Multimodal Realistic AI-Generated News Detection

Paper • 2410.09045 • Published Oct 11, 2024 • 4

swj0419

authored a paper about 1 month ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 111

Muennighoff

authored a paper about 1 month ago

s1: Simple test-time scaling

Paper • 2501.19393 • Published Jan 31 • 111

natolambert

authored 8 papers 2 months ago

Objective Mismatch in Model-based Reinforcement Learning

Paper • 2002.04523 • Published Feb 11, 2020

Confidence-Building Measures for Artificial Intelligence: Workshop Proceedings

Paper • 2308.00862 • Published Aug 1, 2023

A Survey on Data Selection for Language Models

Paper • 2402.16827 • Published Feb 26, 2024 • 4

D2PO: Discriminator-Guided DPO with Response Evaluation Models

Paper • 2405.01511 • Published May 2, 2024

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Paper • 2406.09279 • Published Jun 13, 2024 • 2

WildGuard: Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

Paper • 2406.18495 • Published Jun 26, 2024 • 13

Towards a Framework for Openness in Foundation Models: Proceedings from the Columbia Convening on Openness in Artificial Intelligence

Paper • 2405.15802 • Published May 17, 2024

Molmo and PixMo: Open Weights and Open Data for State-of-the-Art Multimodal Models

Paper • 2409.17146 • Published Sep 25, 2024 • 108

ljvmiranda921

authored a paper 2 months ago

Bridging the Data Provenance Gap Across Text, Speech and Video

Paper • 2412.17847 • Published Dec 19, 2024 • 9

dirkgr

authored a paper 2 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 18

natolambert

authored a paper 2 months ago

2 OLMo 2 Furious

Paper • 2501.00656 • Published Dec 31, 2024 • 18