Toggle navigation
OpenReview
.net
Login
×
Back to
CMU
CMU 2025 LTI-SRS Submissions
Human-Aligned Chess With a Bit of Search
Yiming Zhang
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Programming with Pixels: Towards Generalist Software Engineering Agents
Pranjal Aggarwal
,
Sean Welleck
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Poster
Readers:
Everyone
AlphaVerus: Bootstrapping Formally Verified Code Generation through Self-Improving Translation and Treefinement
Pranjal Aggarwal
,
Bryan Parno
,
Sean Welleck
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Leveraging Machine-generated Rationales for Conversational Forecasting
Ritam Dutt
,
Gayathri Ganesh Lakshmy
,
Carolyn Rose
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Poster
Readers:
Everyone
Should I Agree with You? Simulating Persuasion and Decision Dynamics in Multi-Agent Moral Dilemmas
Jiarui Liu
,
Mingqian Zheng
,
Yueqi Song
,
Yunze Xiao
,
Lindia Tjuatja
,
Maarten Sap
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Toward Global AI Inclusivity: A Large-Scale Multilingual Terminology Dataset (GIST)
Jiarui Liu
,
Iman Ouzzani
,
Wenkai Li
,
Lechen Zhang
,
Tianyue Ou
,
Houda Bouamor
,
Zhijing Jin
,
Mona T. Diab
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Energy Considerations of Large Language Model Inference and Efficiency Optimizations
Jared Fernandez
,
Clara Na
,
Vashisth Tiwari
,
Yonatan Bisk
,
Sasha Luccioni
,
Emma Strubell
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Loki: Studying MARL Collusion using LLMs in a Kuhn Poker Environment
Calvin Qin
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Poster
Readers:
Everyone
Can Long-Context Language Models Solve Repository-Level Code Generation?
YIBO PENG
,
Zora Zhiruo Wang
,
Daniel Fried
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Poster
Readers:
Everyone
MICE for CATs: Model-Internal Confidence Estimation for Calibrating Agents with Tools
Nishant Subramani
,
Jason Eisner
,
Justin Svegliato
,
Benjamin Van Durme
,
Yu Su
,
Sam Thomson
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Can KBQA Models Predict Their Reasoning Paths? Isomorphism Prediction Task as a Proxy
Zhen Wu
,
Ritam Dutt
,
Dhruv Gupta
,
Carolyn Rose
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Poster
Readers:
Everyone
Social Scaffolds: A Generalization Framework for Social Understanding Tasks
Ritam Dutt
,
Carolyn Rose
,
Maarten Sap
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Mind the Gesture: Evaluating AI Sensitivity to Culturally Offensive Non-Verbal Gestures
Akhila Yerukola
,
Saadia Gabriel
,
Nanyun Peng
,
Maarten Sap
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Fairshare Data Pricing for Large Language Models
Cathy Jiao
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
On the Feasibility of In-Context Probing for Data Attribution
Cathy Jiao
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone
Decoding Dark Matter: Specialized Sparse Autoencoders for Interpreting Rare Concepts in Foundation Models
Aashiq Muhamed
,
Mona T. Diab
,
Virginia Smith
Published: 06 Apr 2025, Last Modified: 18 Apr 2025
LTI-SRS 2025 Oral
Readers:
Everyone