| 2024 | "A good pun is its own reword": Can Large Language Models Understand Puns? Zhijun Xu, Siyu Yuan, Lingjie Chen, Deqing Yang |
| 2024 | "Flex Tape Can't Fix That": Bias and Misinformation in Edited Language Models. Karina Halevy, Anna Sotnikova, Badr AlKhamissi, Syrielle Montariol, Antoine Bosselut |
| 2024 | "Global is Good, Local is Bad?": Understanding Brand Bias in LLMs. Mahammed Kamruzzaman, Hieu Nguyen, Gene Louis Kim |
| 2024 | "Image, Tell me your story!" Predicting the original meta-context of visual misinformation. Jonathan Tonglet, Marie-Francine Moens, Iryna Gurevych |
| 2024 | "In-Dialogues We Learn": Towards Personalized Dialogue Without Pre-defined Profiles through In-Dialogue Learning. Chuanqi Cheng, Quan Tu, Wei Wu, Shuo Shang, Cunli Mao, Zhengtao Yu, Rui Yan |
| 2024 | "They are uncultured": Unveiling Covert Harms and Social Threats in LLM Generated Conversations. Preetam Prabhu Srikar Dammu, Hayoung Jung, Anjali Singh, Monojit Choudhury, Tanushree Mitra |
| 2024 | "Thinking" Fair and Slow: On the Efficacy of Structured Prompts for Debiasing Language Models. Shaz Furniturewala, Surgan Jandial, Abhinav Java, Pragyan Banerjee, Simra Shahid, Sumit Bhatia, Kokil Jaidka |
| 2024 | "We Demand Justice!": Towards Social Context Grounding of Political Texts. Rajkumar Pujari, Chengfei Wu, Dan Goldwasser |
| 2024 | "You Gotta be a Doctor, Lin" : An Investigation of Name-Based Bias of Large Language Models in Employment Recommendations. Huy Nghiem, John Prindle, Jieyu Zhao, Hal Daumé III |
| 2024 | 'Quis custodiet ipsos custodes?' Who will watch the watchmen? On Detecting AI-generated peer-reviews. Sandeep Kumar, Mohit Sahu, Vardhan Gacche, Tirthankar Ghosal, Asif Ekbal |
| 2024 | ***YesBut***: A High-Quality Annotated Multimodal Dataset for evaluating Satire Comprehension capability of Vision-Language Models. Abhilash Nandy, Yash Agarwal, Ashish Patwa, Millon Madhur Das, Aman Bansal, Ankit Raj, Pawan Goyal, Niloy Ganguly |
| 2024 | 1+1\textgreater2: Can Large Language Models Serve as Cross-Lingual Knowledge Aggregators? Yue Huang, Chenrui Fan, Yuan Li, Siyuan Wu, Tianyi Zhou, Xiangliang Zhang, Lichao Sun |
| 2024 | A Bayesian Approach to Harnessing the Power of LLMs in Authorship Attribution. Zhengmian Hu, Tong Zheng, Heng Huang |
| 2024 | A Closer Look at Multidimensional Online Political Incivility. Sagi Pendzel, Nir Lotan, Alon Zoizner, Einat Minkov |
| 2024 | A Comparison of Language Modeling and Translation as Multilingual Pretraining Objectives. Zihao Li, Shaoxiong Ji, Timothee Mickus, Vincent Segonne, Jörg Tiedemann |
| 2024 | A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery. Yu Zhang, Xiusi Chen, Bowen Jin, Sheng Wang, Shuiwang Ji, Wei Wang, Jiawei Han |
| 2024 | A Fast and Sound Tagging Method for Discontinuous Named-Entity Recognition. Caio Corro |
| 2024 | A Generic Method for Fine-grained Category Discovery in Natural Language Texts. Chang Tian, Matthew B. Blaschko, Wenpeng Yin, Mingzhe Xing, Yinliang Yue, Marie-Francine Moens |
| 2024 | A Learning Rate Path Switching Training Paradigm for Version Updates of Large Language Models. Zhihao Wang, Shiyu Liu, Jianheng Huang, Wang Zheng, Yixuan Liao, Xiaoxin Chen, Junfeng Yao, Jinsong Su |
| 2024 | A Morphology-Based Investigation of Positional Encodings. Poulami Ghosh, Shikhar Vashishth, Raj Dabre, Pushpak Bhattacharyya |
| 2024 | A Multi-Perspective Analysis of Memorization in Large Language Models. Bowen Chen, Namgi Han, Yusuke Miyao |
| 2024 | A New Pipeline for Knowledge Graph Reasoning Enhanced by Large Language Models Without Fine-Tuning. Zhongwu Chen, Long Bai, Zixuan Li, Zhen Huang, Xiaolong Jin, Yong Dou |
| 2024 | A Peek into Token Bias: Large Language Models Are Not Yet Genuine Reasoners. Bowen Jiang, Yangxinyu Xie, Zhuoqun Hao, Xiaomeng Wang, Tanwi Mallick, Weijie Su, Camillo J. Taylor, Dan Roth |
| 2024 | A Probability-Quality Trade-off in Aligned Language Models and its Relation to Sampling Adaptors. Naaman Tan, Josef Valvoda, Tianyu Liu, Anej Svete, Yanxia Qin, Min-Yen Kan, Ryan Cotterell |
| 2024 | A SMART Mnemonic Sounds like "Glue Tonic": Mixing LLMs with Student Feedback to Make Mnemonic Learning Stick. Nishant Balepur, Matthew Shu, Alexander Miserlis Hoyle, Alison Robey, Shi Feng, Seraphina Goldfarb-Tarrant, Jordan L. Boyd-Graber |
| 2024 | A Simple LLM Framework for Long-Range Video Question-Answering. Ce Zhang, Taixi Lu, Md Mohaiminul Islam, Ziyang Wang, Shoubin Yu, Mohit Bansal, Gedas Bertasius |
| 2024 | A Simple and Effective L_2 Norm-Based Strategy for KV Cache Compression. Alessio Devoto, Yu Zhao, Simone Scardapane, Pasquale Minervini |
| 2024 | A Simple yet Effective Training-free Prompt-free Approach to Chinese Spelling Correction Based on Large Language Models. Houquan Zhou, Zhenghua Li, Bo Zhang, Chen Li, Shaopeng Lai, Ji Zhang, Fei Huang, Min Zhang |
| 2024 | A Study of Nationality Bias in Names and Perplexity using Off-the-Shelf Affect-related Tweet Classifiers. Valentin Barrière, Sebastian Cifuentes |
| 2024 | A Survey of AMR Applications. Shira Wein, Juri Opitz |
| 2024 | A Survey of Ontology Expansion for Conversational Understanding. Jinggui Liang, Yuxia Wu, Yuan Fang, Hao Fei, Lizi Liao |
| 2024 | A Survey on In-context Learning. Qingxiu Dong, Lei Li, Damai Dai, Ce Zheng, Jingyuan Ma, Rui Li, Heming Xia, Jingjing Xu, Zhiyong Wu, Baobao Chang, Xu Sun, Lei Li, Zhifang Sui |
| 2024 | A Systematic Analysis of Large Language Models as Soft Reasoners: The Case of Syllogistic Inferences. Leonardo Bertolazzi, Albert Gatt, Raffaella Bernardi |
| 2024 | A Systematic Survey and Critical Review on Evaluating Large Language Models: Challenges, Limitations, and Recommendations. Md. Tahmid Rahman Laskar, Sawsan Alqahtani, M. Saiful Bari, Mizanur Rahman, Mohammad Abdullah Matin Khan, Haidar Khan, Israt Jahan, Amran Bhuiyan, Chee-Wei Tan, Md. Rizwan Parvez, Enamul Hoque, Shafiq Joty, Jimmy Huang |
| 2024 | A Thorough Examination of Decoding Methods in the Era of LLMs. Chufan Shi, Haoran Yang, Deng Cai, Zhisong Zhang, Yifan Wang, Yujiu Yang, Wai Lam |
| 2024 | A Two-Step Approach for Data-Efficient French Pronunciation Learning. Hoyeon Lee, Hyeeun Jang, Jong-Hwan Kim, Jae-Min Kim |
| 2024 | A Usage-centric Take on Intent Understanding in E-Commerce. Wendi Zhou, Tianyi Li, Pavlos Vougiouklis, Mark Steedman, Jeff Z. Pan |
| 2024 | A User-Centric Multi-Intent Benchmark for Evaluating Large Language Models. Jiayin Wang, Fengran Mo, Weizhi Ma, Peijie Sun, Min Zhang, Jian-Yun Nie |
| 2024 | A linguistically-motivated evaluation methodology for unraveling model's abilities in reading comprehension tasks. Elie Antoine, Frédéric Béchet, Géraldine Damnati, Philippe Langlais |
| 2024 | ABLE: Personalized Disability Support with Politeness and Empathy Integration. Kshitij Mishra, Manisha Burja, Asif Ekbal |
| 2024 | ABSEval: An Agent-based Framework for Script Evaluation. Sirui Liang, Baoli Zhang, Jun Zhao, Kang Liu |
| 2024 | ACE: A LLM-based Negotiation Coaching System. Ryan Shea, Aymen Kallala, Xin Liu, Michael W. Morris, Zhou Yu |
| 2024 | ADELIE: Aligning Large Language Models on Information Extraction. Yunjia Qi, Hao Peng, Xiaozhi Wang, Bin Xu, Lei Hou, Juanzi Li |
| 2024 | AGRaME: Any-Granularity Ranking with Multi-Vector Embeddings. Revanth Gangi Reddy, Omar Attia, Yunyao Li, Heng Ji, Saloni Potdar |
| 2024 | AKEW: Assessing Knowledge Editing in the Wild. Xiaobao Wu, Liangming Pan, William Yang Wang, Anh Tuan Luu |
| 2024 | ALVIN: Active Learning Via INterpolation. Michalis Korakakis, Andreas Vlachos, Adrian Weller |
| 2024 | AMPO: Automatic Multi-Branched Prompt Optimization. Sheng Yang, Yurong Wu, Yan Gao, Zineng Zhou, Bin Zhu, Xiaodi Sun, Jian-Guang Lou, Zhiming Ding, Anbang Hu, Yuan Fang, Yunsong Li, Junyan Chen, Linjun Yang |
| 2024 | AMR-Evol: Adaptive Modular Response Evolution Elicits Better Knowledge Distillation for Large Language Models in Code Generation. Ziyang Luo, Xin Li, Hongzhan Lin, Jing Ma, Lidong Bing |
| 2024 | APPLS: Evaluating Evaluation Metrics for Plain Language Summarization. Yue Guo, Tal August, Gondy Leroy, Trevor Cohen, Lucy Lu Wang |
| 2024 | ARES: Alternating Reinforcement Learning and Supervised Fine-Tuning for Enhanced Multi-Modal Chain-of-Thought Reasoning Through Diverse AI Feedback. Ju-Seung Byun, Jiyun Chun, Jihyung Kil, Andrew Perrault |
| 2024 | ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs. Changchun Liu, Kai Zhang, Junzhe Jiang, Zirui Liu, Hanqing Tao, Min Gao, Enhong Chen |
| 2024 | ASETF: A Novel Method for Jailbreak Attack on LLMs through Translate Suffix Embeddings. Hao Wang, Hao Li, Minlie Huang, Lei Sha |
| 2024 | ASL STEM Wiki: Dataset and Benchmark for Interpreting STEM Articles. Kayo Yin, Chinmay Singh, Fyodor Minakov, Vanessa Milan, Hal Daumé III, Cyril Zhang, Alex Lu, Danielle Bragg |
| 2024 | ATAP: Automatic Template-Augmented Commonsense Knowledge Graph Completion via Pre-Trained Language Models. Fu Zhang, Yifan Ding, Jingwei Cheng |
| 2024 | ATM: Adversarial Tuning Multi-agent System Makes a Robust Retrieval-Augmented Generator. Junda Zhu, Lingyong Yan, Haibo Shi, Dawei Yin, Lei Sha |
| 2024 | Academics Can Contribute to Domain-Specialized Language Models. Mark Dredze, Genta Indra Winata, Prabhanjan Kambadur, Shijie Wu, Ozan Irsoy, Steven Lu, Vadim Dabravolski, David S. Rosenberg, Sebastian Gehrmann |
| 2024 | Accurate and Data-Efficient Toxicity Prediction when Annotators Disagree. Harbani Jaggi, Kashyap Coimbatore Murali, Eve Fleisig, Erdem Biyik |
| 2024 | ActPlan-1K: Benchmarking the Procedural Planning Ability of Visual Language Models in Household Activities. Ying Su, Zhan Ling, Haochen Shi, Cheng Jiayang, Yauwai Yim, Yangqiu Song |
| 2024 | AdaSwitch: Adaptive Switching between Small and Large Agents for Effective Cloud-Local Collaborative Learning. Hao Sun, Jiayi Wu, Hengyi Cai, Xiaochi Wei, Yue Feng, Bo Wang, Shuaiqiang Wang, Yan Zhang, Dawei Yin |
| 2024 | AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning. Yifan Yang, Kai Zhen, Ershad Banijamali, Athanasios Mouchtaris, Zheng Zhang |
| 2024 | Adaptable Moral Stances of Large Language Models on Sexist Content: Implications for Society and Gender Discourse. Rongchen Guo, Isar Nejadgholi, Hillary Dawkins, Kathleen C. Fraser, Svetlana Kiritchenko |
| 2024 | Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve? Firat Öncel, Matthias Bethge, Beyza Ermis, Mirco Ravanelli, Cem Subakan, Çagatay Yildiz |
| 2024 | Adapters Mixup: Mixing Parameter-Efficient Adapters to Enhance the Adversarial Robustness of Fine-tuned Pre-trained Text Classifiers. Tuc Nguyen, Thai Le |
| 2024 | Adaption-of-Thought: Learning Question Difficulty Improves Large Language Models for Reasoning. Mayi Xu, Yongqi Li, Ke Sun, Tieyun Qian |
| 2024 | Adaptive Axes: A Pipeline for In-domain Social Stereotype Analysis. Qingcheng Zeng, Mingyu Jin, Rob Voigt |
| 2024 | Adaptive Immune-based Sound-Shape Code Substitution for Adversarial Chinese Text Attacks. Ao Wang, Xinghao Yang, Chen Li, Baodi Liu, Weifeng Liu |
| 2024 | Adaptive Query Rewriting: Aligning Rewriters through Marginal Probability of Conversational Answers. Tianhua Zhang, Kun Li, Hongyin Luo, Xixin Wu, James R. Glass, Helen Meng |
| 2024 | Adaptive Question Answering: Enhancing Language Model Proficiency for Addressing Knowledge Conflicts with Source Citations. Sagi Shaier, Ari Kobren, Philip V. Ogren |
| 2024 | Advancing Adversarial Suffix Transfer Learning on Aligned Large Language Models. Hongfu Liu, Yuxi Xie, Ye Wang, Michael Shieh |
| 2024 | Advancing Event Causality Identification via Heuristic Semantic Dependency Inquiry Network. Haoran Li, Qiang Gao, Hongmei Wu, Li Huang |
| 2024 | Advancing Large Language Model Attribution through Self-Improving. Lei Huang, Xiaocheng Feng, Weitao Ma, Liang Zhao, Yuchun Fan, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin |
| 2024 | Advancing Process Verification for Large Language Models via Tree-Based Preference Learning. Mingqian He, Yongliang Shen, Wenqi Zhang, Zeqi Tan, Weiming Lu |
| 2024 | Advancing Semantic Textual Similarity Modeling: A Regression Framework with Translated ReLU and Smooth K2 Loss. Bowen Zhang, Chunping Li |
| 2024 | Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions. Leena Mathur, Paul Pu Liang, Louis-Philippe Morency |
| 2024 | Advancing Test-Time Adaptation in Wild Acoustic Test Settings. Hongfu Liu, Hengguan Huang, Ye Wang |
| 2024 | Adversarial Text Generation using Large Language Models for Dementia Detection. Youxiang Zhu, Nana Lin, Kiran Balivada, Daniel Haehn, Xiaohui Liang |
| 2024 | African or European Swallow? Benchmarking Large Vision-Language Models for Fine-Grained Object Classification. Gregor Geigle, Radu Timofte, Goran Glavas |
| 2024 | AgentReview: Exploring Peer Review Dynamics with LLM Agents. Yiqiao Jin, Qinlin Zhao, Yiyang Wang, Hao Chen, Kaijie Zhu, Yijia Xiao, Jindong Wang |
| 2024 | AlignCap: Aligning Speech Emotion Captioning to Human Preferences. Ziqi Liang, Haoxiang Shi, Hanhui Chen |
| 2024 | Aligning Language Models to Explicitly Handle Ambiguity. Hyuhng Joon Kim, Youna Kim, Cheonbok Park, Junyeob Kim, Choonghyun Park, Kang Min Yoo, Sang-goo Lee, Taeuk Kim |
| 2024 | Aligning Large Language Models with Diverse Political Viewpoints. Dominik Stammbach, Philine Widmer, Eunjung Cho, Caglar Gulcehre, Elliott Ash |
| 2024 | Aligning Translation-Specific Understanding to General Understanding in Large Language Models. Yichong Huang, Baohang Li, Xiaocheng Feng, Wenshuai Huo, Chengpeng Fu, Ting Liu, Bing Qin |
| 2024 | Alignment-Enhanced Decoding: Defending Jailbreaks via Token-Level Adaptive Refining of Probability Distributions. Quan Liu, Zhenhong Zhou, Longzhu He, Yi Liu, Wei Zhang, Sen Su |
| 2024 | AlphaLoRA: Assigning LoRA Experts Based on Layer Training Quality. Peijun Qing, Chongyang Gao, Yefan Zhou, Xingjian Diao, Yaoqing Yang, Soroush Vosoughi |
| 2024 | Altogether: Image Captioning via Re-aligning Alt-text. Hu Xu, Po-Yao Huang, Xiaoqing Ellen Tan, Ching-Feng Yeh, Jacob Kahn, Christine Jou, Gargi Ghosh, Omer Levy, Luke Zettlemoyer, Wen-tau Yih, Shang-wen Li, Saining Xie, Christoph Feichtenhofer |
| 2024 | AmbigNLG: Addressing Task Ambiguity in Instruction for NLG. Ayana Niwa, Hayate Iso |
| 2024 | An Analysis and Mitigation of the Reversal Curse. Ang Lv, Kaiyi Zhang, Shufang Xie, Quan Tu, Yuhan Chen, Ji-Rong Wen, Rui Yan |
| 2024 | An Analysis of Multilingual FActScore. Vu Trong Kim, Michael Krumdick, Varshini Reddy, Franck Dernoncourt, Viet Dac Lai |
| 2024 | An Audit on the Perspectives and Challenges of Hallucinations in NLP. Pranav Narayanan Venkit, Tatiana Chakravorti, Vipul Gupta, Heidi Biggs, Mukund Srinath, Koustava Goswami, Sarah Rajtmajer, Shomir Wilson |
| 2024 | An Effective Deployment of Diffusion LM for Data Augmentation in Low-Resource Sentiment Classification. Zhuowei Chen, Lianxi Wang, Yuben Wu, Xinfeng Liao, Yujia Tian, Junyang Zhong |
| 2024 | An Electoral Approach to Diversify LLM-based Multi-Agent Collective Decision-Making. Xiutian Zhao, Ke Wang, Wei Peng |
| 2024 | An Empirical Analysis of the Writing Styles of Persona-Assigned LLMs. Manuj Malik, Jing Jiang, Kian Ming A. Chai |
| 2024 | An Empirical Analysis on Spatial Reasoning Capabilities of Large Multimodal Models. Fatemeh Shiri, Xiao-Yu Guo, Mona Far, Xin Yu, Reza Haf, Yuan-Fang Li |
| 2024 | An Empirical Study of Multilingual Reasoning Distillation for Question Answering. Patomporn Payoungkhamdee, Peerat Limkonchotiwat, Jinheon Baek, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong |
| 2024 | An Experimental Analysis on Evaluating Patent Citations. Rabindra Nath Nandi, Suman Kalyan Maity, Brian Uzzi, Sourav Medya |
| 2024 | An Inversion Attack Against Obfuscated Embedding Matrix in Language Model Inference. Yu Lin, Qizhi Zhang, Quanwei Cai, Jue Hong, Wu Ye, Huiqi Liu, Bing Duan |
| 2024 | An L* Algorithm for Deterministic Weighted Regular Languages. Clemente Pasti, Talu Karagöz, Franz Nowak, Anej Svete, Reda Boumasmoud, Ryan Cotterell |
| 2024 | An LLM Feature-based Framework for Dialogue Constructiveness Assessment. Lexin Zhou, Youmna Farag, Andreas Vlachos |
| 2024 | An Unsupervised Approach to Achieve Supervised-Level Explainability in Healthcare Records. Joakim Edin, Maria Maistro, Lars Maaløe, Lasse Borgholt, Jakob D. Havtorn, Tuukka Ruotsalo |
| 2024 | An image speaks a thousand words, but can everyone listen? On image transcreation for cultural relevance. Simran Khanuja, Sathyanarayanan Ramamoorthy, Yueqi Song, Graham Neubig |
| 2024 | AnaloBench: Benchmarking the Identification of Abstract and Long-context Analogies. Xiao Ye, Andrew Wang, Jacob Choi, Yining Lu, Shreya Sharma, Lingfeng Shen, Vijay Murari Tiyyala, Nicholas Andrews, Daniel Khashabi |
| 2024 | Analysis of Plan-based Retrieval for Grounded Text Generation. Ameya Godbole, Nicholas Monath, Seungyeon Kim, Ankit Singh Rawat, Andrew McCallum, Manzil Zaheer |
| 2024 | Analyzing Key Factors Influencing Emotion Prediction Performance of VLLMs in Conversational Contexts. Jaewook Lee, Yeajin Jang, Hongjin Kim, Woojin Lee, Harksoo Kim |
| 2024 | Annotation alignment: Comparing LLM and human annotations of conversational safety. Rajiv Movva, Pang Wei Koh, Emma Pierson |
| 2024 | Annotator-Centric Active Learning for Subjective NLP Tasks. Michiel van der Meer, Neele Falk, Pradeep K. Murukannaiah, Enrico Liscio |
| 2024 | ApiQ: Finetuning of 2-Bit Quantized Large Language Model. Baohao Liao, Christian Herold, Shahram Khadivi, Christof Monz |
| 2024 | AppBench: Planning of Multiple APIs from Various APPs for Complex User Instruction. Hongru Wang, Rui Wang, Boyang Xue, Heming Xia, Jingtao Cao, Zeming Liu, Jeff Z. Pan, Kam-Fai Wong |
| 2024 | Applying Contrastive Learning to Code Vulnerability Type Classification. Chen Ji, Su Yang, Hongyu Sun, Yuqing Zhang |
| 2024 | Applying Intrinsic Debiasing on Downstream Tasks: Challenges and Considerations for Machine Translation. Bar Iluz, Yanai Elazar, Asaf Yehudai, Gabriel Stanovsky |
| 2024 | ArMeme: Propagandistic Content in Arabic Memes. Firoj Alam, Abul Hasnat, Fatema Ahmad, Md. Arid Hasan, Maram Hasanain |
| 2024 | Are Data Augmentation Methods in Named Entity Recognition Applicable for Uncertainty Estimation? Wataru Hashimoto, Hidetaka Kamigaito, Taro Watanabe |
| 2024 | Are LLMs Good Zero-Shot Fallacy Classifiers? Fengjun Pan, Xiaobao Wu, Zongrui Li, Anh Tuan Luu |
| 2024 | Are Large Language Models Capable of Generating Human-Level Narratives? Yufei Tian, Tenghao Huang, Miri Liu, Derek Jiang, Alexander Spangher, Muhao Chen, Jonathan May, Nanyun Peng |
| 2024 | Are Large Language Models Good Classifiers? A Study on Edit Intent Classification in Scientific Document Revisions. Qian Ruan, Ilia Kuznetsov, Iryna Gurevych |
| 2024 | Are Large Language Models In-Context Personalized Summarizers? Get an iCOPERNICUS Test Done! Divya Patel, Pathik Patel, Ankush Chander, Sourish Dasgupta, Tanmoy Chakraborty |
| 2024 | Argument Relation Classification through Discourse Markers and Adversarial Training. Michele Contalbo, Francesco Guerra, Matteo Paganelli |
| 2024 | ArxivDIGESTables: Synthesizing Scientific Literature into Tables using Language Models. Benjamin Newman, Yoonjoo Lee, Aakanksha Naik, Pao Siangliulue, Raymond Fok, Juho Kim, Daniel S. Weld, Joseph Chee Chang, Kyle Lo |
| 2024 | Assessing "Implicit" Retrieval Robustness of Large Language Models. Xiaoyu Shen, Rexhina Blloshmi, Dawei Zhu, Jiahuan Pei, Wei Zhang |
| 2024 | Assessing and Verifying Task Utility in LLM-Powered Applications. Negar Arabzadeh, Siqing Huo, Nikhil Mehta, Qingyun Wu, Chi Wang, Ahmed Awadallah, Charles L. A. Clarke, Julia Kiseleva |
| 2024 | AssistantBench: Can Web Agents Solve Realistic and Time-Consuming Tasks? Ori Yoran, Samuel Joseph Amouyal, Chaitanya Malaviya, Ben Bogin, Ofir Press, Jonathan Berant |
| 2024 | Atomic Inference for NLI with Generated Facts as Atoms. Joe Stacey, Pasquale Minervini, Haim Dubossarsky, Oana-Maria Camburu, Marek Rei |
| 2024 | Atomic Self-Consistency for Better Long Form Generations. Raghuveer Thirukovalluru, Yukun Huang, Bhuwan Dhingra |
| 2024 | Attention Score is not All You Need for Token Importance Indicator in KV Cache Reduction: Value Also Matters. Zhiyu Guo, Hidetaka Kamigaito, Taro Watanabe |
| 2024 | Attribute Diversity Determines the Systematicity Gap in VQA. Ian Berlot-Attwell, Kumar Krishna Agrawal, Annabelle Michael Carrell, Yash Sharma, Naomi Saphra |
| 2024 | Attribute or Abstain: Large Language Models as Long Document Assistants. Jan Buchmann, Xiao Liu, Iryna Gurevych |
| 2024 | AudioVSR: Enhancing Video Speech Recognition with Audio Data. Xiaoda Yang, Xize Cheng, Jiaqi Duan, Hongshun Qiu, Minjie Hong, Minghui Fang, Shengpeng Ji, Jialong Zuo, Zhiqing Hong, Zhimeng Zhang, Tao Jin |
| 2024 | AutoPersuade: A Framework for Evaluating and Explaining Persuasive Arguments. Till Saenger, Musashi Hinck, Justin Grimmer, Brandon M. Stewart |
| 2024 | AutoScraper: A Progressive Understanding Web Agent for Web Scraper Generation. Wenhao Huang, Zhouhong Gu, Chenghao Peng, Jiaqing Liang, Zhixu Li, Yanghua Xiao, Liqian Wen, Zulong Chen |
| 2024 | Automated Essay Scoring: A Reflection on the State of the Art. Shengjie Li, Vincent Ng |
| 2024 | Automatic Instruction Evolving for Large Language Models. Weihao Zeng, Can Xu, Yingxiu Zhao, Jian-Guang Lou, Weizhu Chen |
| 2024 | Automatic sentence segmentation of clinical record narratives in real-world data. Dongfang Xu, Davy Weissenbacher, Karen O'Connor, Siddharth Rawal, Graciela Gonzalez-Hernandez |
| 2024 | Automatically Generated Definitions and their utility for Modeling Word Meaning. Francesco Periti, David Alfter, Nina Tahmasebi |
| 2024 | Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards. Heejin Do, Sangwon Ryu, Gary Geunbae Lee |
| 2024 | Autoregressive Pre-Training on Pixels and Texts. Yekun Chai, Qingyi Liu, Jingwu Xiao, Shuohuan Wang, Yu Sun, Hua Wu |
| 2024 | BC-Prover: Backward Chaining Prover for Formal Theorem Proving. Yuhang He, Jihai Zhang, Jianzhu Bao, Fangquan Lin, Cheng Yang, Bing Qin, Ruifeng Xu, Wotao Yin |
| 2024 | BEEAR: Embedding-based Adversarial Removal of Safety Backdoors in Instruction-tuned Language Models. Yi Zeng, Weiyu Sun, Tran Ngoc Huynh, Dawn Song, Bo Li, Ruoxi Jia |
| 2024 | BLSP-Emo: Towards Empathetic Large Speech-Language Models. Chen Wang, Minpeng Liao, Zhongqiang Huang, Junhong Wu, Chengqing Zong, Jiajun Zhang |
| 2024 | BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers. Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May Dongmei Wang, Joyce C. Ho, Chao Zhang, Carl Yang |
| 2024 | BPE Gets Picky: Efficient Vocabulary Refinement During Tokenizer Training. Pavel Chizhov, Catherine Arnett, Elizaveta Korotkova, Ivan P. Yamshchikov |
| 2024 | BPO: Staying Close to the Behavior LLM Creates Better Online LLM Alignment. Wenda Xu, Jiachen Li, William Yang Wang, Lei Li |
| 2024 | Back to School: Translation Using Grammar Books. Jonathan Hus, Antonios Anastasopoulos |
| 2024 | Backward Lens: Projecting Language Model Gradients into the Vocabulary Space. Shahar Katz, Yonatan Belinkov, Mor Geva, Lior Wolf |
| 2024 | BaitAttack: Alleviating Intention Shift in Jailbreak Attacks via Adaptive Bait Crafting. Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang |
| 2024 | Bayesian Calibration of Win Rate Estimation with LLM Evaluators. Yicheng Gao, Gonghan Xu, Zhe Wang, Arman Cohan |
| 2024 | Bayesian Example Selection Improves In-Context Learning for Speech, Text and Visual Modalities. Siyin Wang, Chao-Han Huck Yang, Ji Wu, Chao Zhang |
| 2024 | Be Helpful but Don't Talk too Much - Enhancing Helpfulness in Conversations through Relevance in Multi-Turn Emotional Support. Junlin Li, Bo Peng, Yu-Yin Hsu, Chu-Ren Huang |
| 2024 | Belief Revision: The Adaptability of Large Language Models Reasoning. Bryan Wilie, Samuel Cahyawijaya, Etsuko Ishii, Junxian He, Pascale Fung |
| 2024 | Benchmarking Vision Language Models for Cultural Understanding. Shravan Nayak, Kanishk Jain, Rabiul Awal, Siva Reddy, Sjoerd van Steenkiste, Lisa Anne Hendricks, Karolina Stanczak, Aishwarya Agrawal |
| 2024 | Beyond Correlation: Interpretable Evaluation of Machine Translation Metrics. Stefano Perrella, Lorenzo Proietti, Pere-Lluís Huguet Cabot, Edoardo Barba, Roberto Navigli |
| 2024 | Beyond Embeddings: The Promise of Visual Table in Visual Reasoning. Yiwu Zhong, Zi-Yuan Hu, Michael R. Lyu, Liwei Wang |
| 2024 | Beyond Label Attention: Transparency in Language Models for Automated Medical Coding via Dictionary Learning. John Wu, David Wu, Jimeng Sun |
| 2024 | Beyond Reference: Evaluating High Quality Translations Better than Human References. Keonwoong Noh, Seokjin Oh, Woohwan Jung |
| 2024 | Beyond Turn-Based Interfaces: Synchronous LLMs as Full-Duplex Dialogue Agents. Bandhav Veluri, Benjamin N. Peloquin, Bokai Yu, Hongyu Gong, Shyamnath Gollakota |
| 2024 | Beyond the Turn-Based Game: Enabling Real-Time Conversations with Duplex Models. Xinrong Zhang, Yingfa Chen, Shengding Hu, Xu Han, Zihang Xu, Yuanwei Xu, Weilin Zhao, Maosong Sun, Zhiyuan Liu |
| 2024 | BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs. Zhiting Fan, Ruizhe Chen, Ruiling Xu, Zuozhu Liu |
| 2024 | BiasWipe: Mitigating Unintended Bias in Text Classifiers through Model Interpretability. Mamta Mamta, Rishikant Chigrupaatii, Asif Ekbal |
| 2024 | Bio-RFX: Refining Biomedical Extraction via Advanced Relation Classification and Structural Constraints. Minjia Wang, Fangzhou Liu, Xiuxing Li, Bowen Dong, Zhenyu Li, Tengyu Pan, Jianyong Wang |
| 2024 | Birdie: Advancing State Space Language Modeling with Dynamic Mixtures of Training Objectives. Sam Blouir, Jimmy T. H. Smith, Antonios Anastasopoulos, Amarda Shehu |
| 2024 | BlendFilter: Advancing Retrieval-Augmented Large Language Models via Query Generation Blending and Knowledge Filtering. Haoyu Wang, Ruirui Li, Haoming Jiang, Jinjin Tian, Zhengyang Wang, Chen Luo, Xianfeng Tang, Monica Xiao Cheng, Tuo Zhao, Jing Gao |
| 2024 | Boosting Logical Fallacy Reasoning in LLMs via Logical Structure Tree. Yuanyuan Lei, Ruihong Huang |
| 2024 | Boosting Scientific Concepts Understanding: Can Analogy from Teacher Models Empower Student Models? Siyu Yuan, Cheng Jiayang, Lin Qiu, Deqing Yang |
| 2024 | Bootstrapped Policy Learning for Task-oriented Dialogue through Goal Shaping. Yangyang Zhao, Ben Niu, Mehdi Dastani, Shihan Wang |
| 2024 | Breaking Language Barriers: Cross-Lingual Continual Pre-Training at Scale. Wenzhen Zheng, Wenbo Pan, Xu Xu, Libo Qin, Li Yue, Ming Zhou |
| 2024 | Breaking ReLU Barrier: Generalized MoEfication for Dense Pretrained Models. Jaeseong Lee, Seung-won Hwang, Wonpyo Park, Mingi Ji |
| 2024 | Breaking the Curse of Multilinguality with Cross-lingual Expert Language Models. Terra Blevins, Tomasz Limisiewicz, Suchin Gururangan, Margaret Li, Hila Gonen, Noah A. Smith, Luke Zettlemoyer |
| 2024 | Bridging Cultures in the Kitchen: A Framework and Benchmark for Cross-Cultural Recipe Retrieval. Tianyi Hu, Maria Maistro, Daniel Hershcovich |
| 2024 | Bridging Local Details and Global Context in Text-Attributed Graphs. Yaoke Wang, Yun Zhu, Wenqiao Zhang, Yueting Zhuang, Liyunfei, Siliang Tang |
| 2024 | Bridging Modalities: Enhancing Cross-Modality Hate Speech Detection with Few-Shot In-Context Learning. Ming Shan Hee, Aditi Kumaresan, Roy Ka-Wei Lee |
| 2024 | Building Resources for Emakhuwa: Machine Translation and News Classification Benchmarks. Felermino Dário Mário António Ali, Henrique Lopes Cardoso, Rui Sousa-Silva |
| 2024 | By My Eyes: Grounding Multimodal Large Language Models with Sensor Data via Visual Prompting. Hyungjun Yoon, Biniyam Aschalew Tolera, Taesik Gong, Kimin Lee, Sung-Ju Lee |
| 2024 | C-LLM: Learn to Check Chinese Spelling Errors Character by Character. Kunting Li, Yong Hu, Liang He, Fandong Meng, Jie Zhou |
| 2024 | C3PA: An Open Dataset of Expert-Annotated and Regulation-Aware Privacy Policies to Enable Scalable Regulatory Compliance Audits. Maaz Bin Musa, Steven M. Winston, Garrison Allen, Jacob Schiller, Kevin Moore, Sean Quick, Johnathan Melvin, Padmini Srinivasan, Mihailis Diamantis, Rishab Nithyanand |
| 2024 | CARER - ClinicAl Reasoning-Enhanced Representation for Temporal Health Risk Prediction. Tuan Nguyen, Thanh Trung Huynh, Minh Hieu Phan, Quoc Viet Hung Nguyen, Phi Le Nguyen |
| 2024 | CELLO: Causal Evaluation of Large Vision-Language Models. Meiqi Chen, Bo Peng, Yan Zhang, Chaochao Lu |
| 2024 | CHESS: Optimizing LLM Inference via Channel-Wise Thresholding and Selective Sparsification. Junhui He, Shangyu Wu, Weidong Wen, Chun Jason Xue, Qingan Li |
| 2024 | CHIQ: Contextual History Enhancement for Improving Query Rewriting in Conversational Search. Fengran Mo, Abbas Ghaddar, Kelong Mao, Mehdi Rezagholizadeh, Boxing Chen, Qun Liu, Jian-Yun Nie |
| 2024 | CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling. Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie C. K. Cheung |
| 2024 | CMD: a framework for Context-aware Model self-Detoxification. Zecheng Tang, Keyan Zhou, Juntao Li, Yuyang Ding, Pinzheng Wang, Yan Bowen, Renjie Hua, Min Zhang |
| 2024 | CMR Scaling Law: Predicting Critical Mixture Ratios for Continual Pre-training of Language Models. Jiawei Gu, Zacc Yang, Chuanghao Ding, Rui Zhao, Fei Tan |
| 2024 | CONTESTS: a Framework for Consistency Testing of Span Probabilities in Language Models. Eitan Wagner, Yuli Slavutsky, Omri Abend |
| 2024 | CSSL: Contrastive Self-Supervised Learning for Dependency Parsing on Relatively Free Word Ordered and Morphologically Rich Low Resource Languages. Pretam Ray, Jivnesh Sandhan, Amrith Krishna, Pawan Goyal |
| 2024 | CURE: Context- and Uncertainty-Aware Mental Disorder Detection. Migyeong Kang, Goun Choi, Hyolim Jeon, Ji Hyun An, Daejin Choi, Jinyoung Han |
| 2024 | CUTE: Measuring LLMs' Understanding of Their Tokens. Lukas Edman, Helmut Schmid, Alexander Fraser |
| 2024 | CaT-Bench: Benchmarking Language Model Understanding of Causal and Temporal Dependencies in Plans. Yash Kumar Lal, Vanya Cohen, Nathanael Chambers, Niranjan Balasubramanian, Raymond J. Mooney |
| 2024 | Calibrating Language Models with Adaptive Temperature Scaling. Johnathan Xie, Annie S. Chen, Yoonho Lee, Eric Mitchell, Chelsea Finn |
| 2024 | Calibrating the Confidence of Large Language Models by Eliciting Fidelity. Mozhi Zhang, Mianqiu Huang, Rundong Shi, Linsen Guo, Chong Peng, Peng Yan, Yaqian Zhou, Xipeng Qiu |
| 2024 | Can Active Label Correction Improve LLM-based Modular AI Systems? Karan Taneja, Ashok K. Goel |
| 2024 | Can Automatic Metrics Assess High-Quality Translations? Sweta Agrawal, António Farinhas, Ricardo Rei, André F. T. Martins |
| 2024 | Can LLM Generate Culturally Relevant Commonsense QA Data? Case Study in Indonesian and Sundanese. Rifki Afina Putri, Faiz Ghifari Haznitrama, Dea Adhista, Alice Oh |
| 2024 | Can LLMs Learn Uncertainty on Their Own? Expressing Uncertainty Effectively in A Self-Training Manner. Shudong Liu, Zhaocong Li, Xuebo Liu, Runzhe Zhan, Derek F. Wong, Lidia S. Chao, Min Zhang |
| 2024 | Can LLMs replace Neil deGrasse Tyson? Evaluating the Reliability of LLMs as Science Communicators. Prasoon Bajpai, Niladri Chatterjee, Subhabrata Dutta, Tanmoy Chakraborty |
| 2024 | Can Language Models Induce Grammatical Knowledge from Indirect Evidence? Miyu Oba, Yohei Oseki, Akiyo Fukatsu, Akari Haga, Hiroki Ouchi, Taro Watanabe, Saku Sugawara |
| 2024 | Can Large Language Models Always Solve Easy Problems if They Can Solve Harder Ones? Zhe Yang, Yichang Zhang, Tianyu Liu, Jian Yang, Junyang Lin, Chang Zhou, Zhifang Sui |
| 2024 | Can Large Language Models Enhance Predictions of Disease Progression? Investigating Through Disease Network Link Prediction. Haohui Lu, Usman Naseem |
| 2024 | Can Large Language Models Faithfully Express Their Intrinsic Uncertainty in Words? Gal Yona, Roee Aharoni, Mor Geva |
| 2024 | Can Large Language Models Learn Independent Causal Mechanisms? Gaël Gendron, Bao Trung Nguyen, Alex Yuxuan Peng, Michael J. Witbrock, Gillian Dobbie |
| 2024 | Can Transformers Learn n-gram Language Models? Anej Svete, Nadav Borenstein, Mike Zhou, Isabelle Augenstein, Ryan Cotterell |
| 2024 | Can We Trust the Performance Evaluation of Uncertainty Estimation Methods in Text Summarization? Jianfeng He, Runing Yang, Linlin Yu, Changbin Li, Ruoxi Jia, Feng Chen, Ming Jin, Chang-Tien Lu |
| 2024 | Can visual language models resolve textual ambiguity with visual cues? Let visual puns tell you! Jiwan Chung, Seungwon Lim, Jaehyun Jeon, Seungbeen Lee, Youngjae Yu |
| 2024 | CareCorpus+: Expanding and Augmenting Caregiver Strategy Data to Support Pediatric Rehabilitation. Shahla Farzana, Ivana Lucero, Vivian Villegas, Vera C. Kaelin, Mary A. Khetani, Natalie Parde |
| 2024 | Casablanca: Data and Models for Multidialectal Arabic Speech Recognition. Bashar Talafha, Karima Kadaoui, Samar Mohamed Magdy, Mariem Habiboullah, Chafei Mohamed Chafei, Ahmed Oumar El-Shangiti, Hiba Zayed, Mohamedou Cheikh Tourad, Rahaf Alhamouri, Rwaa Assi, Aisha Alraeesi, Hour Mohamed, Fakhraddin Alwajih, Abdelrahman Mohamed, Abdellah El Mekki, El Moatez Billah Nagoudi, Benelhadj Saadia, Hamzah A. Alsayadi, Walid Al-Dhabyani, Sara Shatnawi, Yasir Ech-Chammakhy, Amal Makouar, Yousra Berrachedi, Mustafa Jarrar, Shady Shehata, Ismail Berrada, Muhammad Abdul-Mageed |
| 2024 | CasiMedicos-Arg: A Medical Question Answering Dataset Annotated with Explanatory Argumentative Structures. Ekaterina Sviridova, Anar Yeginbergen, Ainara Estarrona, Elena Cabrio, Serena Villata, Rodrigo Agerri |
| 2024 | Chain and Causal Attention for Efficient Entity Tracking. Erwan Fagnou, Paul Caillon, Blaise Delattre, Alexandre Allauzen |
| 2024 | Chain-of-Dictionary Prompting Elicits Translation in Large Language Models. Hongyuan Lu, Haoran Yang, Haoyang Huang, Dongdong Zhang, Wai Lam, Furu Wei |
| 2024 | Chain-of-Note: Enhancing Robustness in Retrieval-Augmented Language Models. Wenhao Yu, Hongming Zhang, Xiaoman Pan, Peixin Cao, Kaixin Ma, Jian Li, Hongwei Wang, Dong Yu |
| 2024 | ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context. Victoria R. Li, Yida Chen, Naomi Saphra |
| 2024 | ChatRetriever: Adapting Large Language Models for Generalized and Robust Conversational Dense Retrieval. Kelong Mao, Chenlong Deng, Haonan Chen, Fengran Mo, Zheng Liu, Tetsuya Sakai, Zhicheng Dou |
| 2024 | CleanGen: Mitigating Backdoor Attacks for Generation Tasks in Large Language Models. Yuetai Li, Zhangchen Xu, Fengqing Jiang, Luyao Niu, Dinuka Sahabandu, Bhaskar Ramasubramanian, Radha Poovendran |
| 2024 | CliMedBench: A Large-Scale Chinese Benchmark for Evaluating Medical Large Language Models in Clinical Scenarios. Zetian Ouyang, Yishuai Qiu, Linlin Wang, Gerard de Melo, Ya Zhang, Yanfeng Wang, Liang He |
| 2024 | ClimRetrieve: A Benchmarking Dataset for Information Retrieval from Corporate Climate Disclosures. Tobias Schimanski, Jingwei Ni, Roberto Martín, Nicola Ranger, Markus Leippold |
| 2024 | Closing the Loop: Learning to Generate Writing Feedback via Language Model Simulated Student Revisions. Inderjeet Nair, Jiaye Tan, Xiaotian Su, Anne Gere, Xu Wang, Lu Wang |
| 2024 | Cluster-Norm for Unsupervised Probing of Knowledge. Walter Laurito, Sharan Maiya, Grégoire Dhimoïla, Owen Yeung, Kaarel Hänni |
| 2024 | Clustering and Ranking: Diversity-preserved Instruction Selection through Expert-aligned Quality Estimation. Yuan Ge, Yilun Liu, Chi Hu, Weibin Meng, Shimin Tao, Xiaofeng Zhao, Mahong Xia, Zhang Li, Boxing Chen, Hao Yang, Bei Li, Tong Xiao, Jingbo Zhu |
| 2024 | CmdCaliper: A Semantic-Aware Command-Line Embedding Model and Dataset for Security Research. Sian-Yao Huang, Cheng-Lin Yang, Che-Yu Lin, Chun-Ying Huang |
| 2024 | CoBa: Convergence Balancer for Multitask Finetuning of Large Language Models. Zi Gong, Hang Yu, Cong Liao, Bingchang Liu, Chaoyu Chen, Jianguo Li |
| 2024 | CoCoLoFa: A Dataset of News Comments with Common Logical Fallacies Written by LLM-Assisted Crowds. Min-Hsuan Yeh, Ruyuan Wan, Ting-Hao Huang |
| 2024 | CoCoST: Automatic Complex Code Generation with Online Searching and Correctness Testing. Xinyi He, Jiaru Zou, Yun Lin, Mengyu Zhou, Shi Han, Zejian Yuan, Dongmei Zhang |
| 2024 | CoEvol: Constructing Better Responses for Instruction Finetuning through Multi-Agent Cooperation. Renhao Li, Minghuan Tan, Derek F. Wong, Min Yang |
| 2024 | CoGen: Learning from Feedback with Coupled Comprehension and Generation. Mustafa Omer Gul, Yoav Artzi |
| 2024 | CoSafe: Evaluating Large Language Model Safety in Multi-Turn Dialogue Coreference. Erxin Yu, Jing Li, Ming Liao, Siqi Wang, Zuchen Gao, Fei Mi, Lanqing Hong |
| 2024 | CoTKR: Chain-of-Thought Enhanced Knowledge Rewriting for Complex Knowledge Graph Question Answering. Yike Wu, Yi Huang, Nan Hu, Yuncheng Hua, Guilin Qi, Jiaoyan Chen, Jeff Z. Pan |
| 2024 | Code Prompting Elicits Conditional Reasoning Abilities in Text+Code LLMs. Haritz Puerto, Martin Tutek, Somak Aditya, Xiaodan Zhu, Iryna Gurevych |
| 2024 | CodeAgent: Autonomous Communicative Agents for Code Review. Xunzhu Tang, Kisub Kim, Yewei Song, Cedric Lothritz, Bei Li, Saad Ezzini, Haoye Tian, Jacques Klein, Tegawendé F. Bissyandé |
| 2024 | CodeJudge: Evaluating Code Generation with Large Language Models. Weixi Tong, Tianyi Zhang |
| 2024 | Coffee-Gym: An Environment for Evaluating and Improving Natural Language Feedback on Erroneous Code. Hyungjoo Chae, Taeyoon Kwon, Seungjun Moon, Yongho Song, Dongjin Kang, Kai Tzu-iunn Ong, Beong-woo Kwak, Seonghyeon Bae, Seung-won Hwang, Jinyoung Yeo |
| 2024 | Collaborative Performance Prediction for Large Language Models. Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma |
| 2024 | Collective Critics for Creative Story Generation. Minwook Bae, Hyounghun Kim |
| 2024 | CommVQA: Situating Visual Question Answering in Communicative Contexts. Nandita Naik, Christopher Potts, Elisa Kreiss |
| 2024 | CommonIT: Commonality-Aware Instruction Tuning for Large Language Models via Data Partitions. Jun Rao, Xuebo Liu, Lian Lian, Shengjun Cheng, Yunjie Liao, Min Zhang |
| 2024 | Commonsense Knowledge Editing Based on Free-Text in LLMs. Xiusheng Huang, Yequan Wang, Jun Zhao, Kang Liu |
| 2024 | Communicating with Speakers and Listeners of Different Pragmatic Levels. Kata Naszádi, Frans A. Oliehoek, Christof Monz |
| 2024 | Community-Cross-Instruct: Unsupervised Instruction Generation for Aligning Large Language Models to Online Communities. Zihao He, Minh Duc Chu, Rebecca Dorn, Siyi Guo, Kristina Lerman |
| 2024 | CompAct: Compressing Retrieved Documents Actively for Question Answering. Chanwoong Yoon, Taewhoo Lee, Hyeon Hwang, Minbyul Jeong, Jaewoo Kang |
| 2024 | Comparing Neighbors Together Makes it Easy: Jointly Comparing Multiple Candidates for Efficient and Effective Retrieval. Jonghyun Song, Cheyon Jin, Wenlong Zhao, Andrew McCallum, Jay-Yoon Lee |
| 2024 | Comparing a BERT Classifier and a GPT classifier for Detecting Connective Language Across Multiple Social Media. Josephine Lukito, Bin Chen, Gina M. Masullo, Natalie Jomini Stroud |
| 2024 | Computational Meme Understanding: A Survey. Khoi P. N. Nguyen, Vincent Ng |
| 2024 | Concept Space Alignment in Multilingual LLMs. Qiwei Peng, Anders Søgaard |
| 2024 | Concept-skill Transferability-based Data Selection for Large Vision-Language Models. Jaewoo Lee, Boyang Li, Sung Ju Hwang |
| 2024 | Conditional and Modal Reasoning in Large Language Models. Wesley H. Holliday, Matthew Mandelkern, Cedegao Zhang |
| 2024 | Connecting the Dots: Evaluating Abstract Reasoning Capabilities of LLMs Using the New York Times Connections Word Game. Prisha Samadarshi, Mariam Mustafa, Anushka Kulkarni, Raven Rothkopf, Tuhin Chakrabarty, Smaranda Muresan |
| 2024 | Consecutive Batch Model Editing with HooK Layers. Shuaiyi Li, Yang Deng, Deng Cai, Hongyuan Lu, Liang Chen, Wai Lam |
| 2024 | Consistent Autoformalization for Constructing Mathematical Libraries. Lan Zhang, Xin Quan, André Freitas |
| 2024 | Consistent Bidirectional Language Modelling: Expressive Power and Representational Conciseness. Georgi Shopov, Stefan Gerdjikov |
| 2024 | Consolidating Ranking and Relevance Predictions of Large Language Models through Post-Processing. Le Yan, Zhen Qin, Honglei Zhuang, Rolf Jagerman, Xuanhui Wang, Michael Bendersky, Harrie Oosterhuis |
| 2024 | Context-Aware Adapter Tuning for Few-Shot Relation Learning in Knowledge Graphs. Liu Ran, Zhongzhou Liu, Xiaoli Li, Yuan Fang |
| 2024 | Context-Aware Assistant Selection for Improved Inference Acceleration with Large Language Models. Jerry Huang, Prasanna Parthasarathi, Mehdi Rezagholizadeh, Sarath Chandar |
| 2024 | Context-aware Watermark with Semantic Balanced Green-red Lists for Large Language Models. Yuxuan Guo, Zhiliang Tian, Yiping Song, Tianlun Liu, Liang Ding, Dongsheng Li |
| 2024 | Contextualized Sequence Likelihood: Enhanced Confidence Scores for Natural Language Generation. Zhen Lin, Shubhendu Trivedi, Jimeng Sun |
| 2024 | Continual Test-time Adaptation for End-to-end Speech Recognition on Noisy Speech. Guan-Ting Lin, Wei Huang, Hung-yi Lee |
| 2024 | Contrastive Entity Coreference and Disambiguation for Historical Texts. Abhishek Arora, Emily Silcock, Melissa Dell, Leander Heldring |
| 2024 | Contrastive Policy Gradient: Aligning LLMs on sequence-level scores in a supervised-friendly fashion. Yannis Flet-Berliac, Nathan Grinsztajn, Florian Strub, Eugene Choi, Bill Wu, Chris Cremer, Arash Ahmadian, Yash Chandak, Mohammad Gheshlaghi Azar, Olivier Pietquin, Matthieu Geist |
| 2024 | Contribution of Linguistic Typology to Universal Dependency Parsing: An Empirical Investigation. Ali Basirat, Navid Hemmati |
| 2024 | Control Large Language Models via Divide and Conquer. Bingxuan Li, Yiwei Wang, Tao Meng, Kai-Wei Chang, Nanyun Peng |
| 2024 | ControlMath: Controllable Data Generation Promotes Math Generalist Models. Nuo Chen, Ning Wu, Jianhui Chang, Linjun Shou, Jia Li |
| 2024 | Controllable Preference Optimization: Toward Controllable Multi-Objective Alignment. Yiju Guo, Ganqu Cui, Lifan Yuan, Ning Ding, Zexu Sun, Bowen Sun, Huimin Chen, Ruobing Xie, Jie Zhou, Yankai Lin, Zhiyuan Liu, Maosong Sun |
| 2024 | CopyBench: Measuring Literal and Non-Literal Reproduction of Copyright-Protected Text in Language Model Generation. Tong Chen, Akari Asai, Niloofar Mireshghallah, Sewon Min, James Grimmelmann, Yejin Choi, Hannaneh Hajishirzi, Luke Zettlemoyer, Pang Wei Koh |
| 2024 | CorrSynth - A Correlated Sampling Method for Diverse Dataset Generation from LLMs. Suhas S. Kowshik, Abhishek Divekar, Vijit Malik |
| 2024 | CoverICL: Selective Annotation for In-Context Learning via Active Graph Coverage. Costas Mavromatis, Balasubramaniam Srinivasan, Zhengyuan Shen, Jiani Zhang, Huzefa Rangwala, Christos Faloutsos, George Karypis |
| 2024 | Crafting Personalized Agents through Retrieval-Augmented Generation on Editable Memory Graphs. Zheng Wang, Zhongyang Li, Zeren Jiang, Dandan Tu, Wei Shi |
| 2024 | Cross-Domain Audio Deepfake Detection: Dataset and Analysis. Yuang Li, Min Zhang, Mengxin Ren, Xiaosong Qiao, Miaomiao Ma, Daimeng Wei, Hao Yang |
| 2024 | Cross-domain NER with Generated Task-Oriented Knowledge: An Empirical Study from Information Density Perspective. Zhihao Zhang, Sophia Yat Mei Lee, Junshuang Wu, Dong Zhang, Shoushan Li, Erik Cambria, Guodong Zhou |
| 2024 | Cross-lingual Back-Parsing: Utterance Synthesis from Meaning Representation for Zero-Resource Semantic Parsing. Deokhyung Kang, Seonjeong Hwang, Yunsu Kim, Gary Geunbae Lee |
| 2024 | Cross-lingual Transfer for Automatic Question Generation by Learning Interrogative Structures in Target Languages. Seonjeong Hwang, Yunsu Kim, Gary Geunbae Lee |
| 2024 | CryptoTrade: A Reflective LLM-based Agent to Guide Zero-shot Cryptocurrency Trading. Yuan Li, Bingqiao Luo, Qian Wang, Nuo Chen, Xu Liu, Bingsheng He |
| 2024 | Cultural Conditioning or Placebo? On the Effectiveness of Socio-Demographic Prompting. Sagnik Mukherjee, Muhammad Farid Adilazuarda, Sunayana Sitaram, Kalika Bali, Alham Fikri Aji, Monojit Choudhury |
| 2024 | Curriculum Consistency Learning for Conditional Sentence Generation. Liangxin Liu, Xuebo Liu, Lian Lian, Shengjun Cheng, Jun Rao, Tengfei Yu, Hexuan Deng, Min Zhang |
| 2024 | D2R: Dual-Branch Dynamic Routing Network for Multimodal Sentiment Detection. Yifan Chen, Kuntao Li, Weixing Mai, Qiaofeng Wu, Yun Xue, Fenghuan Li |
| 2024 | D3CODE: Disentangling Disagreements in Data across Cultures on Offensiveness Detection and Evaluation. Aida Mostafazadeh Davani, Mark Diaz, Dylan K. Baker, Vinodkumar Prabhakaran |
| 2024 | DA-Code: Agent Data Science Code Generation Benchmark for Large Language Models. Yiming Huang, Jianwen Luo, Yan Yu, Yitong Zhang, Fangyu Lei, Yifan Wei, Shizhu He, Lifu Huang, Xiao Liu, Jun Zhao, Kang Liu |
| 2024 | DAMRO: Dive into the Attention Mechanism of LVLM to Reduce Object Hallucination. Xuan Gong, Tianshi Ming, Xinpeng Wang, Zhihua Wei |
| 2024 | DA³: A Distribution-Aware Adversarial Attack against Language Models. Yibo Wang, Xiangjue Dong, James Caverlee, Philip S. Yu |
| 2024 | DC-Instruct: An Effective Framework for Generative Multi-intent Spoken Language Understanding. Bowen Xing, Lizi Liao, Minlie Huang, Ivor W. Tsang |
| 2024 | DECOR: Improving Coherence in L2 English Writing with a Novel Benchmark for Incoherence Detection, Reasoning, and Rewriting. Xuanming Zhang, Anthony Diaz, Zixun Chen, Qingyang Wu, Kun Qian, Erik Voss, Zhou Yu |
| 2024 | DEFT-UCS: Data Efficient Fine-Tuning for Pre-Trained Language Models via Unsupervised Core-Set Selection for Text-Editing. Devleena Das, Vivek Khetan |
| 2024 | DEM: Distribution Edited Model for Training with Mixed Data Distributions. Dhananjay Ram, Aditya Rawal, Momchil Hardalov, Nikolaos Pappas, Sheng Zha |
| 2024 | DGLF: A Dual Graph-based Learning Framework for Multi-modal Sarcasm Detection. Zhihong Zhu, Kefan Shen, Zhaorun Chen, Yunyan Zhang, Yuyan Chen, Xiaoqi Jiao, Zhongwei Wan, Shaorong Xie, Wei Liu, Xian Wu, Yefeng Zheng |
| 2024 | DISCERN: Decoding Systematic Errors in Natural Language for Text Classifiers. Rakesh R. Menon, Shashank Srivastava |
| 2024 | DKEC: Domain Knowledge Enhanced Multi-Label Classification for Diagnosis Prediction. Xueren Ge, Abhishek Satpathy, Ronald D. Williams, John A. Stankovic, Homa Alemzadeh |
| 2024 | DVD: Dynamic Contrastive Decoding for Knowledge Amplification in Multi-Document Question Answering. Jing Jin, Houfeng Wang, Hao Zhang, Xiaoguang Li, Zhijiang Guo |
| 2024 | Dancing in Chains: Reconciling Instruction Following and Faithfulness in Language Models. Zhengxuan Wu, Yuhao Zhang, Peng Qi, Yumo Xu, Rujun Han, Yian Zhang, Jifan Chen, Bonan Min, Zhiheng Huang |
| 2024 | Data Advisor: Dynamic Data Curation for Safety Alignment of Large Language Models. Fei Wang, Ninareh Mehrabi, Palash Goyal, Rahul Gupta, Kai-Wei Chang, Aram Galstyan |
| 2024 | Data Contamination Can Cross Language Barriers. Feng Yao, Yufan Zhuang, Zihao Sun, Sunan Xu, Animesh Kumar, Jingbo Shang |
| 2024 | Data, Data Everywhere: A Guide for Pretraining Dataset Construction. Jupinder Parmar, Shrimai Prabhumoye, Joseph Jennings, Bo Liu, Aastha Jhunjhunwala, Zhilin Wang, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro |
| 2024 | DataNarrative: Automated Data-Driven Storytelling with Visualizations and Texts. Mohammed Saidul Islam, Md. Tahmid Rahman Laskar, Md. Rizwan Parvez, Enamul Hoque, Shafiq Joty |
| 2024 | DataTales: A Benchmark for Real-World Intelligent Data Narration. Yajing Yang, Qian Liu, Min-Yen Kan |
| 2024 | De-Identification of Sensitive Personal Data in Datasets Derived from IIT-CDIP. Stefan Larson, Nicole Lima, Santiago Diaz, Amogh Manoj Joshi, Siddharth Betala, Jamiu Suleiman, Yash Mathur, Kaushal Prajapati, Ramla Alakraa, Junjie Shen, Temi Okotore, Kevin Leach |
| 2024 | DeMPT: Decoding-enhanced Multi-phase Prompt Tuning for Making LLMs Be Better Context-aware Translators. Xinglin Lyu, Junhui Li, Yanqing Zhao, Min Zhang, Daimeng Wei, Shimin Tao, Hao Yang, Min Zhang |
| 2024 | Deciphering Cognitive Distortions in Patient-Doctor Mental Health Conversations: A Multimodal LLM-Based Detection and Reasoning Framework. Gopendra Vikram Singh, Sai Vemulapalli, Mauajama Firdaus, Asif Ekbal |
| 2024 | Deciphering Rumors: A Multi-Task Learning Approach with Intent-aware Hierarchical Contrastive Learning. Chang Yang, Peng Zhang, Hui Gao, Jing Zhang |
| 2024 | Deciphering the Interplay of Parametric and Non-parametric Memory in Retrieval-augmented Language Models. Mehrdad Farahani, Richard Johansson |
| 2024 | Decoding Matters: Addressing Amplification Bias and Homogeneity Issue in Recommendations for Large Language Models. Keqin Bao, Jizhi Zhang, Yang Zhang, Xinyue Huo, Chong Chen, Fuli Feng |
| 2024 | Decoding Susceptibility: Modeling Misbelief to Misinformation Through a Computational Approach. Yanchen Liu, Mingyu Derek Ma, Wenna Qin, Azure Zhou, Jiaao Chen, Weiyan Shi, Wei Wang, Diyi Yang |
| 2024 | Decoding the Echoes of Vision from fMRI: Memory Disentangling for Past Semantic Information. Runze Xia, Congchi Yin, Piji Li |
| 2024 | Decoding with Limited Teacher Supervision Requires Understanding When to Trust the Teacher. Hyunjong Ok, Jegwang Ryu, Jaeho Lee |
| 2024 | Decompose and Compare Consistency: Measuring VLMs' Answer Reliability via Task-Decomposition Consistency Comparison. Qian Yang, Weixiang Yan, Aishwarya Agrawal |
| 2024 | DecorateLM: Data Engineering through Corpus Rating, Tagging, and Editing with Language Models. Ranchi Zhao, Zhen Leng Thai, Yifan Zhang, Shengding Hu, Jie Zhou, Yunqi Ba, Jie Cai, Zhiyuan Liu, Maosong Sun |
| 2024 | Defending Against Social Engineering Attacks in the Age of LLMs. Lin Ai, Tharindu Kumarage, Amrita Bhattacharjee, Zizhou Liu, Zheng Hui, Michael Davinroy, James Cook, Laura Cassani, Kirill Trapeznikov, Matthias Kirchner, Arslan Basharat, Anthony Hoogs, Joshua Garland, Huan Liu, Julia Hirschberg |
| 2024 | Defending Jailbreak Prompts via In-Context Adversarial Game. Yujun Zhou, Yufei Han, Haomin Zhuang, Kehan Guo, Zhenwen Liang, Hongyan Bao, Xiangliang Zhang |
| 2024 | Defining Knowledge: Bridging Epistemology and Large Language Models. Constanza Fierro, Ruchira Dhar, Filippos Stamatiou, Nicolas Garneau, Anders Søgaard |
| 2024 | Delving into Qualitative Implications of Synthetic Data for Hate Speech Detection. Camilla Casula, Sebastiano Vecellio Salto, Alan Ramponi, Sara Tonelli |
| 2024 | Democratizing Large Language Models via Personalized Parameter-Efficient Fine-tuning. Zhaoxuan Tan, Qingkai Zeng, Yijun Tian, Zheyuan Liu, Bing Yin, Meng Jiang |
| 2024 | Demystifying Verbatim Memorization in Large Language Models. Jing Huang, Diyi Yang, Christopher Potts |
| 2024 | Dense X Retrieval: What Retrieval Granularity Should We Use? Tong Chen, Hongwei Wang, Sihao Chen, Wenhao Yu, Kaixin Ma, Xinran Zhao, Hongming Zhang, Dong Yu |
| 2024 | Dependency Graph Parsing as Sequence Labeling. Ana Ezquerro, David Vilares, Carlos Gómez-Rodríguez |
| 2024 | Detecting Errors through Ensembling Prompts (DEEP): An End-to-End LLM Framework for Detecting Factual Errors. Alex Chandler, Devesh Surve, Hui Su |
| 2024 | Detecting Online Community Practices with Large Language Models: A Case Study of Pro-Ukrainian Publics on Twitter. Kateryna Kasianenko, Shima Khanehzar, Stephen Wan, Ehsan Dehghan, Axel Bruns |
| 2024 | Detecting Subtle Differences between Human and Model Languages Using Spectrum of Relative Likelihood. Yang Xu, Yu Wang, Hao An, Zhichen Liu, Yongyuan Li |
| 2024 | Detection and Measurement of Syntactic Templates in Generated Text. Chantal Shaib, Yanai Elazar, Junyi Jessy Li, Byron C. Wallace |
| 2024 | DetoxLLM: A Framework for Detoxification with Explanations. Md. Tawkat Islam Khondaker, Muhammad Abdul-Mageed, Laks V. S. Lakshmanan |
| 2024 | Development of Cognitive Intelligence in Pre-trained Language Models. Raj Sanjay Shah, Khushi Bhardwaj, Sashank Varma |
| 2024 | DiVERT: Distractor Generation with Variational Errors Represented as Text for Math Multiple-choice Questions. Nigel Fernandez, Alexander Scarlatos, Wanyong Feng, Simon Woodhead, Andrew S. Lan |
| 2024 | Dialog2Flow: Pre-training Soft-Contrastive Action-Driven Sentence Embeddings for Automatic Dialog Flow Extraction. Sergio Burdisso, Srikanth R. Madikeri, Petr Motlícek |
| 2024 | Direct Multi-Turn Preference Optimization for Language Agents. Wentao Shi, Mengqi Yuan, Junkang Wu, Qifan Wang, Fuli Feng |
| 2024 | Discovering Biases in Information Retrieval Models Using Relevance Thesaurus as Global Explanation. Youngwoo Kim, Razieh Rahimi, James Allan |
| 2024 | Discovering Knowledge-Critical Subnetworks in Pretrained Language Models. Deniz Bayazit, Negar Foroutan, Zeming Chen, Gail Weiss, Antoine Bosselut |
| 2024 | Dissecting Fine-Tuning Unlearning in Large Language Models. Yihuai Hong, Yuelin Zou, Lijie Hu, Ziqian Zeng, Di Wang, Haiqin Yang |
| 2024 | Distilling Knowledge from Text-to-Image Generative Models Improves Visio-Linguistic Reasoning in CLIP. Samyadeep Basu, Shell Xu Hu, Maziar Sanjabi, Daniela Massiceti, Soheil Feizi |
| 2024 | Distract Large Language Models for Automatic Jailbreak Attack. Zeguan Xiao, Yan Yang, Guanhua Chen, Yun Chen |
| 2024 | Distractor Generation in Multiple-Choice Tasks: A Survey of Methods, Datasets, and Evaluation. Elaf Alhazmi, Quan Sheng, Wei Emma Zhang, Munazza Zaib, Ahoud Alhazmi |
| 2024 | Distributional Properties of Subword Regularization. Marco Cognetta, Vilém Zouhar, Naoaki Okazaki |
| 2024 | Diversity Over Size: On the Effect of Sample and Topic Sizes for Topic-Dependent Argument Mining Datasets. Benjamin Schiller, Johannes Daxenberger, Andreas Waldis, Iryna Gurevych |
| 2024 | Divide and Conquer Radiology Report Generation via Observation Level Fine-grained Pretraining and Prompt Tuning. Yuanpin Zhou, Huogen Wang |
| 2024 | Do LLMs Know to Respect Copyright Notice? Jialiang Xu, Shenglan Li, Zhaozhuo Xu, Denghui Zhang |
| 2024 | Do LLMs Overcome Shortcut Learning? An Evaluation of Shortcut Challenges in Large Language Models. Yu Yuan, Lili Zhao, Kai Zhang, Guangting Zheng, Qi Liu |
| 2024 | Do LLMs Plan Like Human Writers? Comparing Journalist Coverage of Press Releases with LLMs. Alexander Spangher, Nanyun Peng, Sebastian Gehrmann, Mark Dredze |
| 2024 | Do LLMs learn a true syntactic universal? John T. Hale, Milos Stanojevic |
| 2024 | Do LLMs suffer from Multi-Party Hangover? A Diagnostic Approach to Addressee Recognition and Response Selection in Conversations. Nicolò Penzo, Maryam Sajedinia, Bruno Lepri, Sara Tonelli, Marco Guerini |
| 2024 | Do Large Language Models Know How Much They Know? Gabriele Prato, Jerry Huang, Prasanna Parthasarathi, Shagun Sodhani, Sarath Chandar |
| 2024 | Do Text-to-Vis Benchmarks Test Real Use of Visualisations? Hy Nguyen, Xuefei He, Andrew Reeson, Cécile Paris, Josiah Poon, Jonathan K. Kummerfeld |
| 2024 | Do We Need Language-Specific Fact-Checking Models? The Case of Chinese. Caiqi Zhang, Zhijiang Guo, Andreas Vlachos |
| 2024 | Do You Know What You Are Talking About? Characterizing Query-Knowledge Relevance For Reliable Retrieval Augmented Generation. Zhuohang Li, Jiaxin Zhang, Chao Yan, Kamalika Das, Kumar Sricharan, Murat Kantarcioglu, Bradley A. Malin |
| 2024 | Do great minds think alike? Investigating Human-AI Complementarity in Question Answering with CAIMIRA. Maharshi Gor, Hal Daumé III, Tianyi Zhou, Jordan L. Boyd-Graber |
| 2024 | DocCGen: Document-based Controlled Code Generation. Sameer Pimparkhede, Mehant Kammakomati, Srikanth Tamilselvam, Prince Kumar, Ashok Pon Kumar, Pushpak Bhattacharyya |
| 2024 | DocEdit-v2: Document Structure Editing Via Multimodal LLM Grounding. Manan Suri, Puneet Mathur, Franck Dernoncourt, Rajiv Jain, Vlad I. Morariu, Ramit Sawhney, Preslav Nakov, Dinesh Manocha |
| 2024 | DocHieNet: A Large and Diverse Dataset for Document Hierarchy Parsing. Hangdi Xing, Changxu Cheng, Feiyu Gao, Zirui Shao, Zhi Yu, Jiajun Bu, Qi Zheng, Cong Yao |
| 2024 | DocKD: Knowledge Distillation from LLMs for Open-World Document Understanding Models. Sungnyun Kim, Haofu Liao, Srikar Appalaraju, Peng Tang, Zhuowen Tu, Ravi Kumar Satzoda, R. Manmatha, Vijay Mahadevan, Stefano Soatto |
| 2024 | Does Fine-Tuning LLMs on New Knowledge Encourage Hallucinations? Zorik Gekhman, Gal Yona, Roee Aharoni, Matan Eyal, Amir Feder, Roi Reichart, Jonathan Herzig |
| 2024 | Does Large Language Model Contain Task-Specific Neurons? Ran Song, Shizhu He, Shuting Jiang, Yantuan Xian, Shengxiang Gao, Kang Liu, Zhengtao Yu |
| 2024 | Does Object Grounding Really Reduce Hallucination of Large Vision-Language Models? Gregor Geigle, Radu Timofte, Goran Glavas |
| 2024 | DogeRM: Equipping Reward Models with Domain Knowledge through Model Merging. Tzu-Han Lin, Chen-An Li, Hung-yi Lee, Yun-Nung Chen |
| 2024 | Domain adapted machine translation: What does catastrophic forgetting forget and why? Danielle Saunders, Steve DeNeefe |
| 2024 | Don't Forget Your Reward Values: Language Model Alignment via Value-based Calibration. Xin Mao, Feng-Lin Li, Huimin Xu, Wei Zhang, Wang Chen, Anh Tuan Luu |
| 2024 | Don't Just Say "I don't know"! Self-aligning Large Language Models for Responding to Unknown Questions with Explanations. Yang Deng, Yong Zhao, Moxin Li, See-Kiong Ng, Tat-Seng Chua |
| 2024 | Dual-Space Knowledge Distillation for Large Language Models. Songming Zhang, Xue Zhang, Zengkui Sun, Yufeng Chen, Jinan Xu |
| 2024 | Dual-oriented Disentangled Network with Counterfactual Intervention for Multimodal Intent Detection. Zhanpeng Chen, Zhihong Zhu, Xianwei Zhuang, Zhiqi Huang, Yuexian Zou |
| 2024 | DyVo: Dynamic Vocabularies for Learned Sparse Retrieval with Entities. Thong Nguyen, Shubham Chatterjee, Sean MacAvaney, Iain Mackie, Jeff Dalton, Andrew Yates |
| 2024 | DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models. Jiabao Pan, Yan Zhang, Chen Zhang, Zuozhu Liu, Hongwei Wang, Haizhou Li |
| 2024 | Dynamic Multi-Reward Weighting for Multi-Style Controllable Generation. Karin de Langis, Ryan Koo, Dongyeop Kang |
| 2024 | Dynamic Multi-granularity Attribution Network for Aspect-based Sentiment Analysis. Yanjiang Chen, Kai Zhang, Feng Hu, Xianquan Wang, Ruikang Li, Qi Liu |
| 2024 | Dynamic Rewarding with Prompt Optimization Enables Tuning-free Self-Alignment of Language Models. Somanshu Singla, Zhen Wang, Tianyang Liu, Abdullah Ashfaq, Zhiting Hu, Eric P. Xing |
| 2024 | DynamicER: Resolving Emerging Mentions to Dynamic Entities for RAG. Jinyoung Kim, Dayoon Ko, Gunhee Kim |
| 2024 | EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees. Yuhui Li, Fangyun Wei, Chao Zhang, Hongyang Zhang |
| 2024 | ECCO: Can We Improve Model-Generated Code Efficiency Without Sacrificing Functional Correctness? Siddhant Waghjale, Vishruth Veerendranath, Zhiruo Wang, Daniel Fried |
| 2024 | ECIS-VQG: Generation of Entity-centric Information-seeking Questions from Videos. Arpan Phukan, Manish Gupta, Asif Ekbal |
| 2024 | ECON: On the Detection and Resolution of Evidence Conflicts. Cheng Jiayang, Chunkit Chan, Qianqian Zhuang, Lin Qiu, Tianhang Zhang, Tengxiao Liu, Yangqiu Song, Yue Zhang, Pengfei Liu, Zheng Zhang |
| 2024 | EFUF: Efficient Fine-Grained Unlearning Framework for Mitigating Hallucinations in Multimodal Large Language Models. Shangyu Xing, Fei Zhao, Zhen Wu, Tuo An, Weihao Chen, Chunhui Li, Jianbing Zhang, Xinyu Dai |
| 2024 | EH-MAM: Easy-to-Hard Masked Acoustic Modeling for Self-Supervised Speech Representation Learning. Ashish Seth, Ramaneswaran Selvakumar, S. Sakshi, Sonal Kumar, Sreyan Ghosh, Dinesh Manocha |
| 2024 | EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records. Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Jieyu Zhang, Hang Wu, Yuanda Zhu, Joyce C. Ho, Carl Yang, May Dongmei Wang |
| 2024 | EPO: Hierarchical LLM Agents with Environment Preference Optimization. Qi Zhao, Haotian Fu, Chen Sun, George Konidaris |
| 2024 | ERVQA: A Dataset to Benchmark the Readiness of Large Vision Language Models in Hospital Environments. Sourjyadip Ray, Kushal Gupta, Soumi Kundu, Payal Arvind Kasat, Somak Aditya, Pawan Goyal |
| 2024 | ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models. Haiquan Zhao, Lingyu Li, Shisong Chen, Shuqi Kong, Jiaan Wang, Kexin Huang, Tianle Gu, Yixu Wang, Jian Wang, Dandan Liang, Zhixu Li, Yan Teng, Yanghua Xiao, Yingchun Wang |
| 2024 | ESC: Efficient Speech Coding with Cross-Scale Residual Vector Quantized Transformers. Yuzhe Gu, Enmao Diao |
| 2024 | EVEDIT: Event-based Knowledge Editing for Deterministic Knowledge Propagation. Jiateng Liu, Pengfei Yu, Yuji Zhang, Sha Li, Zixuan Zhang, Ruhi Sarikaya, Kevin Small, Heng Ji |
| 2024 | EXPLORA: Efficient Exemplar Subset Selection for Complex Reasoning. Kiran Purohit, Venktesh V, Raghuram Devalla, Krishna Yerragorla, Sourangshu Bhattacharya, Avishek Anand |
| 2024 | Effective Demonstration Annotation for In-Context Learning via Language Model-Based Determinantal Point Process. Peng Wang, Xiaobin Wang, Chao Lou, Shengyu Mao, Pengjun Xie, Yong Jiang |
| 2024 | Effective Synthetic Data and Test-Time Adaptation for OCR Correction. Shuhao Guan, Cheng Xu, Moule Lin, Derek Greene |
| 2024 | Efficient LLM Comparative Assessment: A Product of Experts Framework for Pairwise Comparisons. Adian Liusie, Vatsal Raina, Yassir Fathullah, Mark J. F. Gales |
| 2024 | Efficient Overshadowed Entity Disambiguation by Mitigating Shortcut Learning. Panuthep Tasawong, Peerat Limkonchotiwat, Potsawee Manakul, Can Udomcharoenchaikit, Ekapol Chuangsuwanich, Sarana Nutanong |
| 2024 | Efficient Performance Tracking: Leveraging Large Language Models for Automated Construction of Scientific Leaderboards. Furkan Sahinuç, Thy Thy Tran, Yulia Grishina, Yufang Hou, Bei Chen, Iryna Gurevych |
| 2024 | Efficient Sequential Decision Making with Large Language Models. Dingyang Chen, Qi Zhang, Yinglun Zhu |
| 2024 | Efficient Temporal Extrapolation of Multimodal Large Language Models with Temporal Grounding Bridge. Yuxuan Wang, Yueqian Wang, Pengfei Wu, Jianxin Liang, Dongyan Zhao, Yang Liu, Zilong Zheng |
| 2024 | Efficient Unseen Language Adaptation for Multilingual Pre-Trained Language Models. Po-Heng Chen, Yun-Nung Chen |
| 2024 | Efficient Vision-Language pre-training via domain-specific learning for human activities. Adrian Bulat, Yassine Ouali, Ricardo Guerrero, Brais Martínez, Georgios Tzimiropoulos |
| 2024 | EfficientRAG: Efficient Retriever for Multi-Hop Question Answering. Ziyuan Zhuang, Zhiyang Zhang, Sitao Cheng, Fangkai Yang, Jia Liu, Shujian Huang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang |
| 2024 | Eliciting In-Context Learning in Vision-Language Models for Videos Through Curated Data Distributional Properties. Keunwoo Peter Yu, Zheyuan Zhang, Fengyuan Hu, Shane Storks, Joyce Chai |
| 2024 | Eliminating Biased Length Reliance of Direct Preference Optimization via Down-Sampled KL Divergence. Junru Lu, Jiazheng Li, Siyu An, Meng Zhao, Yulan He, Di Yin, Xing Sun |
| 2024 | Embedded Named Entity Recognition using Probing Classifiers. Nicholas Popovic, Michael Färber |
| 2024 | Embedding and Gradient Say Wrong: A White-Box Method for Hallucination Detection. Xiaomeng Hu, Yiming Zhang, Ru Peng, Haozhe Zhang, Chenwei Wu, Gang Chen, Junbo Zhao |
| 2024 | EmoKnob: Enhance Voice Cloning with Fine-Grained Emotion Control. Haozhe Chen, Run Chen, Julia Hirschberg |
| 2024 | Emotion Granularity from Text: An Aggregate-Level Indicator of Mental Health. Krishnapriya Vishnubhotla, Daniela Teodorescu, Mallory J. Feldman, Kristen A. Lindquist, Saif M. Mohammad |
| 2024 | EmphAssess : a Prosodic Benchmark on Assessing Emphasis Transfer in Speech-to-Speech Models. Maureen de Seyssel, Antony D'Avirro, Adina Williams, Emmanuel Dupoux |
| 2024 | Empowering Backbone Models for Visual Text Generation with Input Granularity Control and Glyph-Aware Training. Wenbo Li, Guohao Li, Zhibin Lan, Xue Xu, Wanru Zhuang, Jiachen Liu, Xinyan Xiao, Jinsong Su |
| 2024 | Empowering Large Language Model for Continual Video Question Answering with Collaborative Prompting. Chen Cai, Zheng Wang, Jianjun Gao, Wenyang Liu, Ye Lu, Runzhong Zhang, Kim-Hui Yap |
| 2024 | Empowering Multi-step Reasoning across Languages via Program-Aided Language Models. Leonardo Ranaldi, Giulia Pucci, Barry Haddow, Alexandra Birch |
| 2024 | Encoding Spreadsheets for Large Language Models. Haoyu Dong, Jianbo Zhao, Yuzhang Tian, Junyu Xiong, Mengyu Zhou, Yun Lin, José Cambronero, Yeye He, Shi Han, Dongmei Zhang |
| 2024 | Encoding and Controlling Global Semantics for Long-form Video Question Answering. Thong Nguyen, Zhiyuan Hu, Xiaobao Wu, Cong-Duy Nguyen, See-Kiong Ng, Anh Tuan Luu |
| 2024 | Encourage or Inhibit Monosemanticity? Revisit Monosemanticity from a Feature Decorrelation Perspective. Hanqi Yan, Yanzheng Xiang, Guangyi Chen, Yifei Wang, Lin Gui, Yulan He |
| 2024 | Encouraging Divergent Thinking in Large Language Models through Multi-Agent Debate. Tian Liang, Zhiwei He, Wenxiang Jiao, Xing Wang, Yan Wang, Rui Wang, Yujiu Yang, Shuming Shi, Zhaopeng Tu |
| 2024 | Enhanced Hallucination Detection in Neural Machine Translation through Simple Detector Aggregation. Anas Himmi, Guillaume Staerman, Marine Picot, Pierre Colombo, Nuno Miguel Guerreiro |
| 2024 | Enhancing AI Assisted Writing with One-Shot Implicit Negative Feedback. Benjamin Towle, Ke Zhou |
| 2024 | Enhancing Advanced Visual Reasoning Ability of Large Language Models. Zhiyuan Li, Dongnan Liu, Chaoyi Zhang, Heng Wang, Tengfei Xue, Weidong Cai |
| 2024 | Enhancing Data Quality through Simple De-duplication: Navigating Responsible Computational Social Science Research. Yida Mu, Mali Jin, Xingyi Song, Nikolaos Aletras |
| 2024 | Enhancing High-order Interaction Awareness in LLM-based Recommender Model. Xinfeng Wang, Jin Cui, Fumiyo Fukumoto, Yoshimi Suzuki |
| 2024 | Enhancing Language Model Alignment: A Confidence-Based Approach to Label Smoothing. Baihe Huang, Hiteshi Sharma, Yi Mao |
| 2024 | Enhancing Language Model Factuality via Activation-Based Confidence Calibration and Guided Decoding. Xin Liu, Farima Fatahi Bayat, Lu Wang |
| 2024 | Enhancing Legal Case Retrieval via Scaling High-quality Synthetic Query-Candidate Pairs. Cheng Gao, Chaojun Xiao, Zhenghao Liu, Huimin Chen, Zhiyuan Liu, Maosong Sun |
| 2024 | Enhancing Post-Hoc Attributions in Long Document Comprehension via Coarse Grained Answer Decomposition. Pritika Ramu, Koustava Goswami, Apoorv Saxena, Balaji Vasan Srinivasan |
| 2024 | Enhancing Pre-Trained Generative Language Models with Question Attended Span Extraction on Machine Reading Comprehension. Lin Ai, Zheng Hui, Zizhou Liu, Julia Hirschberg |
| 2024 | Enhancing Reinforcement Learning with Dense Rewards from Language Model Critic. Meng Cao, Lei Shu, Lei Yu, Yun Zhu, Nevan Wichers, Yinxiao Liu, Lei Meng |
| 2024 | Enhancing Systematic Decompositional Natural Language Inference Using Informal Logic. Nathaniel Weir, Kate Sanders, Orion Weller, Shreya Sharma, Dongwei Jiang, Zhengping Jiang, Bhavana Dalvi Mishra, Oyvind Tafjord, Peter A. Jansen, Peter Clark, Benjamin Van Durme |
| 2024 | Enhancing Training Data Attribution for Large Language Models with Fitting Error Consideration. Kangxi Wu, Liang Pang, Huawei Shen, Xueqi Cheng |
| 2024 | Entity Insertion in Multilingual Linked Corpora: The Case of Wikipedia. Tomás Feith, Akhil Arora, Martin Gerlach, Debjit Paul, Robert West |
| 2024 | Error Analysis of Multilingual Language Models in Machine Translation: A Case Study of English-Amharic Translation. Hizkiel Alemayehu, Hamada M. Zahera, Axel-Cyrille Ngonga Ngomo |
| 2024 | Estimating Knowledge in Large Language Models Without Generating a Single Token. Daniela Gottesman, Mor Geva |
| 2024 | Evaluating Character Understanding of Large Language Models via Character Profiling from Fictional Works. Xinfeng Yuan, Siyu Yuan, Yuhan Cui, Tianhe Lin, Xintao Wang, Rui Xu, Jiangjie Chen, Deqing Yang |
| 2024 | Evaluating Concurrent Robustness of Language Models Across Diverse Challenge Sets. Vatsal Gupta, Pranshu Pandya, Tushar Kataria, Vivek Gupta, Dan Roth |
| 2024 | Evaluating D-MERIT of Partial-annotation on Information Retrieval. Royi Rassin, Yaron Fairstein, Oren Kalinsky, Guy Kushilevitz, Nachshon Cohen, Alexander Libov, Yoav Goldberg |
| 2024 | Evaluating Diversity in Automatic Poetry Generation. Yanran Chen, Hannes Gröner, Sina Zarrieß, Steffen Eger |
| 2024 | Evaluating LLMs for Targeted Concept Simplification for Domain-Specific Texts. Sumit Asthana, Hannah Rashkin, Elizabeth Clark, Fantine Huot, Mirella Lapata |
| 2024 | Evaluating Large Language Models along Dimensions of Language Variation: A Systematik Invesdigatiom uv Cross-lingual Generalization. Niyati Bafna, Kenton Murray, David Yarowsky |
| 2024 | Evaluating Large Language Models on Time Series Feature Understanding: A Comprehensive Taxonomy and Benchmark. Elizabeth Fons, Rachneet Kaur, Soham Palande, Zhen Zeng, Tucker Balch, Manuela Veloso, Svitlana Vyetrenko |
| 2024 | Evaluating Large Language Models via Linguistic Profiling. Alessio Miaschi, Felice Dell'Orletta, Giulia Venturi |
| 2024 | Evaluating Psychological Safety of Large Language Models. Xingxuan Li, Yutong Li, Lin Qiu, Shafiq Joty, Lidong Bing |
| 2024 | Evaluating Readability and Faithfulness of Concept-based Explanations. Meng Li, Haoran Jin, Ruixuan Huang, Zhihao Xu, Defu Lian, Zijia Lin, Di Zhang, Xiting Wang |
| 2024 | Evaluating Short-Term Temporal Fluctuations of Social Biases in Social Media Data and Masked Language Models. Yi Zhou, Danushka Bollegala, José Camacho-Collados |
| 2024 | Evaluating n-Gram Novelty of Language Models Using Rusty-DAWG. William Merrill, Noah A. Smith, Yanai Elazar |
| 2024 | Evaluating the Effectiveness of Large Language Models in Establishing Conversational Grounding. Biswesh Mohapatra, Manav Nitin Kapadnis, Laurent Romary, Justine Cassell |
| 2024 | Evaluating the Instruction-Following Robustness of Large Language Models to Prompt Injection. Zekun Li, Baolin Peng, Pengcheng He, Xifeng Yan |
| 2024 | Event Causality Identification with Synthetic Control. Haoyu Wang, Fengze Liu, Jiayao Zhang, Dan Roth, Kyle Richardson |
| 2024 | Evidence-Focused Fact Summarization for Knowledge-Augmented Zero-Shot Question Answering. Sungho Ko, Hyunjin Cho, Hyungjoo Chae, Jinyoung Yeo, Dongha Lee |
| 2024 | Experimental Contexts Can Facilitate Robust Semantic Property Inference in Language Models, but Inconsistently. Kanishka Misra, Allyson Ettinger, Kyle Mahowald |
| 2024 | Explaining and Improving Contrastive Decoding by Extrapolating the Probabilities of a Huge and Hypothetical LM. Haw-Shiuan Chang, Nanyun Peng, Mohit Bansal, Anil Ramakrishna, Tagyoung Chung |
| 2024 | Explicit Memory Learning with Expectation Maximization. Zhangyue Yin, Qiushi Sun, Qipeng Guo, Zhiyuan Zeng, Qinyuan Cheng, Xipeng Qiu, Xuanjing Huang |
| 2024 | Explicit, Implicit, and Scattered: Revisiting Event Extraction to Capture Complex Arguments. Omar Sharif, Joseph Gatto, Madhusudan Basak, Sarah Masud Preum |
| 2024 | Exploring Intra and Inter-language Consistency in Embeddings with ICA. Rongzhi Li, Takeru Matsuda, Hitomi Yanaka |
| 2024 | Exploring Intrinsic Language-specific Subspaces in Fine-tuning Multilingual Neural Machine Translation. Zhe Cao, Zhi Qu, Hidetaka Kamigaito, Taro Watanabe |
| 2024 | Exploring Nested Named Entity Recognition with Large Language Models: Methods, Challenges, and Insights. Hongjin Kim, Jai-Eun Kim, Harksoo Kim |
| 2024 | Exploring Space Efficiency in a Tree-based Linear Model for Extreme Multi-label Classification. He-Zhe Lin, Cheng-Hung Liu, Chih-Jen Lin |
| 2024 | Exploring Union and Intersection of Visual Regions for Generating Questions, Answers, and Distractors. Wenjian Ding, Yao Zhang, Jun Wang, Adam Jatowt, Zhenglu Yang |
| 2024 | Exploring the Compositional Deficiency of Large Language Models in Mathematical Reasoning Through Trap Problems. Jun Zhao, Jingqi Tong, Yurong Mou, Ming Zhang, Qi Zhang, Xuanjing Huang |
| 2024 | Exploring the Learning Capabilities of Language Models using LEVERWORLDS. Eitan Wagner, Amir Feder, Omri Abend |
| 2024 | Exploring the Practicality of Generative Retrieval on Dynamic Corpora. Chaeeun Kim, Soyoung Yoon, Hyunji Lee, Joel Jang, Sohee Yang, Minjoon Seo |
| 2024 | Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models. Zi'ou Zheng, Christopher Malon, Martin Renqiang Min, Xiaodan Zhu |
| 2024 | Extending Context Window of Large Language Models from a Distributional Perspective. Yingsheng Wu, Yuxuan Gu, Xiaocheng Feng, Weihong Zhong, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin |
| 2024 | External Knowledge-Driven Argument Mining: Leveraging Attention-Enhanced Multi-Network Models. Debela Gemechu, Chris Reed |
| 2024 | Extract, Define, Canonicalize: An LLM-based Framework for Knowledge Graph Construction. Bowen Zhang, Harold Soh |
| 2024 | Extracting Prompts by Inverting LLM Outputs. Collin Zhang, John X. Morris, Vitaly Shmatikov |
| 2024 | Eyes Don't Lie: Subjective Hate Annotation and Detection with Gaze. Özge Alaçam, Sanne Hoeken, Sina Zarrieß |
| 2024 | FAC²E: Better Understanding Large Language Model Capabilities by Dissociating Language and Cognition. Xiaoqiang Wang, Lingfei Wu, Tengfei Ma, Bang Liu |
| 2024 | FAME: Towards Factual Multi-Task Model Editing. Zeng Li, Yingyu Shan, Zeming Liu, Jiashu Yao, Yuhang Guo |
| 2024 | FEDKIM: Adaptive Federated Knowledge Injection into Medical Foundation Models. Xiaochen Wang, Jiaqi Wang, Houping Xiao, Jinghui Chen, Fenglong Ma |
| 2024 | FFN-SkipLLM: A Hidden Gem for Autoregressive Decoding with Adaptive Feed Forward Skipping. Ajay Jaiswal, Bodun Hu, Lu Yin, Yeonju Ro, Tianlong Chen, Shiwei Liu, Aditya Akella |
| 2024 | FIRST: Faster Improved Listwise Reranking with Single Token Decoding. Revanth Gangi Reddy, JaeHyeok Doo, Yifei Xu, Md. Arafat Sultan, Deevya Swain, Avirup Sil, Heng Ji |
| 2024 | FIRST: Teach A Reliable Large Language Model Through Efficient Trustworthy Distillation. Kashun Shum, Minrui Xu, Jianshu Zhang, Zixin Chen, Shizhe Diao, Hanze Dong, Jipeng Zhang, Muhammad Omer Raza |
| 2024 | FIZZ: Factual Inconsistency Detection by Zoom-in Summary and Zoom-out Document. Joonho Yang, Seunghyun Yoon, Byeongjeong Kim, Hwanhee Lee |
| 2024 | FLIRT: Feedback Loop In-context Red Teaming. Ninareh Mehrabi, Palash Goyal, Christophe Dupuy, Qian Hu, Shalini Ghosh, Richard S. Zemel, Kai-Wei Chang, Aram Galstyan, Rahul Gupta |
| 2024 | FOLIO: Natural Language Reasoning with First-Order Logic. Simeng Han, Hailey Schoelkopf, Yilun Zhao, Zhenting Qi, Martin Riddell, Wenfei Zhou, James Coady, David Peng, Yujie Qiao, Luke Benson, Lucy Sun, Alexander Wardle-Solano, Hannah Szabó, Ekaterina Zubova, Matthew Burtell, Jonathan Fan, Yixin Liu, Brian Wong, Malcolm Sailor, Ansong Ni, Linyong Nan, Jungo Kasai, Tao Yu, Rui Zhang, Alexander R. Fabbri, Wojciech Kryscinski, Semih Yavuz, Ye Liu, Xi Victoria Lin, Shafiq Joty, Yingbo Zhou, Caiming Xiong, Rex Ying, Arman Cohan, Dragomir Radev |
| 2024 | FOOL ME IF YOU CAN! An Adversarial Dataset to Investigate the Robustness of LMs in Word Sense Disambiguation. Mohamad Ballout, Anne Dedert, Nohayr Abdelmoneim, Ulf Krumnack, Gunther Heidemann, Kai-Uwe Kühnberger |
| 2024 | FRoG: Evaluating Fuzzy Reasoning of Generalized Quantifiers in LLMs. Yiyuan Li, Shichao Sun, Pengfei Liu |
| 2024 | Factuality of Large Language Models: A Survey. Yuxia Wang, Minghan Wang, Muhammad Arslan Manzoor, Fei Liu, Georgi Nenkov Georgiev, Rocktim Jyoti Das, Preslav Nakov |
| 2024 | FairFlow: Mitigating Dataset Biases through Undecided Learning for Natural Language Understanding. Jiali Cheng, Hadi Amiri |
| 2024 | Fairer Preferences Elicit Improved Human-Aligned Large Language Model Judgments. Han Zhou, Xingchen Wan, Yinhong Liu, Nigel Collier, Ivan Vulic, Anna Korhonen |
| 2024 | Fast Forwarding Low-Rank Training. Adir Rahamim, Naomi Saphra, Sara Kangaslahti, Yonatan Belinkov |
| 2024 | Fewer is More: Boosting Math Reasoning with Reinforced Context Pruning. Xijie Huang, Li Lyna Zhang, Kwang-Ting Cheng, Fan Yang, Mao Yang |
| 2024 | Fill In The Gaps: Model Calibration and Generalization with Synthetic Data. Yang Ba, Michelle Mancenido, Rong Pan |
| 2024 | Filtered Direct Preference Optimization. Tetsuro Morimura, Mitsuki Sakamoto, Yuu Jinnai, Kenshi Abe, Kaito Ariu |
| 2024 | FinDVer: Explainable Claim Verification over Long and Hybrid-content Financial Documents. Yilun Zhao, Yitao Long, Tintin Jiang, Chengye Wang, Weiyuan Chen, Hongjun Liu, Xiangru Tang, Yiming Zhang, Chen Zhao, Arman Cohan |
| 2024 | Finding Blind Spots in Evaluator LLMs with Interpretable Checklists. Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M. Khapra |
| 2024 | Fine-Grained Detection of Solidarity for Women and Migrants in 155 Years of German Parliamentary Debates. Aida Kostikova, Dominik Beese, Benjamin Paassen, Ole Pütz, Gregor Wiedemann, Steffen Eger |
| 2024 | Fine-Grained Prediction of Reading Comprehension from Eye Movements. Omer Shubi, Yoav Meiri, Cfir Avraham Hadar, Yevgeni Berzak |
| 2024 | Fine-Tuning Large Language Models to Translate: Will a Touch of Noisy Data in Misaligned Languages Suffice? Dawei Zhu, Pinzhen Chen, Miaoran Zhang, Barry Haddow, Xiaoyu Shen, Dietrich Klakow |
| 2024 | Fine-Tuning and Prompt Optimization: Two Great Steps that Work Better Together. Dilara Soylu, Christopher Potts, Omar Khattab |
| 2024 | Fine-Tuning or Retrieval? Comparing Knowledge Injection in LLMs. Oded Ovadia, Menachem Brief, Moshik Mishaeli, Oren Elisha |
| 2024 | Fine-grained Pluggable Gradient Ascent for Knowledge Unlearning in Language Models. Xiaohua Feng, Chaochao Chen, Yuyuan Li, Zibin Lin |
| 2024 | FineCops-Ref: A new Dataset and Task for Fine-Grained Compositional Referring Expression Comprehension. Junzhuo Liu, Xuzheng Yang, Weiwei Li, Peng Wang |
| 2024 | Finer: Investigating and Enhancing Fine-Grained Visual Concept Recognition in Large Vision Language Models. Jeonghwan Kim, Heng Ji |
| 2024 | First Heuristic Then Rational: Dynamic Use of Heuristics in Language Model Reasoning. Yoichi Aoki, Keito Kudo, Tatsuki Kuribayashi, Shusaku Sone, Masaya Taniguchi, Keisuke Sakaguchi, Kentaro Inui |
| 2024 | Fisher Information-based Efficient Curriculum Federated Learning with Large Language Models. Ji Liu, Jiaxiang Ren, Ruoming Jin, Zijie Zhang, Yang Zhou, Patrick Valduriez, Dejing Dou |
| 2024 | Fishing for Magikarp: Automatically Detecting Under-trained Tokens in Large Language Models. Sander Land, Max Bartolo |
| 2024 | Flee the Flaw: Annotating the Underlying Logic of Fallacious Arguments Through Templates and Slot-filling. Irfan Robbani, Paul Reisert, Surawat Pothong, Naoya Inoue, Camélia Guerraoui, Wenzhi Wang, Shoichi Naito, Jungmin Choi, Kentaro Inui |
| 2024 | FlipGuard: Defending Preference Alignment against Update Regression with Constrained Optimization. Mingye Zhu, Yi Liu, Quan Wang, Junbo Guo, Zhendong Mao |
| 2024 | Focused Large Language Models are Stable Many-Shot Learners. Peiwen Yuan, Shaoxiong Feng, Yiwei Li, Xinglin Wang, Yueqi Zhang, Chuyi Tan, Boyuan Pan, Heda Wang, Yao Hu, Kan Li |
| 2024 | FoodieQA: A Multimodal Dataset for Fine-Grained Understanding of Chinese Food Culture. Wenyan Li, Xinyu Zhang, Jiaang Li, Qiwei Peng, Raphael Tang, Li Zhou, Weijia Zhang, Guimin Hu, Yifei Yuan, Anders Søgaard, Daniel Hershcovich, Desmond Elliott |
| 2024 | Fool Me Once? Contrasting Textual and Visual Explanations in a Clinical Decision-Support Setting. Maxime Kayser, Bayar Menzat, Cornelius Emde, Bogdan Bercean, Alex Novak, Abdalá Morgado, Bartlomiej W. Papiez, Susanne Gaube, Thomas Lukasiewicz, Oana-Maria Camburu |
| 2024 | Forgetting Curve: A Reliable Method for Evaluating Memorization Capability for Long-Context Models. Xinyu Liu, Runsong Zhao, Pengcheng Huang, Chunyang Xiao, Bei Li, Jingang Wang, Tong Xiao, Jingbo Zhu |
| 2024 | Formality is Favored: Unraveling the Learning Preferences of Large Language Models on Data with Conflicting Knowledge. Jiahuan Li, Yiqing Cao, Shujian Huang, Jiajun Chen |
| 2024 | Foundational Autoraters: Taming Large Language Models for Better Automatic Evaluation. Tu Vu, Kalpesh Krishna, Salaheddin Alzubi, Chris Tar, Manaal Faruqui, Yun-Hsuan Sung |
| 2024 | Free your mouse! Command Large Language Models to Generate Code to Format Word Documents. Shihao Rao, Liang Li, Jiapeng Liu, Weixin Guan, Xiyan Gao, Bing Lim, Can Ma |
| 2024 | From Bottom to Top: Extending the Potential of Parameter Efficient Fine-Tuning. Jihao Gu, Zelin Wang, Yibo Zhang, Ziji Zhang, Ping Gong |
| 2024 | From Descriptive Richness to Bias: Unveiling the Dark Side of Generative Image Caption Enrichment. Yusuke Hirota, Ryo Hachiuma, Chao-Han Huck Yang, Yuta Nakashima |
| 2024 | From Insights to Actions: The Impact of Interpretability and Analysis Research on NLP. Marius Mosbach, Vagrant Gautam, Tomás Vergara Browne, Dietrich Klakow, Mor Geva |
| 2024 | From LLMs to MLLMs: Exploring the Landscape of Multimodal Jailbreaking. Siyuan Wang, Zhuohan Long, Zhihao Fan, Zhongyu Wei |
| 2024 | From Local Concepts to Universals: Evaluating the Multicultural Understanding of Vision-Language Models. Mehar Bhatia, Sahithya Ravi, Aditya Chinchure, Eunjeong Hwang, Vered Shwartz |
| 2024 | From RAG to Riches: Retrieval Interlaced with Sequence Generation. Palak Jain, Livio Baldini Soares, Tom Kwiatkowski |
| 2024 | From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis. Chuanqi Cheng, Jian Guan, Wei Wu, Rui Yan |
| 2024 | Frontmatter. |
| 2024 | Fuse to Forget: Bias Reduction and Selective Memorization through Model Fusion. Kerem Zaman, Leshem Choshen, Shashank Srivastava |
| 2024 | FuseGen: PLM Fusion for Data-generation based Zero-shot Learning. Tianyuan Zou, Yang Liu, Peng Li, Jianqing Zhang, Jingjing Liu, Ya-Qin Zhang |
| 2024 | F²RL: Factuality and Faithfulness Reinforcement Learning Framework for Claim-Guided Evidence-Supported Counterspeech Generation. Haiyang Wang, Yuchen Pan, Xin Song, Xuechen Zhao, Minghao Hu, Bin Zhou |
| 2024 | GAMA: A Large Audio-Language Model with Advanced Audio Understanding and Complex Reasoning Abilities. Sreyan Ghosh, Sonal Kumar, Ashish Seth, Chandra Kiran Reddy Evuru, Utkarsh Tyagi, S. Sakshi, Oriol Nieto, Ramani Duraiswami, Dinesh Manocha |
| 2024 | GDPO: Learning to Directly Align Language Models with Diversity Using GFlowNets. Oh Joon Kwon, Daiki E. Matsunaga, Kee-Eung Kim |
| 2024 | GDTB: Genre Diverse Data for English Shallow Discourse Parsing across Modalities, Text Types, and Domains. Yang Janet Liu, Tatsuya Aoyama, Wesley Scivetti, Yilun Zhu, Shabnam Behzad, Lauren Levine, Jessica Lin, Devika Tiwari, Amir Zeldes |
| 2024 | GENRA: Enhancing Zero-shot Retrieval with Rank Aggregation. Georgios Katsimpras, Georgios Paliouras |
| 2024 | GLaPE: Gold Label-agnostic Prompt Evaluation for Large Language Models. Xuanchang Zhang, Zhuosheng Zhang, Hai Zhao |
| 2024 | GOME: Grounding-based Metaphor Binding With Conceptual Elaboration For Figurative Language Illustration. Linhao Zhang, Jintao Liu, Li Jin, Hao Wang, Kaiwen Wei, Guangluan Xu |
| 2024 | GPT vs RETRO: Exploring the Intersection of Retrieval and Parameter-Efficient Fine-Tuning. Aleksander Ficek, Jiaqi Zeng, Oleksii Kuchaiev |
| 2024 | GPT-4 Jailbreaks Itself with Near-Perfect Success Using Self-Explanation. Govind Ramesh, Yao Dou, Wei Xu |
| 2024 | GRASS: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients. Aashiq Muhamed, Oscar Li, David P. Woodruff, Mona T. Diab, Virginia Smith |
| 2024 | GRIZAL: Generative Prior-guided Zero-Shot Temporal Action Localization. Onkar Susladkar, Gayatri Deshmukh, Vandan Gorade, Sparsh Mittal |
| 2024 | Game on Tree: Visual Hallucination Mitigation via Coarse-to-Fine View Tree and Game Theory. Xianwei Zhuang, Zhihong Zhu, Zhanpeng Chen, Yuxin Xie, Liming Liang, Yuexian Zou |
| 2024 | Generalizing Clinical De-identification Models by Privacy-safe Data Augmentation using GPT-4. Woojin Kim, Sungeun Hahm, Jaejin Lee |
| 2024 | Generate-on-Graph: Treat LLM as both Agent and KG for Incomplete Knowledge Graph Question Answering. Yao Xu, Shizhu He, Jiabei Chen, Zihao Wang, Yangqiu Song, Hanghang Tong, Guang Liu, Jun Zhao, Kang Liu |
| 2024 | Generating Demonstrations for In-Context Compositional Generalization in Grounded Language Learning. Sam Spilsbury, Pekka Marttinen, Alexander Ilin |
| 2024 | Generation with Dynamic Vocabulary. Yanting Liu, Tao Ji, Changzhi Sun, Yuanbin Wu, Xiaoling Wang |
| 2024 | Generative Models for Automatic Medical Decision Rule Extraction from Text. Yuxin He, Buzhou Tang, Xiaoling Wang |
| 2024 | Generative Subgraph Retrieval for Knowledge Graph-Grounded Dialog Generation. Jinyoung Park, Minseok Joo, Joo-Kyung Kim, Hyunwoo J. Kim |
| 2024 | GeoGPT4V: Towards Geometric Multi-modal Large Language Models with Geometric Image Generation. Shihao Cai, Keqin Bao, Hangyu Guo, Jizhi Zhang, Jun Song, Bo Zheng |
| 2024 | Getting More from Less: Large Language Models are Good Spontaneous Multilingual Learners. Shimao Zhang, Changjiang Gao, Wenhao Zhu, Jiajun Chen, Xin Huang, Xue Han, Junlan Feng, Chao Deng, Shujian Huang |
| 2024 | Getting The Most Out of Your Training Data: Exploring Unsupervised Tasks for Morphological Inflection. Abhishek Purushothama, Adam Wiemerslage, Katharina von der Wense |
| 2024 | Global Reward to Local Rewards: Multimodal-Guided Decomposition for Improving Dialogue Agents. Dong Won Lee, Hae Park, Yoon Kim, Cynthia Breazeal, Louis-Philippe Morency |
| 2024 | GlobeSumm: A Challenging Benchmark Towards Unifying Multi-lingual, Cross-lingual and Multi-document News Summarization. Yangfan Ye, Xiachong Feng, Xiaocheng Feng, Weitao Ma, Libo Qin, Dongliang Xu, Qing Yang, Hongtao Liu, Bing Qin |
| 2024 | GlossLM: A Massively Multilingual Corpus and Pretrained Model for Interlinear Glossed Text. Michael Ginn, Lindia Tjuatja, Taiqi He, Enora Rice, Graham Neubig, Alexis Palmer, Lori S. Levin |
| 2024 | Glue pizza and eat rocks - Exploiting Vulnerabilities in Retrieval-Augmented Generative Models. Zhen Tan, Chengshuai Zhao, Raha Moraffah, Yifan Li, Song Wang, Jundong Li, Tianlong Chen, Huan Liu |
| 2024 | Gold Panning in Vocabulary: An Adaptive Method for Vocabulary Expansion of Domain-Specific LLMs. Chengyuan Liu, Shihang Wang, Lizhi Qing, Kun Kuang, Yangyang Kang, Changlong Sun, Fei Wu |
| 2024 | GoldCoin: Grounding Large Language Models in Privacy Laws via Contextual Integrity Theory. Wei Fan, Haoran Li, Zheye Deng, Weiqi Wang, Yangqiu Song |
| 2024 | GottBERT: a pure German Language Model. Raphael Scheible, Johann Frei, Fabian Thomczyk, Henry He, Patric Tippmann, Jochen Knaus, Victor Jaravine, Frank Kramer, Martin Boeker |
| 2024 | Granular Privacy Control for Geolocation with Vision Language Models. Ethan Mendes, Yang Chen, James Hays, Sauvik Das, Wei Xu, Alan Ritter |
| 2024 | Grasping the Essentials: Tailoring Large Language Models for Zero-Shot Relation Extraction. Sizhe Zhou, Yu Meng, Bowen Jin, Jiawei Han |
| 2024 | Grounding Language in Multi-Perspective Referential Communication. Zineng Tang, Lingjun Mao, Alane Suhr |
| 2024 | GuardBench: A Large-Scale Benchmark for Guardrail Models. Elias Bassani, Ignacio Sanchez |
| 2024 | HEART-felt Narratives: Tracing Empathy and Narrative Style in Personal Stories with LLMs. Jocelyn Shen, Joel Mire, Hae Park, Cynthia Breazeal, Maarten Sap |
| 2024 | HELPD: Mitigating Hallucination of LVLMs by Hierarchical Feedback Learning with Vision-enhanced Penalty Decoding. Fan Yuan, Chi Qin, Xiaogang Xu, Piji Li |
| 2024 | HalluMeasure: Fine-grained Hallucination Measurement Using Chain-of-Thought Reasoning. Shayan Ali Akbar, Md Mosharaf Hossain, Tess Wood, Si-Chi Chin, Erica Salinas, Victor Alvarez, Erwin Cornejo |
| 2024 | Hate Personified: Investigating the role of LLMs in content moderation. Sarah Masud, Sahajpreet Singh, Viktor Hangya, Alexander Fraser, Tanmoy Chakraborty |
| 2024 | Hateful Word in Context Classification. Sanne Hoeken, Sina Zarrieß, Özge Alaçam |
| 2024 | Heterogeneous LoRA for Federated Fine-tuning of On-Device Foundation Models. Yae Jee Cho, Luyang Liu, Zheng Xu, Aldi Fahrezi, Gauri Joshi |
| 2024 | HiFT: A Hierarchical Full Parameter Fine-Tuning Strategy. Yongkang Liu, Yiqun Zhang, Qian Li, Tong Liu, Shi Feng, Daling Wang, Yifei Zhang, Hinrich Schütze |
| 2024 | Hidden Persuaders: LLMs' Political Leaning and Their Influence on Voters. Yujin Potter, Shiyang Lai, Junsol Kim, James Evans, Dawn Song |
| 2024 | Hierarchical Deconstruction of LLM Reasoning: A Graph-Based Framework for Analyzing Knowledge Utilization. Miyoung Ko, Sue Hyun Park, Joonsuk Park, Minjoon Seo |
| 2024 | Holistic Automated Red Teaming for Large Language Models through Top-Down Test Case Generation and Multi-turn Interaction. Jinchuan Zhang, Yan Zhou, Yaxin Liu, Ziming Li, Songlin Hu |
| 2024 | Holistic Evaluation for Interleaved Text-and-Image Generation. Minqian Liu, Zhiyang Xu, Zihao Lin, Trevor Ashby, Joy Rimchala, Jiaxin Zhang, Lifu Huang |
| 2024 | Hopping Too Late: Exploring the Limitations of Large Language Models on Multi-Hop Queries. Eden Biran, Daniela Gottesman, Sohee Yang, Mor Geva, Amir Globerson |
| 2024 | Householder Pseudo-Rotation: A Novel Approach to Activation Editing in LLMs with Direction-Magnitude Perspective. Van-Cuong Pham, Thien Nguyen |
| 2024 | How Do Humans Write Code? Large Models Do It the Same Way Too. Long Li, Xuzheng He, Haozhe Wang, Linlin Wang, Liang He |
| 2024 | How Do Your Code LLMs perform? Empowering Code Instruction Tuning with Really Good Data. Yejie Wang, Keqing He, Dayuan Fu, Zhuoma Gongque, Heyang Xu, Yanxu Chen, Zhexu Wang, Yujia Fu, Guanting Dong, Muxi Diao, Jingang Wang, Mengdi Zhang, Xunliang Cai, Weiran Xu |
| 2024 | How Does the Disclosure of AI Assistance Affect the Perceptions of Writing? Zhuoyan Li, Chen Liang, Jing Peng, Ming Yin |
| 2024 | How Does the Textual Information Affect the Retrieval of Multimodal In-Context Learning? Yang Luo, Zangwei Zheng, Zirui Zhu, Yang You |
| 2024 | How Far Can We Extract Diverse Perspectives from Large Language Models? Shirley Anugrah Hayati, Minhwa Lee, Dheeraj Rajagopal, Dongyeop Kang |
| 2024 | How Hard is this Test Set? NLI Characterization by Exploiting Training Dynamics. Adrian Cosma, Stefan Ruseti, Mihai Dascalu, Cornelia Caragea |
| 2024 | How Susceptible are Large Language Models to Ideological Manipulation? Kai Chen, Zihao He, Jun Yan, Taiwei Shi, Kristina Lerman |
| 2024 | How do Large Language Models Learn In-Context? Query and Key Matrices of In-Context Heads are Two Towers for Metric Learning. Zeping Yu, Sophia Ananiadou |
| 2024 | How to Compute the Probability of a Word. Tiago Pimentel, Clara Meister |
| 2024 | How to Leverage Demonstration Data in Alignment for Large Language Model? A Self-Imitation Learning Perspective. Teng Xiao, Mingxiao Li, Yige Yuan, Huaisheng Zhu, Chao Cui, Vasant G. Honavar |
| 2024 | Human-LLM Hybrid Text Answer Aggregation for Crowd Annotations. Jiyi Li |
| 2024 | Humans or LLMs as the Judge? A Study on Judgement Bias. Guiming Chen, Shunian Chen, Ziche Liu, Feng Jiang, Benyou Wang |
| 2024 | I Could've Asked That: Reformulating Unanswerable Questions. Wenting Zhao, Ge Gao, Claire Cardie, Alexander M. Rush |
| 2024 | I Learn Better If You Speak My Language: Understanding the Superior Performance of Fine-Tuning Large Language Models with LLM-Generated Responses. Xuan Ren, Biao Wu, Lingqiao Liu |
| 2024 | I Need Help! Evaluating LLM's Ability to Ask for Users' Support: A Case Study on Text-to-SQL Generation. Cheng-Kuang Wu, Zhi Rui Tam, Chao-Chung Wu, Chieh-Yen Lin, Hung-yi Lee, Yun-Nung Chen |
| 2024 | I love pineapple on pizza != I hate pineapple on pizza: Stance-Aware Sentence Transformers for Opinion Mining. Vahid Ghafouri, Jose Such, Guillermo Suarez-Tangil |
| 2024 | I-AM-G: Interest Augmented Multimodal Generator for Item Personalization. Xianquan Wang, Likang Wu, Shukang Yin, Zhi Li, Yanjiang Chen, Hufeng Hufeng, Yu Su, Qi Liu |
| 2024 | IDEAW: Robust Neural Audio Watermarking with Invertible Dual-Embedding. Pengcheng Li, Xulong Zhang, Jing Xiao, Jianzong Wang |
| 2024 | IFCap: Image-like Retrieval and Frequency-based Entity Filtering for Zero-shot Captioning. Soeun Lee, Si-Woo Kim, Taewhan Kim, Dong-Jin Kim |
| 2024 | IM-BERT: Enhancing Robustness of BERT through the Implicit Euler Method. MiHyeon Kim, Juhyoung Park, YoungBin Kim |
| 2024 | If CLIP Could Talk: Understanding Vision-Language Model Representations Through Their Preferred Concept Descriptions. Reza Esfandiarpoor, Cristina Menghini, Stephen H. Bach |
| 2024 | ImageInWords: Unlocking Hyper-Detailed Image Descriptions. Roopal Garg, Andrea Burns, Burcu Karagol Ayan, Yonatan Bitton, Ceslee Montgomery, Yasumasa Onoe, Andrew Bunner, Ranjay Krishna, Jason Baldridge, Radu Soricut |
| 2024 | Images Speak Louder than Words: Understanding and Mitigating Bias in Vision-Language Model from a Causal Mediation Perspective. Zhaotian Weng, Zijun Gao, Jerone Theodore Alexander Andrews, Jieyu Zhao |
| 2024 | Impeding LLM-assisted Cheating in Introductory Programming Assignments via Adversarial Perturbation. Saiful Islam Salim, Rubin Yuchan Yang, Alexander Cooper, Suryashree Ray, Saumya Debray, Sazzadur Rahaman |
| 2024 | Improve Dense Passage Retrieval with Entailment Tuning. Lu Dai, Hao Liu, Hui Xiong |
| 2024 | Improve Student's Reasoning Generalizability through Cascading Decomposed CoTs Distillation. Chengwei Dai, Kun Li, Wei Zhou, Songlin Hu |
| 2024 | Improving Discriminative Capability of Reward Models in RLHF Using Contrastive Learning. Lu Chen, Rui Zheng, Binghai Wang, Senjie Jin, Caishuang Huang, Junjie Ye, Zhihao Zhang, Yuhao Zhou, Zhiheng Xi, Tao Gui, Qi Zhang, Xuanjing Huang |
| 2024 | Improving Knowledge Graph Completion with Structure-Aware Supervised Contrastive Learning. Jiashi Lin, Lifang Wang, Xinyu Lu, Zhongtian Hu, Wei Zhang, Wenxuan Lu |
| 2024 | Improving Minimum Bayes Risk Decoding with Multi-Prompt. David Heineman, Yao Dou, Wei Xu |
| 2024 | Improving Multi-party Dialogue Generation via Topic and Rhetorical Coherence. Yaxin Fan, Peifeng Li, Qiaoming Zhu |
| 2024 | Improving Retrieval-augmented Text-to-SQL with AST-based Ranking and Schema Pruning. Zhili Shen, Pavlos Vougiouklis, Chenxin Diao, Kaustubh Vyas, Yuanyi Ji, Jeff Z. Pan |
| 2024 | Improving Spoken Language Modeling with Phoneme Classification: A Simple Fine-tuning Approach. Maxime Poli, Emmanuel Chemla, Emmanuel Dupoux |
| 2024 | Improving Zero-shot LLM Re-Ranker with Risk Minimization. Xiaowei Yuan, Zhao Yang, Yequan Wang, Jun Zhao, Kang Liu |
| 2024 | In Search of the Long-Tail: Systematic Generation of Long-Tail Inferential Knowledge via Logical Rule Guided Search. Huihan Li, Yuting Ning, Zeyi Liao, Siyuan Wang, Xiang Li, Ximing Lu, Wenting Zhao, Faeze Brahman, Yejin Choi, Xiang Ren |
| 2024 | In-Context Compositional Generalization for Large Vision-Language Models. Chuanhao Li, Chenchen Jing, Zhen Li, Mingliang Zhai, Yuwei Wu, Yunde Jia |
| 2024 | In-context Contrastive Learning for Event Causality Identification. Chao Liang, Wei Xiang, Bang Wang |
| 2024 | Incomplete Utterance Rewriting with Editing Operation Guidance and Utterance Augmentation. Zhiyu Cao, Peifeng Li, Yaxin Fan, Qiaoming Zhu |
| 2024 | Incubating Text Classifiers Following User Instruction with Nothing but LLM. Letian Peng, Zilong Wang, Jingbo Shang |
| 2024 | Induct-Learn: Short Phrase Prompting with Instruction Induction. Po-Chun Chen, Sheng-Lun Wei, Hen-Hsen Huang, Hsin-Hsi Chen |
| 2024 | Inductive-Deductive Strategy Reuse for Multi-Turn Instructional Dialogues. Jiao Ou, Jiayu Wu, Che Liu, Fuzheng Zhang, Di Zhang, Kun Gai |
| 2024 | InferAligner: Inference-Time Alignment for Harmlessness through Cross-Model Guidance. Pengyu Wang, Dong Zhang, Linyang Li, Chenkun Tan, Xinghao Wang, Mozhi Zhang, Ke Ren, Botian Jiang, Xipeng Qiu |
| 2024 | Inference Helps PLMs' Conceptual Understanding: Improving the Abstract Inference Ability with Hierarchical Conceptual Entailment Graphs. Juncai Li, Ru Li, Xiaoli Li, Qinghua Chai, Jeff Z. Pan |
| 2024 | InfiniPot: Infinite Context Processing on Memory-Constrained LLMs. Minsoo Kim, Kyuhong Shim, Jungwook Choi, Simyung Chang |
| 2024 | Information Flow Routes: Automatically Interpreting Language Models at Scale. Javier Ferrando, Elena Voita |
| 2024 | Initialization of Large Language Models via Reparameterization to Mitigate Loss Spikes. Kosuke Nishida, Kyosuke Nishida, Kuniko Saito |
| 2024 | Instruction Fine-Tuning: Does Prompt Loss Matter? Mathew Huerta-Enochian, Seung Ko |
| 2024 | Instruction Matters: A Simple yet Effective Task Selection for Optimized Instruction Tuning of Specific Tasks. Changho Lee, Janghoon Han, Seonghyeon Ye, Stanley Jungkyu Choi, Honglak Lee, Kyunghoon Bae |
| 2024 | Instruction Pre-Training: Language Models are Supervised Multitask Learners. Daixuan Cheng, Yuxian Gu, Shaohan Huang, Junyu Bi, Minlie Huang, Furu Wei |
| 2024 | IntCoOp: Interpretability-Aware Vision-Language Prompt Tuning. Soumya Suvra Ghosal, Samyadeep Basu, Soheil Feizi, Dinesh Manocha |
| 2024 | Integrating Argumentation and Hate-Speech-based Techniques for Countering Misinformation. Sougata Saha, Rohini K. Srihari |
| 2024 | Integrating Plutchik's Theory with Mixture of Experts for Enhancing Emotion Classification. Dongjun Lim, Yun-Gyung Cheong |
| 2024 | Integrating Structural Semantic Knowledge for Enhanced Information Extraction Pre-training. Xiaoyang Yi, Yuru Bao, Jian Zhang, Yifang Qin, Faxin Lin |
| 2024 | InterIntent: Investigating Social Intelligence of LLMs via Intention Understanding in an Interactive Game Context. Ziyi Liu, Abhishek Anand, Pei Zhou, Jen-tse Huang, Jieyu Zhao |
| 2024 | Interpretability-based Tailored Knowledge Editing in Transformers. Yihuai Hong, Aldo Lipani |
| 2024 | Interpretable Composition Attribution Enhancement for Visio-linguistic Compositional Understanding. Wei Li, Zhen Huang, Xinmei Tian, Le Lu, Houqiang Li, Xu Shen, Jieping Ye |
| 2024 | Interpreting Arithmetic Mechanism in Large Language Models through Comparative Neuron Analysis. Zeping Yu, Sophia Ananiadou |
| 2024 | Interpreting Context Look-ups in Transformers: Investigating Attention-MLP Interactions. Clement Neo, Shay B. Cohen, Fazl Barez |
| 2024 | Interventional Speech Noise Injection for ASR Generalizable Spoken Language Understanding. Yeonjoon Jung, Jaeseong Lee, Seungtaek Choi, Dohyeon Lee, Minsoo Kim, Seung-won Hwang |
| 2024 | Into the Unknown Unknowns: Engaged Human Learning through Participation in Language Model Agent Conversations. Yucheng Jiang, Yijia Shao, Dekun Ma, Sina J. Semnani, Monica S. Lam |
| 2024 | Intrinsic Self-correction for Enhanced Morality: An Analysis of Internal Mechanisms and the Superficial Hypothesis. Guangliang Liu, Haitao Mao, Jiliang Tang, Kristen Marie Johnson |
| 2024 | Investigating LLMs as Voting Assistants via Contextual Augmentation: A Case Study on the European Parliament Elections 2024. Ilias Chalkidis |
| 2024 | Investigating Large Language Models for Complex Word Identification in Multilingual and Multidomain Setups. Razvan-Alexandru Smadu, David-Gabriel Ion, Dumitru-Clementin Cercel, Florin Pop, Mihaela-Claudia Cercel |
| 2024 | Investigating Multilingual Instruction-Tuning: Do Polyglot Models Demand for Multilingual Instructions? Alexander Arno Weber, Klaudia Thellmann, Jan Ebert, Nicolas Flores-Herr, Jens Lehmann, Michael Fromm, Mehdi Ali |
| 2024 | Investigating Mysteries of CoT-Augmented Distillation. Somin Wadhwa, Silvio Amir, Byron C. Wallace |
| 2024 | Investigating and Mitigating Object Hallucinations in Pretrained Vision-Language (CLIP) Models. Yufang Liu, Tao Ji, Changzhi Sun, Yuanbin Wu, Aimin Zhou |
| 2024 | Investigating the Role of Instruction Variety and Task Difficulty in Robotic Manipulation Tasks. Amit Parekh, Nikolas Vitsakis, Alessandro Suglia, Ioannis Konstas |
| 2024 | Is C4 Dataset Optimal for Pruning? An Investigation of Calibration Data for LLM Pruning. Abhinav Bandari, Lu Yin, Cheng-Yu Hsieh, Ajay Jaiswal, Tianlong Chen, Li Shen, Ranjay Krishna, Shiwei Liu |
| 2024 | Is Child-Directed Speech Effective Training Data for Language Models? Steven Y. Feng, Noah D. Goodman, Michael Frank |
| 2024 | Is It Good Data for Multilingual Instruction Tuning or Just Bad Multilingual Evaluation for Large Language Models? Pinzhen Chen, Simon Yu, Zhicheng Guo, Barry Haddow |
| 2024 | Is It Really Long Context if All You Need Is Retrieval? Towards Genuinely Difficult Long Context NLP. Omer Goldman, Alon Jacovi, Aviv Slobodkin, Aviya Maimon, Ido Dagan, Reut Tsarfaty |
| 2024 | Is LLM-as-a-Judge Robust? Investigating Universal Adversarial Attacks on Zero-shot LLM Assessment. Vyas Raina, Adian Liusie, Mark J. F. Gales |
| 2024 | Is Safer Better? The Impact of Guardrails on the Argumentative Strength of LLMs in Hate Speech Countering. Helena Bonaldi, Greta Damo, Nicolás Benjamín Ocampo, Elena Cabrio, Serena Villata, Marco Guerini |
| 2024 | Is This a Bad Table? A Closer Look at the Evaluation of Table Generation from Text. Pritika Ramu, Aparna Garimella, Sambaran Bandyopadhyay |
| 2024 | Is this the real life? Is this just fantasy? The Misleading Success of Simulating Social Interactions With LLMs. Xuhui Zhou, Zhe Su, Tiwalayo Eisape, Hyunwoo Kim, Maarten Sap |
| 2024 | Jailbreaking LLMs with Arabic Transliteration and Arabizi. Mansour Al Ghanim, Saleh Almohaimeed, Mengxin Zheng, Yan Solihin, Qian Lou |
| 2024 | Jellyfish: Instruction-Tuning Local Large Language Models for Data Preprocessing. Haochen Zhang, Yuyang Dong, Chuan Xiao, Masafumi Oyamada |
| 2024 | Joint Pre-Encoding Representation and Structure Embedding for Efficient and Low-Resource Knowledge Graph Completion. Chenyu Qiu, Pengjiang Qian, Chuang Wang, Jian Yao, Li Liu, Wei Fang, Eddie Eddie |
| 2024 | Jump Starting Bandits with LLM-Generated Prior Knowledge. Parand A. Alamdari, Yanshuai Cao, Kevin H. Wilson |
| 2024 | KARL: Knowledge-Aware Retrieval and Representations aid Retention and Learning in Students. Matthew Shu, Nishant Balepur, Shi Feng, Jordan L. Boyd-Graber |
| 2024 | KB-Plugin: A Plug-and-play Framework for Large Language Models to Induce Programs over Low-resourced Knowledge Bases. Jiajie Zhang, Shulin Cao, Linmei Hu, Ling Feng, Lei Hou, Juanzi Li |
| 2024 | KNN-Instruct: Automatic Instruction Construction with K Nearest Neighbor Deduction. Jianshang Kou, Benfeng Xu, Chiwei Zhu, Zhendong Mao |
| 2024 | KidLM: Advancing Language Models for Children - Early Insights and Future Directions. Mir Tafseer Nayeem, Davood Rafiei |
| 2024 | Kiss up, Kick down: Exploring Behavioral Changes in Multi-modal Large Language Models with Assigned Visual Personas. Seungjong Sun, Eungu Lee, Seo Yeon Baek, Seunghyun Hwang, Wonbyung Lee, Dongyan Nan, Bernard J. Jansen, Jang-Hyun Kim |
| 2024 | KnowTuning: Knowledge-aware Fine-tuning for Large Language Models. Yougang Lyu, Lingyong Yan, Shuaiqiang Wang, Haibo Shi, Dawei Yin, Pengjie Ren, Zhumin Chen, Maarten de Rijke, Zhaochun Ren |
| 2024 | Knowledge Conflicts for LLMs: A Survey. Rongwu Xu, Zehan Qi, Zhijiang Guo, Cunxiang Wang, Hongru Wang, Yue Zhang, Wei Xu |
| 2024 | Knowledge Graph Enhanced Large Language Model Editing. Mengqi Zhang, Xiaotian Ye, Qiang Liu, Pengjie Ren, Shu Wu, Zhumin Chen |
| 2024 | Knowledge Planning in Large Language Models for Domain-Aligned Counseling Summarization. Aseem Srivastava, Smriti Joshi, Tanmoy Chakraborty, Md. Shad Akhtar |
| 2024 | Knowledge Verification to Nip Hallucination in the Bud. Fanqi Wan, Xinting Huang, Leyang Cui, Xiaojun Quan, Wei Bi, Shuming Shi |
| 2024 | Knowledge-Centric Hallucination Detection. Xiangkun Hu, Dongyu Ru, Lin Qiu, Qipeng Guo, Tianhang Zhang, Yang Xu, Yun Luo, Pengfei Liu, Yue Zhang, Zheng Zhang |
| 2024 | KnowledgeSG: Privacy-Preserving Synthetic Text Generation with Knowledge Distillation from Server. Wenhao Wang, Xiaoyu Liang, Rui Ye, Jingyi Chai, Siheng Chen, Yanfeng Wang |
| 2024 | LEMoE: Advanced Mixture of Experts Adaptor for Lifelong Model Editing of Large Language Models. Renzhi Wang, Piji Li |
| 2024 | LIONs: An Empirically Optimized Approach to Align Language Models. Xiao Yu, Qingyang Wu, Yu Li, Zhou Yu |
| 2024 | LLM See, LLM Do: Leveraging Active Inheritance to Target Non-Differentiable Objectives. Luísa Shimabucoro, Sebastian Ruder, Julia Kreutzer, Marzieh Fadaee, Sara Hooker |
| 2024 | LLM Task Interference: An Initial Study on the Impact of Task-Switch in Conversational History. Akash Gupta, Ivaxi Sheth, Vyas Raina, Mark J. F. Gales, Mario Fritz |
| 2024 | LLM-Based Agent Society Investigation: Collaboration and Confrontation in Avalon Gameplay. Yihuai Lan, Zhiqiang Hu, Lei Wang, Yang Wang, Deheng Ye, Peilin Zhao, Ee-Peng Lim, Hui Xiong, Hao Wang |
| 2024 | LLM-Evolve: Evaluation for LLM's Evolving Capability on Benchmarks. Jiaxuan You, Mingjie Liu, Shrimai Prabhumoye, Mostofa Patwary, Mohammad Shoeybi, Bryan Catanzaro |
| 2024 | LLM-based Code-Switched Text Generation for Grammatical Error Correction. Tom Potter, Zheng Yuan |
| 2024 | LLM4Decompile: Decompiling Binary Code with Large Language Models. Hanzhuo Tan, Qi Luo, Jing Li, Yuqun Zhang |
| 2024 | LLMEdgeRefine: Enhancing Text Clustering with LLM-Based Boundary Point Refinement. Zijin Feng, Luyang Lin, Lingzhi Wang, Hong Cheng, Kam-Fai Wong |
| 2024 | LLMs Are Prone to Fallacies in Causal Inference. Nitish Joshi, Abulhair Saparov, Yixin Wang, He He |
| 2024 | LLMs Are Zero-Shot Context-Aware Simultaneous Translators. Roman Koshkin, Katsuhito Sudoh, Satoshi Nakamura |
| 2024 | LLMs Assist NLP Researchers: Critique Paper (Meta-)Reviewing. Jiangshu Du, Yibo Wang, Wenting Zhao, Zhongfen Deng, Shuaiqi Liu, Renze Lou, Henry Peng Zou, Pranav Narayanan Venkit, Nan Zhang, Mukund Srinath, Haoran Zhang, Vipul Gupta, Yinghui Li, Tao Li, Fei Wang, Qin Liu, Tianlin Liu, Pengzhi Gao, Congying Xia, Chen Xing, Cheng Jiayang, Zhaowei Wang, Ying Su, Raj Sanjay Shah, Ruohao Guo, Jing Gu, Haoran Li, Kangda Wei, Zihao Wang, Lu Cheng, Surangika Ranathunga, Meng Fang, Jie Fu, Fei Liu, Ruihong Huang, Eduardo Blanco, Yixin Cao, Rui Zhang, Philip S. Yu, Wenpeng Yin |
| 2024 | LLMs learn governing principles of dynamical systems, revealing an in-context neural scaling law. Toni J. B. Liu, Nicolas Boullé, Raphaël Sarfati, Christopher J. Earls |
| 2024 | LLaMA-MoE: Building Mixture-of-Experts from LLaMA with Continual Pre-Training. Tong Zhu, Xiaoye Qu, Daize Dong, Jiacheng Ruan, Jingqi Tong, Conghui He, Yu Cheng |
| 2024 | LLoCO: Learning Long Contexts Offline. Sijun Tan, Xiuyu Li, Shishir G. Patil, Ziyang Wu, Tianjun Zhang, Kurt Keutzer, Joseph Gonzalez, Raluca A. Popa |
| 2024 | LM2: A Simple Society of Language Models Solves Complex Reasoning. Gurusha Juneja, Subhabrata Dutta, Tanmoy Chakraborty |
| 2024 | LONGAGENT: Achieving Question Answering for 128k-Token-Long Documents through Multi-Agent Collaboration. Jun Zhao, Can Zu, Xu Hao, Yi Lu, Wei He, Yiwen Ding, Tao Gui, Qi Zhang, Xuanjing Huang |
| 2024 | LUQ: Long-text Uncertainty Quantification for LLMs. Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier |
| 2024 | Label Confidence Weighted Learning for Target-level Sentence Simplification. Xin Ying Qiu, Jingshen Zhang |
| 2024 | Ladder: A Model-Agnostic Framework Boosting LLM-based Machine Translation to the Next Level. Zhaopeng Feng, Ruizhe Chen, Yan Zhang, Zijie Meng, Zuozhu Liu |
| 2024 | Language Concept Erasure for Language-invariant Dense Retrieval. Zhiqi Huang, Puxuan Yu, Shauli Ravfogel, James Allan |
| 2024 | Language Models Learn Rare Phenomena from Less Rare Phenomena: The Case of the Missing AANNs. Kanishka Misra, Kyle Mahowald |
| 2024 | Language Models as Compilers: Simulating Pseudocode Execution Improves Algorithmic Reasoning in Language Models. Hyungjoo Chae, Yeonghyeon Kim, Seungone Kim, Kai Tzu-iunn Ong, Beong-woo Kwak, Moohyeon Kim, Sunghwan Kim, Taeyoon Kwon, Jiwan Chung, Youngjae Yu, Jinyoung Yeo |
| 2024 | Language is Scary when Over-Analyzed: Unpacking Implied Misogynistic Reasoning with Argumentation Theory-Driven Prompts. Arianna Muti, Federico Ruggeri, Khalid Al-Khatib, Alberto Barrón-Cedeño, Tommaso Caselli |
| 2024 | Language models and brains align due to more than next-word prediction and word-level information. Gabriele Merlin, Mariya Toneva |
| 2024 | Language-to-Code Translation with a Single Labeled Example. Kaj Bostrom, Harsh Jhamtani, Hao Fang, Sam Thomson, Richard Shin, Patrick Xia, Benjamin Van Durme, Jason Eisner, Jacob Andreas |
| 2024 | Large Language Model as an Assignment Evaluator: Insights, Feedback, and Challenges in a 1000+ Student Course. Cheng-Han Chiang, Wei-Chih Chen, Chun-Yi Kuan, Chienchou Yang, Hung-yi Lee |
| 2024 | Large Language Models Are Involuntary Truth-Tellers: Exploiting Fallacy Failure for Jailbreak Attacks. Yue Zhou, Henry Peng Zou, Barbara Di Eugenio, Yang Zhang |
| 2024 | Large Language Models Are Poor Clinical Decision-Makers: A Comprehensive Benchmark. Fenglin Liu, Zheng Li, Hongjian Zhou, Qingyu Yin, Jingfeng Yang, Xianfeng Tang, Chen Luo, Ming Zeng, Haoming Jiang, Yifan Gao, Priyanka Nigam, Sreyashi Nag, Bing Yin, Yining Hua, Xuan Zhou, Omid Rohanian, Anshul Thakur, Lei A. Clifton, David A. Clifton |
| 2024 | Large Language Models Can Be Contextual Privacy Protection Learners. Yijia Xiao, Yiqiao Jin, Yushi Bai, Yue Wu, Xianjun Yang, Xiao Luo, Wenchao Yu, Xujiang Zhao, Yanchi Liu, Quanquan Gu, Haifeng Chen, Wei Wang, Wei Cheng |
| 2024 | Large Language Models Can Self-Correct with Key Condition Verification. Zhenyu Wu, Qingkai Zeng, Zhihan Zhang, Zhaoxuan Tan, Chao Shen, Meng Jiang |
| 2024 | Large Language Models Know What is Key Visual Entity: An LLM-assisted Multimodal Retrieval for VQA. Pu Jian, Donglei Yu, Jiajun Zhang |
| 2024 | Large Language Models as Foundations for Next-Gen Dense Retrieval: A Comprehensive Empirical Assessment. Kun Luo, Minghao Qin, Zheng Liu, Shitao Xiao, Jun Zhao, Kang Liu |
| 2024 | Large Language Models for Data Annotation and Synthesis: A Survey. Zhen Tan, Dawei Li, Song Wang, Alimohammad Beigi, Bohan Jiang, Amrita Bhattacharjee, Mansooreh Karami, Jundong Li, Lu Cheng, Huan Liu |
| 2024 | Latent Concept-based Explanation of NLP Models. Xuemin Yu, Fahim Dalvi, Nadir Durrani, Marzia Nouri, Hassan Sajjad |
| 2024 | LawBench: Benchmarking Legal Knowledge of Large Language Models. Zhiwei Fei, Xiaoyu Shen, Dawei Zhu, Fengzhe Zhou, Zhuo Han, Alan Huang, Songyang Zhang, Kai Chen, Zhixin Yin, Zongwen Shen, Jidong Ge, Vincent Ng |
| 2024 | Layer by Layer: Uncovering Where Multi-Task Learning Happens in Instruction-Tuned Large Language Models. Zheng Zhao, Yftah Ziser, Shay B. Cohen |
| 2024 | Leading Whitespaces of Language Models' Subword Vocabulary Pose a Confound for Calculating Word Probabilities. Byung-Doh Oh, William Schuler |
| 2024 | Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning. Zhihan Zhang, Tao Ge, Zhenwen Liang, Wenhao Yu, Dian Yu, Mengzhao Jia, Dong Yu, Meng Jiang |
| 2024 | Learn to Refuse: Making Large Language Models More Controllable and Reliable through Knowledge Scope Limitation and Refusal Mechanism. Lang Cao |
| 2024 | Learning Interpretable Legal Case Retrieval via Knowledge-Guided Case Reformulation. Chenlong Deng, Kelong Mao, Zhicheng Dou |
| 2024 | Learning Personalized Alignment for Evaluating Open-ended Text Generation. Danqing Wang, Kevin Yang, Hanlin Zhu, Xiaomeng Yang, Andrew Cohen, Lei Li, Yuandong Tian |
| 2024 | Learning Planning-based Reasoning by Trajectories Collection and Process Reward Synthesizing. Fangkai Jiao, Chengwei Qin, Zhengyuan Liu, Nancy F. Chen, Shafiq Joty |
| 2024 | Learning from Natural Language Explanations for Generalizable Entity Matching. Somin Wadhwa, Adit Krishnan, Runhui Wang, Byron C. Wallace, Luyang Kong |
| 2024 | Learning to Correct for QA Reasoning with Black-box LLMs. Jaehyung Kim, Dongyoung Kim, Yiming Yang |
| 2024 | Learning to Extract Structured Entities Using Language Models. Haolun Wu, Ye Yuan, Liana Mikaelyan, Alexander Meulemans, Xue Liu, James Hensman, Bhaskar Mitra |
| 2024 | Learning to Rank Salient Content for Query-focused Summarization. Sajad Sotudeh, Nazli Goharian |
| 2024 | Learning to Retrieve Iteratively for In-Context Learning. Yunmo Chen, Tongfei Chen, Harsh Jhamtani, Patrick Xia, Richard Shin, Jason Eisner, Benjamin Van Durme |
| 2024 | Learning to Write Rationally: How Information Is Distributed in Non-native Speakers' Essays. Zixin Tang, Janet G. van Hell |
| 2024 | Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA. Minzheng Wang, Longze Chen, Fu Cheng, Shengyi Liao, Xinghua Zhang, Bingli Wu, Haiyang Yu, Nan Xu, Lei Zhang, Run Luo, Yunshui Li, Min Yang, Fei Huang, Yongbin Li |
| 2024 | Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning. David Schulte, Felix Hamborg, Alan Akbik |
| 2024 | Let Me Teach You: Pedagogical Foundations of Feedback for Language Models. Beatriz Borges, Niket Tandon, Tanja Käser, Antoine Bosselut |
| 2024 | Let the Expert Stick to His Last: Expert-Specialized Fine-Tuning for Sparse Architectural Large Language Models. Zihan Wang, Deli Chen, Damai Dai, Runxin Xu, Zhuoshu Li, Yu Wu |
| 2024 | Let's discuss! Quality Dimensions and Annotated Datasets for Computational Argument Quality Assessment. Rositsa V. Ivanova, Thomas Huber, Christina Niklaus |
| 2024 | Leveraging BERT and TFIDF Features for Short Text Clustering via Alignment-Promoting Co-Training. Zetong Li, Qinliang Su, Shijing Si, Jianxing Yu |
| 2024 | Leveraging Conflicts in Social Media Posts: Unintended Offense Dataset. Che-Wei Tsai, Yen-Hao Huang, Tsu-Keng Liao, Didier Estrada, Retnani Latifah, Yi-Shin Chen |
| 2024 | Leveraging Context-Aware Prompting for Commit Message Generation. Zhihua Jiang, Jianwei Chen, Dongning Rao, Guanghui Ye |
| 2024 | Leveraging Estimated Transferability Over Human Intuition for Model Selection in Text Ranking. Jun Bai, Zhuofan Chen, Zhenzi Li, Hanhua Hong, Jianfei Zhang, Chen Li, Chenghua Lin, Wenge Rong |
| 2024 | Leveraging Large Language Models for NLG Evaluation: Advances and Challenges. Zhen Li, Xiaohan Xu, Tao Shen, Can Xu, Jia-Chen Gu, Yuxuan Lai, Chongyang Tao, Shuai Ma |
| 2024 | Leveraging pre-trained language models for linguistic analysis: A case of argument structure constructions. Hakyung Sung, Kristopher Kyle |
| 2024 | Lexically Grounded Subword Segmentation. Jindrich Libovický, Jindrich Helcl |
| 2024 | Liar, Liar, Logical Mire: A Benchmark for Suppositional Reasoning in Large Language Models. Philipp Mondorf, Barbara Plank |
| 2024 | Lifelong Event Detection via Optimal Transport. Viet Dao, Van-Cuong Pham, Quyen Tran, Thanh-Thien Le, Linh Ngo Van, Thien Huu Nguyen |
| 2024 | Lifelong Knowledge Editing for LLMs with Retrieval-Augmented Continuous Prompt Learning. Qizhou Chen, Taolin Zhang, Xiaofeng He, Dongyang Li, Chengyu Wang, Longtao Huang, Hui Xue' |
| 2024 | Linear Layer Extrapolation for Fine-Grained Emotion Classification. Mayukh Sharma, Sean O'Brien, Julian J. McAuley |
| 2024 | Linguistic Bias in ChatGPT: Language Models Reinforce Dialect Discrimination. Eve Fleisig, Genevieve Smith, Madeline Bossi, Ishita Rustagi, Xavier Yin, Dan Klein |
| 2024 | Link, Synthesize, Retrieve: Universal Document Linking for Zero-Shot Information Retrieval. Dae Yon Hwang, Bilal Taha, Harshit Pande, Yaroslav Nechaev |
| 2024 | LitSearch: A Retrieval Benchmark for Scientific Literature Search. Anirudh Ajith, Mengzhou Xia, Alexis Chevalier, Tanya Goyal, Danqi Chen, Tianyu Gao |
| 2024 | LoRA-Guard: Parameter-Efficient Guardrail Adaptation for Content Moderation of Large Language Models. Hayder Elesedy, Pedro M. Esperança, Silviu Vlad Oprea, Mete Ozay |
| 2024 | Local Contrastive Editing of Gender Stereotypes. Marlene Lutz, Rochelle Choenni, Markus Strohmaier, Anne Lauscher |
| 2024 | Locating Information Gaps and Narrative Inconsistencies Across Languages: A Case Study of LGBT People Portrayals on Wikipedia. Farhan Samir, Chan Young Park, Anjalie Field, Vered Shwartz, Yulia Tsvetkov |
| 2024 | LogicAsker: Evaluating and Improving the Logical Reasoning Ability of Large Language Models. Yuxuan Wan, Wenxuan Wang, Yiliu Yang, Youliang Yuan, Jen-tse Huang, Pinjia He, Wenxiang Jiao, Michael R. Lyu |
| 2024 | LogicST: A Logical Self-Training Framework for Document-Level Relation Extraction with Incomplete Annotations. Shengda Fan, Yanting Wang, Shasha Mo, Jianwei Niu |
| 2024 | LongEmbed: Extending Embedding Models for Long Context Retrieval. Dawei Zhu, Liang Wang, Nan Yang, Yifan Song, Wenhao Wu, Furu Wei, Sujian Li |
| 2024 | LongRAG: A Dual-Perspective Retrieval-Augmented Generation Paradigm for Long-Context Question Answering. Qingfei Zhao, Ruobing Wang, Yukuo Cen, Daren Zha, Shicheng Tan, Yuxiao Dong, Jie Tang |
| 2024 | Lookback Lens: Detecting and Mitigating Contextual Hallucinations in Large Language Models Using Only Attention Maps. Yung-Sung Chuang, Linlu Qiu, Cheng-Yu Hsieh, Ranjay Krishna, Yoon Kim, James R. Glass |
| 2024 | M3D: MultiModal MultiDocument Fine-Grained Inconsistency Detection. Chia-Wei Tang, Ting-Chih Chen, Kiet Nguyen, Kazi Sajeed Mehrab, Alvi Md. Ishmam, Chris Thomas |
| 2024 | M3Hop-CoT: Misogynous Meme Identification with Multimodal Multi-hop Chain-of-Thought. Gitanjali Kumari, Kirtan Jain, Asif Ekbal |
| 2024 | MAIR: A Massive Benchmark for Evaluating Instructed Retrieval. Weiwei Sun, Zhengliang Shi, Wu Long, Lingyong Yan, Xinyu Ma, Yiding Liu, Min Cao, Dawei Yin, Zhaochun Ren |
| 2024 | MAR: Matching-Augmented Reasoning for Enhancing Visual-based Entity Question Answering. Zhengxuan Zhang, Yin Wu, Yuyu Luo, Nan Tang |
| 2024 | MARE: Multi-Aspect Rationale Extractor on Unsupervised Rationale Extraction. Han Jiang, Junwen Duan, Zhe Qu, Jianxin Wang |
| 2024 | MASIVE: Open-Ended Affective State Identification in English and Spanish. Nicholas Deas, Elsbeth Turcan, Iván Pérez Mejía, Kathleen R. McKeown |
| 2024 | MAgIC: Investigation of Large Language Model Powered Multi-Agent in Cognition, Adaptability, Rationality and Collaboration. Lin Xu, Zhiyuan Hu, Daquan Zhou, Hongyu Ren, Zhen Dong, Kurt Keutzer, See-Kiong Ng, Jiashi Feng |
| 2024 | MEANT: Multimodal Encoder for Antecedent Information. Benjamin Irving, Annika Marie Schoene |
| 2024 | MIBench: Evaluating Multimodal Large Language Models over Multiple Images. Haowei Liu, Xi Zhang, Haiyang Xu, Yaya Shi, Chaoya Jiang, Ming Yan, Ji Zhang, Fei Huang, Chunfeng Yuan, Bing Li, Weiming Hu |
| 2024 | MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding. Baixuan Xu, Weiqi Wang, Haochen Shi, Wenxuan Ding, Huihao Jing, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Long Chen, Yangqiu Song |
| 2024 | MIPD: Exploring Manipulation and Intention In a Novel Corpus of Polish Disinformation. Arkadiusz Modzelewski, Giovanni Da San Martino, Pavel Savov, Magdalena Wilczynska, Adam Wierzbicki |
| 2024 | MLLM-Protector: Ensuring MLLM's Safety without Hurting Performance. Renjie Pi, Tianyang Han, Jianshu Zhang, Yueqi Xie, Rui Pan, Qing Lian, Hanze Dong, Jipeng Zhang, Tong Zhang |
| 2024 | MMNeuron: Discovering Neuron-Level Domain-Specific Interpretation in Multimodal Large Language Model. Jiahao Huo, Yibo Yan, Boren Hu, Yutao Yue, Xuming Hu |
| 2024 | MMTE: Corpus and Metrics for Evaluating Machine Translation Quality of Metaphorical Language. Shun Wang, Ge Zhang, Han Wu, Tyler Loakman, Wenhao Huang, Chenghua Lin |
| 2024 | MMoE: Enhancing Multimodal Models with Mixtures of Multimodal Interaction Experts. Haofei Yu, Zhengyang Qi, Lawrence Jang, Russ Salakhutdinov, Louis-Philippe Morency, Paul Pu Liang |
| 2024 | MORPHEUS: Modeling Role from Personalized Dialogue History by Exploring and Utilizing Latent Space. Yihong Tang, Bo Wang, Dongming Zhao, Jinxiaojia Jinxiaojia, Zhangjijun Zhangjijun, Ruifang He, Yuexian Hou |
| 2024 | MOSEL: 950, 000 Hours of Speech Data for Open-Source Speech Foundation Model Training on EU Languages. Marco Gaido, Sara Papi, Luisa Bentivogli, Alessio Brutti, Mauro Cettolo, Roberto Gretter, Marco Matassoni, Mohamed Nabih, Matteo Negri |
| 2024 | MOSEL: Inference Serving Using Dynamic Modality Selection. Bodun Hu, Le Xu, Jeongyoon Moon, Neeraja J. Yadwadkar, Aditya Akella |
| 2024 | MP2D: An Automated Topic Shift Dialogue Generation Framework Leveraging Knowledge Graphs. Yerin Hwang, Yongil Kim, Yunah Jang, Jeesoo Bang, Hyunkyung Bae, Kyomin Jung |
| 2024 | MQuinE: a Cure for "Z-paradox" in Knowledge Graph Embedding. Yang Liu, Huang Fang, Yunfeng Cai, Mingming Sun |
| 2024 | MSI-Agent: Incorporating Multi-Scale Insight into Embodied Agents for Superior Planning and Decision-Making. Dayuan Fu, Biqing Qi, Yihuai Gao, Che Jiang, Guanting Dong, Bowen Zhou |
| 2024 | MT-Eval: A Multi-Turn Capabilities Evaluation Benchmark for Large Language Models. Wai-Chung Kwan, Xingshan Zeng, Yuxin Jiang, Yufei Wang, Liangyou Li, Lifeng Shang, Xin Jiang, Qun Liu, Kam-Fai Wong |
| 2024 | MTA4DPR: Multi-Teaching-Assistants Based Iterative Knowledge Distillation for Dense Passage Retrieval. Qixi Lu, Endong Xun, Gongbo Tang |
| 2024 | MTLS: Making Texts into Linguistic Symbols. Wenlong Fei, Xiaohua Wang, Min Hu, Qingyu Zhang, Hongbo Li |
| 2024 | MaPPER: Multimodal Prior-guided Parameter Efficient Tuning for Referring Expression Comprehension. Ting Liu, Zunnan Xu, Yue Hu, Liangtao Shi, Zhiqiang Wang, Quanjun Yin |
| 2024 | Major Entity Identification: A Generalizable Alternative to Coreference Resolution. Kawshik Sundar, Shubham Toshniwal, Makarand Tapaswi, Vineet Gandhi |
| 2024 | Make Some Noise: Unlocking Language Model Parallel Inference Capability through Noisy Training. Yixuan Wang, Xianzhen Luo, Fuxuan Wei, Yijun Liu, Qingfu Zhu, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che |
| 2024 | Making Large Language Models Better Reasoners with Orchestrated Streaming Experiences. Xiangyang Liu, Junliang He, Xipeng Qiu |
| 2024 | MatchTime: Towards Automatic Soccer Game Commentary Generation. Jiayuan Rao, Haoning Wu, Chang Liu, Yanfeng Wang, Weidi Xie |
| 2024 | Mathador-LM: A Dynamic Benchmark for Mathematical Reasoning on Large Language Models. Eldar Kurtic, Amir Moeini, Dan Alistarh |
| 2024 | Matryoshka-Adaptor: Unsupervised and Supervised Tuning for Smaller Embedding Dimensions. Jinsung Yoon, Rajarishi Sinha, Sercan Ömer Arik, Tomas Pfister |
| 2024 | Measuring Psychological Depth in Language Models. Fabrice Harel-Canada, Hanyu Zhou, Sreya Muppalla, Zeynep Yildiz, Miryung Kim, Amit Sahai, Nanyun Peng |
| 2024 | MedAdapter: Efficient Test-Time Adaptation of Large Language Models Towards Medical Reasoning. Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Haotian Sun, Hang Wu, Carl Yang, May Dongmei Wang |
| 2024 | MedCoT: Medical Chain of Thought via Hierarchical Expert. Jiaxiang Liu, Yuan Wang, Jiawei Du, Joey Zhou, Zuozhu Liu |
| 2024 | MedReadMe: A Systematic Study for Fine-grained Sentence Readability in Medical Domain. Chao Jiang, Wei Xu |
| 2024 | MediTOD: An English Dialogue Dataset for Medical History Taking with Comprehensive Annotations. Vishal Vivek Saley, Goonjan Saha, Rocktim Jyoti Das, Dinesh Raghu, Mausam |
| 2024 | Media Attitude Detection via Framing Analysis with Events and their Relations. Jin Zhao, Jingxuan Tu, Han Du, Nianwen Xue |
| 2024 | Medical Adaptation of Large Language and Vision-Language Models: Are We Making Progress? Daniel P. Jeong, Saurabh Garg, Zachary C. Lipton, Michael Oberst |
| 2024 | MemeCLIP: Leveraging CLIP Representations for Multimodal Meme Classification. Siddhant Bikram Shah, Shuvam Shiwakoti, Maheep Chaudhary, Haohan Wang |
| 2024 | Memorize Step by Step: Efficient Long-Context Prefilling with Incremental Memory and Decremental Chunk. Zhiyuan Zeng, Qipeng Guo, Xiaoran Liu, Zhangyue Yin, Wentao Shu, Mianqiu Huang, Bo Wang, Yunhua Zhou, Linlin Li, Qun Liu, Xipeng Qiu |
| 2024 | Memory-Efficient Fine-Tuning of Transformers via Token Selection. Antoine Simoulin, Namyong Park, Xiaoyi Liu, Grey Yang |
| 2024 | Mentor-KD: Making Small Language Models Better Multi-step Reasoners. HoJae Lee, Junho Kim, SangKeun Lee |
| 2024 | Message Passing on Semantic-Anchor-Graphs for Fine-grained Emotion Representation Learning and Classification. Pinyi Zhang, Jingyang Chen, Junchen Shen, Zijie Zhai, Ping Li, Jie Zhang, Kai Zhang |
| 2024 | MetaGPT: Merging Large Language Models Using Model Exclusive Task Arithmetic. Yuyan Zhou, Liang Song, Bingning Wang, Weipeng Chen |
| 2024 | MetaReflection: Learning Instructions for Language Agents using Past Reflections. Priyanshu Gupta, Shashank Kirtania, Ananya Singha, Sumit Gulwani, Arjun Radhakrishna, Gustavo Soares, Sherry Shi |
| 2024 | Methods of Automatic Matrix Language Determination for Code-Switched Speech. Olga Iakovenko, Thomas Hain |
| 2024 | Metrics for What, Metrics for Whom: Assessing Actionability of Bias Evaluation Metrics in NLP. Pieter Delobelle, Giuseppe Attanasio, Debora Nozza, Su Lin Blodgett, Zeerak Talat |
| 2024 | MiTTenS: A Dataset for Evaluating Gender Mistranslation. Kevin Robinson, Sneha Kudugunta, Romina Stella, Sunipa Dev, Jasmijn Bastings |
| 2024 | Middleware for LLMs: Tools Are Instrumental for Language Agents in Complex Environments. Yu Gu, Yiheng Shu, Hao Yu, Xiao Liu, Yuxiao Dong, Jie Tang, Jayanth Srinivasa, Hugo Latapie, Yu Su |
| 2024 | MiniCheck: Efficient Fact-Checking of LLMs on Grounding Documents. Liyan Tang, Philippe Laban, Greg Durrett |
| 2024 | MiniConGTS: A Near Ultimate Minimalist Contrastive Grid Tagging Scheme for Aspect Sentiment Triplet Extraction. Qiao Sun, Liujia Yang, Minghao Ma, Nanyang Ye, Qinying Gu |
| 2024 | MirrorStories: Reflecting Diversity through Personalized Narrative Generation with Large Language Models. Sarfaroz Yunusov, Hamza Sidat, Ali Emami |
| 2024 | MisinfoEval: Generative AI in the Era of "Alternative Facts". Saadia Gabriel, Liang Lyu, James Siderius, Marzyeh Ghassemi, Jacob Andreas, Asuman E. Ozdaglar |
| 2024 | Mitigate Extrinsic Social Bias in Pre-trained Language Models via Continuous Prompts Adjustment. Yiwei Dai, Hengrui Gu, Ying Wang, Xin Wang |
| 2024 | Mitigating Frequency Bias and Anisotropy in Language Model Pre-Training with Syntactic Smoothing. Richard Diehl Martinez, Zébulon Goriely, Andrew Caines, Paula Buttery, Lisa Beinborn |
| 2024 | Mitigating Language Bias of LMMs in Social Intelligence Understanding with Virtual Counterfactual Calibration. Peng Chen, Xiao-Yu Guo, Yuan-Fang Li, Xiaowang Zhang, Zhiyong Feng |
| 2024 | Mitigating Matthew Effect: Multi-Hypergraph Boosted Multi-Interest Self-Supervised Learning for Conversational Recommendation. Yongsen Zheng, Ruilin Xu, Guohua Wang, Liang Lin, Kwok-Yan Lam |
| 2024 | Mitigating Open-Vocabulary Caption Hallucinations. Assaf Ben-Kish, Moran Yanuka, Morris Alper, Raja Giryes, Hadar Averbuch-Elor |
| 2024 | Mitigating Training Imbalance in LLM Fine-Tuning via Selective Parameter Merging. Yiming Ju, Ziyi Ni, Xingrun Xing, Zhixiong Zeng, Hanyu Zhao, Siqi Fan, Zheng Zhang |
| 2024 | Mitigating the Alignment Tax of RLHF. Yong Lin, Hangyu Lin, Wei Xiong, Shizhe Diao, Jianmeng Liu, Jipeng Zhang, Rui Pan, Haoxiang Wang, Wenbin Hu, Hanning Zhang, Hanze Dong, Renjie Pi, Han Zhao, Nan Jiang, Heng Ji, Yuan Yao, Tong Zhang |
| 2024 | Mitigating the Impact of Reference Quality on Evaluation of Summarization Systems with Reference-Free Metrics. Théo Gigant, Camille Guinaudeau, Marc Decombas, Frédéric Dufaux |
| 2024 | Mitigating the Language Mismatch and Repetition Issues in LLM-based Machine Translation via Model Editing. Weichuan Wang, Zhaoyi Li, Defu Lian, Chen Ma, Linqi Song, Ying Wei |
| 2024 | MixGR: Enhancing Retriever Generalization for Scientific Domain through Complementary Granularity. Fengyu Cai, Xinran Zhao, Tong Chen, Sihao Chen, Hongming Zhang, Iryna Gurevych, Heinz Koeppl |
| 2024 | Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules. Zhuocheng Gong, Ang Lv, Jian Guan, Wei Wu, Huishuai Zhang, Minlie Huang, Dongyan Zhao, Rui Yan |
| 2024 | Mixture-of-Skills: Learning to Optimize Data Usage for Fine-Tuning Large Language Models. Minghao Wu, Thuy-Trang Vu, Lizhen Qu, Reza Haf |
| 2024 | Mixture-of-Subspaces in Low-Rank Adaptation. Taiqiang Wu, Jiahao Wang, Zhe Zhao, Ngai Wong |
| 2024 | MoCoKGC: Momentum Contrast Entity Encoding for Knowledge Graph Completion. Qingyang Li, Yanru Zhong, Yuchu Qin |
| 2024 | MoDULA: Mixture of Domain-Specific and Universal LoRA for Multi-Task Learning. Yufei Ma, Zihan Liang, Huangyu Dai, Ben Chen, Dehong Gao, Zhuoran Ran, Zihan Wang, Linbo Jin, Wen Jiang, Guannan Zhang, Xiaoyan Cai, Libin Yang |
| 2024 | ModSCAN: Measuring Stereotypical Bias in Large Vision-Language Models from Vision and Language Modalities. Yukun Jiang, Zheng Li, Xinyue Shen, Yugeng Liu, Michael Backes, Yang Zhang |
| 2024 | Model Balancing Helps Low-data Training and Fine-tuning. Zihang Liu, Yuanzhe Hu, Tianyu Pang, Yefan Zhou, Pu Ren, Yaoqing Yang |
| 2024 | Model Editing Harms General Abilities of Large Language Models: Regularization to the Rescue. Jia-Chen Gu, Hao-Xiang Xu, Jun-Yu Ma, Pan Lu, Zhen-Hua Ling, Kai-Wei Chang, Nanyun Peng |
| 2024 | Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation. Jirui Qi, Gabriele Sarti, Raquel Fernández, Arianna Bisazza |
| 2024 | Model-based Preference Optimization in Abstractive Summarization without Human Feedback. Jaepill Choi, Kyubyung Chae, Jiwoo Song, Yohan Jo, Taesup Kim |
| 2024 | Modeling Layout Reading Order as Ordering Relations for Visually-rich Document Understanding. Chong Zhang, Yi Tu, Yixi Zhao, Chenshu Yuan, Huan Chen, Yue Zhang, Mingxu Chai, Ya Guo, Huijia Zhu, Qi Zhang, Tao Gui |
| 2024 | Modeling Nonnative Sentence Processing with L2 Language Models. Tatsuya Aoyama, Nathan Schneider |
| 2024 | Modeling User Preferences with Automatic Metrics: Creating a High-Quality Preference Dataset for Machine Translation. Sweta Agrawal, José Guilherme Camargo de Souza, Ricardo Rei, António Farinhas, Gonçalo Rui Alves Faria, Patrick Fernandes, Nuno Miguel Guerreiro, André F. T. Martins |
| 2024 | Modular Pluralism: Pluralistic Alignment via Multi-LLM Collaboration. Shangbin Feng, Taylor Sorensen, Yuhan Liu, Jillian Fisher, Chan Young Park, Yejin Choi, Yulia Tsvetkov |
| 2024 | MolTRES: Improving Chemical Language Representation Learning for Molecular Property Prediction. Jun-Hyung Park, Yeachan Kim, Mingyu Lee, Hyuntae Park, SangKeun Lee |
| 2024 | Moral Foundations of Large Language Models. Marwa Abdulhai, Gregory Serapio-García, Clément Crepy, Daria Valter, John Canny, Natasha Jaques |
| 2024 | More DWUGs: Extending and Evaluating Word Usage Graph Datasets in Multiple Languages. Dominik Schlechtweg, Pierluigi Cassotti, Bill Noble, David Alfter, Sabine Schulte im Walde, Nina Tahmasebi |
| 2024 | More Insightful Feedback for Tutoring: Enhancing Generation Mechanisms and Automatic Evaluation. Wencke Liermann, Jin-Xia Huang, Yohan Lee, Kong Joo Lee |
| 2024 | More Than Catastrophic Forgetting: Integrating General Capabilities For Domain-Specific LLMs. Chengyuan Liu, Yangyang Kang, Shihang Wang, Lizhi Qing, Fubang Zhao, Chao Wu, Changlong Sun, Kun Kuang, Fei Wu |
| 2024 | MuMath-Code: Combining Tool-Use Large Language Models with Multi-perspective Data Augmentation for Mathematical Reasoning. Shuo Yin, Weihao You, Zhilong Ji, Guoqiang Zhong, Jinfeng Bai |
| 2024 | Multi-Dialect Vietnamese: Task, Dataset, Baseline Models and Challenges. Nguyen Dinh, Thanh Dang, Luan Thanh Nguyen, Kiet Van Nguyen |
| 2024 | Multi-Granularity History and Entity Similarity Learning for Temporal Knowledge Graph Reasoning. Shi Mingcong, Chunjiang Zhu, Detian Zhang, Shiting Wen, Qing Li |
| 2024 | Multi-Level Cross-Modal Alignment for Speech Relation Extraction. Liang Zhang, Zhen Yang, Biao Fu, Ziyao Lu, Liangying Shao, Shiyu Liu, Fandong Meng, Jie Zhou, Xiaoli Wang, Jinsong Su |
| 2024 | Multi-Level Information Retrieval Augmented Generation for Knowledge-based Visual Question Answering. Omar Adjali, Olivier Ferret, Sahar Ghannay, Hervé Le Borgne |
| 2024 | Multi-LogiEval: Towards Evaluating Multi-Step Logical Reasoning Ability of Large Language Models. Nisarg Patel, Mohith Kulkarni, Mihir Parmar, Aashna Budhiraja, Mutsumi Nakamura, Neeraj Varshney, Chitta Baral |
| 2024 | Multi-News+: Cost-efficient Dataset Cleansing via LLM-based Data Annotation. Juhwan Choi, Jungmin Yun, Kyohoon Jin, YoungBin Kim |
| 2024 | Multi-expert Prompting Improves Reliability, Safety and Usefulness of Large Language Models. Do Xuan Long, Duong Ngoc Yen, Anh Tuan Luu, Kenji Kawaguchi, Min-Yen Kan, Nancy F. Chen |
| 2024 | Multi-pass Decoding for Grammatical Error Correction. Xiaoying Wang, Lingling Mu, Jingyi Zhang, Hongfei Xu |
| 2024 | Multilingual Topic Classification in X: Dataset and Analysis. Dimosthenis Antypas, Asahi Ushio, Francesco Barbieri, José Camacho-Collados |
| 2024 | Multimodal Clickbait Detection by De-confounding Biases Using Causal Representation Inference. Jianxing Yu, Shiqi Wang, Han Yin, Zhenlong Sun, Ruobing Xie, Bo Zhang, Yanghui Rao |
| 2024 | Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model. Wenqi Zhang, Zhenglin Cheng, Yuanyu He, Mengna Wang, Yongliang Shen, Zeqi Tan, Guiyang Hou, Mingqian He, Yanna Ma, Weiming Lu, Yueting Zhuang |
| 2024 | Multiple Sources are Better Than One: Incorporating External Knowledge in Low-Resource Glossing. Changbing Yang, Garrett Nicolai, Miikka Silfverberg |
| 2024 | Muting Whisper: A Universal Acoustic Adversarial Attack on Speech Foundation Models. Vyas Raina, Rao Ma, Charles McGhee, Kate M. Knill, Mark J. F. Gales |
| 2024 | M²PT: Multimodal Prompt Tuning for Zero-shot Instruction Learning. Taowen Wang, Yiyang Liu, James Liang, Junhan Zhao, Yiming Cui, Yuning Mao, Shaoliang Nie, Jiahao Liu, Fuli Feng, Zenglin Xu, Cheng Han, Lifu Huang, Qifan Wang, Dongfang Liu |
| 2024 | NLEBench+NorGLM: A Comprehensive Empirical Analysis and Benchmark Dataset for Generative Language Models in Norwegian. Peng Liu, Lemei Zhang, Terje Nissen Farup, Even W. Lauvrak, Jon Espen Ingvaldsen, Simen Eide, Jon Atle Gulla, Zhirong Yang |
| 2024 | Nash CoT: Multi-Path Inference with Preference Equilibrium. Ziqi Zhang, Cunxiang Wang, Xiao Xiong, Yue Zhang, Donglin Wang |
| 2024 | Nearest Neighbor Normalization Improves Multimodal Retrieval. Neil Chowdhury, Franklin Wang, Sumedh Shenoy, Douwe Kiela, Sarah Schwettmann, Tristan Thrush |
| 2024 | Neeko: Leveraging Dynamic LoRA for Efficient Multi-Character Role-Playing Agent. Xiaoyan Yu, Tongxu Luo, Yifan Wei, Fangyu Lei, Yiming Huang, Hao Peng, Liehuang Zhu |
| 2024 | NeuroTrialNER: An Annotated Corpus for Neurological Diseases and Therapies in Clinical Trial Registries. Simona Doneva, Tilia Ellendorff, Beate Sick, Jean-Philippe Goldman, Amelia Cannon, Gerold Schneider, Benjamin Ineichen |
| 2024 | Neuron Specialization: Leveraging Intrinsic Task Modularity for Multilingual Machine Translation. Shaomu Tan, Di Wu, Christof Monz |
| 2024 | Neuron-Level Knowledge Attribution in Large Language Models. Zeping Yu, Sophia Ananiadou |
| 2024 | No Culture Left Behind: ArtELingo-28, a Benchmark of WikiArt with Captions in 28 Languages. Youssef Mohamed, Runjia Li, Ibrahim Said Ahmad, Kilichbek Haydarov, Philip Torr, Kenneth Church, Mohamed Elhoseiny |
| 2024 | Noise, Novels, Numbers. A Framework for Detecting and Categorizing Noise in Danish and Norwegian Literature. Ali Al-Laith, Daniel Hershcovich, Jens Bjerring-Hansen, Jakob Parby, Alexander Conroy, Timothy Tangherlini |
| 2024 | NoiseBench: Benchmarking the Impact of Real Label Noise on Named Entity Recognition. Elena Merdjanovska, Ansar Aynetdinov, Alan Akbik |
| 2024 | Not All Contexts Are Equal: Teaching LLMs Credibility-aware Generation. Ruotong Pan, Boxi Cao, Hongyu Lin, Xianpei Han, Jia Zheng, Sirui Wang, Xunliang Cai, Le Sun |
| 2024 | Not Everything is All You Need: Toward Low-Redundant Optimization for Large Language Model Alignment. Zhipeng Chen, Kun Zhou, Xin Zhao, Jingyuan Wang, Ji-Rong Wen |
| 2024 | NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data. Sergei Bogdanov, Alexandre Constantin, Timothée Bernard, Benoît Crabbé, Etienne Bernard |
| 2024 | Null-Shot Prompting: Rethinking Prompting Large Language Models With Hallucination. Pittawat Taveekitworachai, Febri Abdullah, Ruck Thawonmas |
| 2024 | NumeroLogic: Number Encoding for Enhanced LLMs' Numerical Reasoning. Eli Schwartz, Leshem Choshen, Joseph Shtok, Sivan Doveh, Leonid Karlinsky, Assaf Arbelle |
| 2024 | OATH-Frames: Characterizing Online Attitudes Towards Homelessness with LLM Assistants. Jaspreet Ranjit, Brihi Joshi, Rebecca Dorn, Laura Petry, Olga Koumoundouros, Jayne Bottarini, Peichen Liu, Eric Rice, Swabha Swayamdipta |
| 2024 | ORPO: Monolithic Preference Optimization without Reference Model. Jiwoo Hong, Noah Lee, James Thorne |
| 2024 | Oddballs and Misfits: Detecting Implicit Abuse in Which Identity Groups are Depicted as Deviating from the Norm. Michael Wiegand, Josef Ruppenhofer |
| 2024 | OmAgent: A Multi-modal Agent Framework for Complex Video Understanding with Task Divide-and-Conquer. Lu Zhang, Tiancheng Zhao, Heting Ying, Yibo Ma, Kyusong Lee |
| 2024 | On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning. Geewook Kim, Minjoon Seo |
| 2024 | On Eliciting Syntax from Language Models via Hashing. Yiran Wang, Masao Utiyama |
| 2024 | On Fake News Detection with LLM Enhanced Semantics Mining. Xiaoxiao Ma, Yuchen Zhang, Kaize Ding, Jian Yang, Jia Wu, Hao Fan |
| 2024 | On Mitigating Performance Disparities in Multilingual Speech Recognition. Monorama Swain, Anna Zee, Anders Søgaard |
| 2024 | On Sensitivity of Learning with Limited Labelled Data to the Effects of Randomness: Impact of Interactions and Systematic Choices. Branislav Pecher, Ivan Srba, Mária Bieliková |
| 2024 | On Training Data Influence of GPT Models. Yekun Chai, Qingyi Liu, Shuohuan Wang, Yu Sun, Qiwei Peng, Hua Wu |
| 2024 | On the Fragility of Active Learners for Text Classification. Abhishek Ghose, Emma Nguyen |
| 2024 | On the In-context Generation of Language Models. Zhongtao Jiang, Yuanzhe Zhang, Kun Luo, Xiaowei Yuan, Jun Zhao, Kang Liu |
| 2024 | On the Influence of Gender and Race in Romantic Relationship Prediction from Large Language Models. Abhilasha Sancheti, Haozhe An, Rachel Rudinger |
| 2024 | On the Proper Treatment of Tokenization in Psycholinguistics. Mario Giulianelli, Luca Malagutti, Juan Luis Gastaldi, Brian DuSell, Tim Vieira, Ryan Cotterell |
| 2024 | On the Relationship between Truth and Political Bias in Language Models. Suyash Fulay, William Brannon, Shrestha Mohanty, Cassandra Overney, Elinor Poole-Dayan, Deb Roy, Jad Kabbara |
| 2024 | On the Reliability of Psychological Scales on Large Language Models. Jen-tse Huang, Wenxiang Jiao, Man Ho Lam, Eric John Li, Wenxuan Wang, Michael R. Lyu |
| 2024 | On the Robustness of Editing Large Language Models. Xinbei Ma, Tianjie Ju, Jiyang Qiu, Zhuosheng Zhang, Hai Zhao, Lifeng Liu, Yulong Wang |
| 2024 | On the Role of Context in Reading Time Prediction. Andreas Opedal, Eleanor Chodroff, Ryan Cotterell, Ethan Wilcox |
| 2024 | On the Universal Truthfulness Hyperplane Inside LLMs. Junteng Liu, Shiqi Chen, Yu Cheng, Junxian He |
| 2024 | One Thousand and One Pairs: A "novel" challenge for long-context language models. Marzena Karpinska, Katherine Thai, Kyle Lo, Tanya Goyal, Mohit Iyyer |
| 2024 | One-to-Many Communication and Compositionality in Emergent Communication. Heeyoung Lee |
| 2024 | One2Set + Large Language Model: Best Partners for Keyphrase Generation. Liangying Shao, Liang Zhang, Minlong Peng, Guoqi Ma, Hao Yue, Mingming Sun, Jinsong Su |
| 2024 | OneNet: A Fine-Tuning Free Framework for Few-Shot Entity Linking via Large Language Model Prompting. Xukai Liu, Ye Liu, Kai Zhang, Kehang Wang, Qi Liu, Enhong Chen |
| 2024 | Ontologically Faithful Generation of Non-Player Character Dialogues. Nathaniel Weir, Ryan Thomas, Randolph D'Amore, Kellie Hill, Benjamin Van Durme, Harsh Jhamtani |
| 2024 | Open-world Multi-label Text Classification with Extremely Weak Supervision. Xintong Li, Jinya Jiang, Ria Dharmani, Jayanth Srinivasa, Gaowen Liu, Jingbo Shang |
| 2024 | OpenSep: Leveraging Large Language Models with Textual Inversion for Open World Audio Separation. Tanvir Mahmud, Diana Marculescu |
| 2024 | Optimized Speculative Sampling for GPU Hardware Accelerators. Dominik Wagner, Seanie Lee, Ilja Baumann, Philipp Seeberger, Korbinian Riedhammer, Tobias Bocklet |
| 2024 | Optimizing Chinese Lexical Simplification Across Word Types: A Hybrid Approach. Zihao Xiao, Jiefu Gong, Shijin Wang, Wei Song |
| 2024 | Optimizing Code Retrieval: High-Quality and Scalable Dataset Annotation through Large Language Models. Rui Li, Qi Liu, Liyang He, Zheng Zhang, Hao Zhang, Shengyu Ye, Junyu Lu, Zhenya Huang |
| 2024 | Optimizing Instructions and Demonstrations for Multi-Stage Language Model Programs. Krista Opsahl-Ong, Michael J. Ryan, Josh Purtell, David Broman, Christopher Potts, Matei Zaharia, Omar Khattab |
| 2024 | Optimizing Language Models with Fair and Stable Reward Composition in Reinforcement Learning. Jiahui Li, Hanlin Zhang, Fengda Zhang, Tai-Wei Chang, Kun Kuang, Long Chen, Jun Zhou |
| 2024 | Optimizing Rare Word Accuracy in Direct Speech Translation with a Retrieval-and-Demonstration Approach. Siqi Li, Danni Liu, Jan Niehues |
| 2024 | Order of Magnitude Speedups for LLM Membership Inference. Rongting Zhang, Martín Bertrán, Aaron Roth |
| 2024 | Ouroboros: Generating Longer Drafts Phrase by Phrase for Faster Speculative Decoding. Weilin Zhao, Yuxiang Huang, Xu Han, Wang Xu, Chaojun Xiao, Xinrong Zhang, Yewei Fang, Kaihuo Zhang, Zhiyuan Liu, Maosong Sun |
| 2024 | Outcome-Constrained Large Language Models for Countering Hate Speech. Lingzi Hong, Pengcheng Luo, Eduardo Blanco, Xiaoying Song |
| 2024 | Overcome Noise and Bias: Segmentation-Aided Multi-Granularity Denoising and Debiasing for Enhanced Quarduples Extraction in Dialogue. Xianlong Luo, Meng Yang, Yihao Wang |
| 2024 | PALM: Few-Shot Prompt Learning for Audio Language Models. Asif Hanif, Maha Tufail Agro, Mohammad Areeb Qazi, Hanan Aldarmaki |
| 2024 | PANDA: Persona Attributes Navigation for Detecting and Alleviating Overuse Problem in Large Language Models. Jinsung Kim, Seonmin Koo, Heuiseok Lim |
| 2024 | PARIKSHA: A Large-Scale Investigation of Human-LLM Evaluator Agreement on Multilingual and Multi-Cultural Data. Ishaan Watts, Varun Gumma, Aditya Yadavalli, Vivek Seshadri, Manohar Swaminathan, Sunayana Sitaram |
| 2024 | PATIENT-ψ: Using Large Language Models to Simulate Patients for Training Mental Health Professionals. Ruiyi Wang, Stephanie Milani, Jamie C. Chiu, Jiayin Zhi, Shaun M. Eack, Travis Labrum, Samuel M. Murphy, Nev Jones, Kate Hardy, Hong Shen, Fei Fang, Zhiyu Chen |
| 2024 | PCQPR: Proactive Conversational Question Planning with Reflection. Shasha Guo, Lizi Liao, Jing Zhang, Cuiping Li, Hong Chen |
| 2024 | PREDICT: Multi-Agent-based Debate Simulation for Generalized Hate Speech Detection. Someen Park, Jaehoon Kim, Seungwan Jin, Sohyun Park, Kyungsik Han |
| 2024 | PRompt Optimization in Multi-Step Tasks (PROMST): Integrating Human Feedback and Heuristic-based Sampling. Yongchao Chen, Jacob Arkin, Yilun Hao, Yang Zhang, Nicholas Roy, Chuchu Fan |
| 2024 | PSC: Extending Context Window of Large Language Models via Phase Shift Calibration. Wenqiao Zhu, Chao Xu, Lulu Wang, Jun Wu |
| 2024 | PTD-SQL: Partitioning and Targeted Drilling with LLMs in Text-to-SQL. Ruilin Luo, Liyuan Wang, Binghuai Lin, Zicheng Lin, Yujiu Yang |
| 2024 | PairDistill: Pairwise Relevance Distillation for Dense Retrieval. Chao-Wei Huang, Yun-Nung Chen |
| 2024 | Parameter-Efficient Sparsity Crafting from Dense to Mixture-of-Experts for Instruction Tuning on General Tasks. Haoyuan Wu, Haisheng Zheng, Zhuolun He, Bei Yu |
| 2024 | Paraphrase Types Elicit Prompt Engineering Capabilities. Jan Philip Wahle, Terry Ruas, Yang Xu, Bela Gipp |
| 2024 | Pcc-tuning: Breaking the Contrastive Learning Ceiling in Semantic Textual Similarity. Bowen Zhang, Chunping Li |
| 2024 | Pelican: Correcting Hallucination in Vision-LLMs via Claim Decomposition and Program of Thought Verification. Pritish Sahu, Karan Sikka, Ajay Divakaran |
| 2024 | PepRec: Progressive Enhancement of Prompting for Recommendation. Yakun Yu, Shiang Qi, Baochun Li, Di Niu |
| 2024 | Perceptions of Linguistic Uncertainty by Language Models and Humans. Catarina G. Belém, Markelle Kelly, Mark Steyvers, Sameer Singh, Padhraic Smyth |
| 2024 | Perceptions to Beliefs: Exploring Precursory Inferences for Theory of Mind in Large Language Models. Chani Jung, Dongkwan Kim, Jiho Jin, Jiseon Kim, Yeon Seonwoo, Yejin Choi, Alice Oh, Hyunwoo Kim |
| 2024 | Performance-Guided LLM Knowledge Distillation for Efficient Text Classification at Scale. Flavio Palo, Prateek Singhi, Bilal Fadlallah |
| 2024 | Personality-aware Student Simulation for Conversational Intelligent Tutoring Systems. Zhengyuan Liu, Stella Xin Yin, Geyu Lin, Nancy F. Chen |
| 2024 | Personalized Pieces: Efficient Personalized Large Language Models through Collaborative Efforts. Zhaoxuan Tan, Zheyuan Liu, Meng Jiang |
| 2024 | Personas as a Way to Model Truthfulness in Language Models. Nitish Joshi, Javier Rando, Abulhair Saparov, Najoung Kim, He He |
| 2024 | PhiloGPT: A Philology-Oriented Large Language Model for Ancient Chinese Manuscripts with Dunhuang as Case Study. Yuqing Zhang, Baoyi He, Yihan Chen, Hangqi Li, Han Yue, Shengyu Zhang, Huaiyong Dou, Junchi Yan, Zemin Liu, Yongquan Zhang, Fei Wu |
| 2024 | Pixology: Probing the Linguistic and Visual Capabilities of Pixel-based Language Models. Kushal Tatariya, Vladimir Araujo, Thomas Bauwens, Miryam de Lhoneux |
| 2024 | Please note that I'm just an AI: Analysis of Behavior Patterns of LLMs in (Non-)offensive Speech Identification. Esra Dönmez, Thang Vu, Agnieszka Falenska |
| 2024 | Position Engineering: Boosting Large Language Models through Positional Information Manipulation. Zhiyuan He, Huiqiang Jiang, Zilong Wang, Yuqing Yang, Luna Qiu, Lili Qiu |
| 2024 | PostMark: A Robust Blackbox Watermark for Large Language Models. Yapei Chang, Kalpesh Krishna, Amir Houmansadr, John Wieting, Mohit Iyyer |
| 2024 | PrExMe! Large Scale Prompt Exploration of Open Source LLMs for Machine Translation and Summarization Evaluation. Christoph Leiter, Steffen Eger |
| 2024 | Pragmatic Norms Are All You Need - Why The Symbol Grounding Problem Does Not Apply to LLMs. Reto Gubelmann |
| 2024 | Pre-trained Language Models Do Not Help Auto-regressive Text-to-Image Generation. Yuhui Zhang, Brandon McKinzie, Zhe Gan, Vaishaal Shankar, Alexander Toshev |
| 2024 | Pre-training Cross-lingual Open Domain Question Answering with Large-scale Synthetic Supervision. Fan Jiang, Tom Drummond, Trevor Cohn |
| 2024 | PreAlign: Boosting Cross-Lingual Transfer by Early Establishment of Multilingual Alignment. Jiahuan Li, Shujian Huang, Aarron Ching, Xinyu Dai, Jiajun Chen |
| 2024 | Precise Model Benchmarking with Only a Few Observations. Riccardo Fogliato, Pratik Patil, Nil-Jana Akpinar, Mathew Monfort |
| 2024 | Predicate Debiasing in Vision-Language Models Integration for Scene Graph Generation Enhancement. Yuxuan Wang, Xiaoyuan Liu |
| 2024 | Predicting Rewards Alongside Tokens: Non-disruptive Parameter Insertion for Efficient Inference Intervention in Large Language Model. Chenhan Yuan, Fei Huang, Ru Peng, Keming Lu, Bowen Yu, Chang Zhou, Jingren Zhou |
| 2024 | Preference-Guided Reflective Sampling for Aligning Language Models. Hai Ye, Hwee Tou Ng |
| 2024 | Prefixing Attention Sinks can Mitigate Activation Outliers for Large Language Model Quantization. Seungwoo Son, Wonpyo Park, Woohyun Han, Kyuyeun Kim, Jaeho Lee |
| 2024 | Preserving Generalization of Language models in Few-shot Continual Relation Extraction. Quyen Tran, Nguyen Xuan Thanh, Nguyen Hoang Anh, Nam Le Hai, Trung Le, Linh Van Ngo, Thien Huu Nguyen |
| 2024 | Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality. Youngtaek Oh, Jae-Won Cho, Dong-Jin Kim, In So Kweon, Junmo Kim |
| 2024 | Pretraining Data Detection for Large Language Models: A Divergence-based Calibration Method. Weichao Zhang, Ruqing Zhang, Jiafeng Guo, Maarten de Rijke, Yixing Fan, Xueqi Cheng |
| 2024 | Pretraining Language Models Using Translationese. Meet Doshi, Raj Dabre, Pushpak Bhattacharyya |
| 2024 | Private Language Models via Truncated Laplacian Mechanism. Tianhao Huang, Tao Yang, Ivan Habernal, Lijie Hu, Di Wang |
| 2024 | Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing, EMNLP 2024, Miami, FL, USA, November 12-16, 2024 Yaser Al-Onaizan, Mohit Bansal, Yun-Nung Chen |
| 2024 | Prometheus 2: An Open Source Language Model Specialized in Evaluating Other Language Models. Seungone Kim, Juyoung Suk, Shayne Longpre, Bill Yuchen Lin, Jamin Shin, Sean Welleck, Graham Neubig, Moontae Lee, Kyungjae Lee, Minjoon Seo |
| 2024 | PromptReps: Prompting Large Language Models to Generate Dense and Sparse Representations for Zero-Shot Document Retrieval. Shengyao Zhuang, Xueguang Ma, Bevan Koopman, Jimmy Lin, Guido Zuccon |
| 2024 | Prompts have evil twins. Rimon Melamed, Lucas H. McCabe, Tanay Wakhare, Yejin Kim, H. Howie Huang, Enric Boix-Adserà |
| 2024 | Pron vs Prompt: Can Large Language Models already Challenge a World-Class Fiction Author at Creative Text Writing? Guillermo Marco, Julio Gonzalo, María Teresa Mateo Girona, Ramón Santos |
| 2024 | Prove Your Point!: Bringing Proof-Enhancement Principles to Argumentative Essay Generation. Ruiyu Xiao, Lei Wu, Yuhang Gou, Weinan Zhang, Ting Liu |
| 2024 | Pruning via Merging: Compressing LLMs via Manifold Alignment Based Layer Merging. Deyuan Liu, Zhanyue Qin, Hairu Wang, Zhao Yang, Zecheng Wang, Fangying Rong, Qingbin Liu, Yanchao Hao, Bo Li, Xi Chen, Cunhang Fan, Zhao Lv, Dianhui Chu, Zhiying Tu, Dianbo Sui |
| 2024 | PsFuture: A Pseudo-Future-based Zero-Shot Adaptive Policy for Simultaneous Machine Translation. Libo Zhao, Jing Li, Ziqian Zeng |
| 2024 | PsyGUARD: An Automated System for Suicide Detection and Risk Assessment in Psychological Counseling. Huachuan Qiu, Lizhi Ma, Zhenzhong Lan |
| 2024 | Puzzle Solving using Reasoning of Large Language Models: A Survey. Panagiotis Giadikiaroglou, Maria Lymperaiou, Giorgos Filandrianos, Giorgos Stamou |
| 2024 | Python is Not Always the Best Choice: Embracing Multilingual Program of Thoughts. Xianzhen Luo, Qingfu Zhu, Zhiming Zhang, Libo Qin, Xuanyu Zhang, Qing Yang, Dongliang Xu, Wanxiang Che |
| 2024 | QGEval: Benchmarking Multi-dimensional Evaluation for Question Generation. Weiping Fu, Bifan Wei, Jianxiang Hu, Zhongmin Cai, Jun Liu |
| 2024 | QUDSELECT: Selective Decoding for Questions Under Discussion Parsing. Ashima Suvarna, Xiao Liu, Tanmay Parekh, Kai-Wei Chang, Nanyun Peng |
| 2024 | QUIK: Towards End-to-end 4-Bit Inference on Generative Large Language Models. Saleh Ashkboos, Ilia Markov, Elias Frantar, Tingxuan Zhong, Xincheng Wang, Jie Ren, Torsten Hoefler, Dan Alistarh |
| 2024 | QUITE: Quantifying Uncertainty in Natural Language Text in Bayesian Reasoning Scenarios. Timo Pierre Schrader, Lukas Lange, Simon Razniewski, Annemarie Friedrich |
| 2024 | QuBE: Question-based Belief Enhancement for Agentic LLM Reasoning. Minsoo Kim, Jongyoon Kim, Jihyuk Kim, Seung-won Hwang |
| 2024 | Quality Matters: Evaluating Synthetic Data for Tool-Using LLMs. Shadi Iskander, Sofia Tolmach, Ori Shapira, Nachshon Cohen, Zohar S. Karnin |
| 2024 | Quantifying the Gaps Between Translation and Native Perception in Training for Multimodal, Multilingual Retrieval. Kyle Buettner, Adriana Kovashka |
| 2024 | Quantum Recurrent Architectures for Text Classification. Wenduan Xu, Stephen Clark, Douglas Brown, Gabriel Matos, Konstantinos Meichanetzidis |
| 2024 | RA2FD: Distilling Faithfulness into Efficient Dialogue Systems. Zhiyuan Zhu, Yusheng Liao, Chenxin Xu, Yunfeng Guan, Yanfeng Wang, Yu Wang |
| 2024 | RAFT: Realistic Attacks to Fool Text Detectors. James Wang, Ran Li, Junfeng Yang, Chengzhi Mao |
| 2024 | RAG-QA Arena: Evaluating Domain Robustness for Long-form Retrieval Augmented Question Answering. Rujun Han, Yuhao Zhang, Peng Qi, Yumo Xu, Jenyuan Wang, Lan Liu, William Yang Wang, Bonan Min, Vittorio Castelli |
| 2024 | RAR: Retrieval-augmented retrieval for code generation in low resource languages. Avik Dutta, Mukul Singh, Gust Verbruggen, Sumit Gulwani, Vu Le |
| 2024 | RAt: Injecting Implicit Bias for Text-To-Image Prompt Refinement Models. Ziyi Kou, Shichao Pei, Meng Jiang, Xiangliang Zhang |
| 2024 | RE-RAG: Improving Open-Domain QA Performance and Interpretability with Relevance Estimator in Retrieval-Augmented Generation. Kiseung Kim, Jay-Yoon Lee |
| 2024 | REAR: A Relevance-Aware Retrieval-Augmented Framework for Open-Domain Question Answering. Yuhao Wang, Ruiyang Ren, Junyi Li, Xin Zhao, Jing Liu, Ji-Rong Wen |
| 2024 | RECANTFormer: Referring Expression Comprehension with Varying Numbers of Targets. Bhathiya Hemanthage, Hakan Bilen, Phil Bartie, Christian Dondrup, Oliver Lemon |
| 2024 | RLHF Can Speak Many Languages: Unlocking Multilingual Preference Optimization for LLMs. John Dang, Arash Ahmadian, Kelly Marchisio, Julia Kreutzer, Ahmet Üstün, Sara Hooker |
| 2024 | RSA-Control: A Pragmatics-Grounded Lightweight Controllable Text Generation Framework. Yifan Wang, Vera Demberg |
| 2024 | RULE: Reliable Multimodal RAG for Factuality in Medical Vision Language Models. Peng Xia, Kangyu Zhu, Haoran Li, Hongtu Zhu, Yun Li, Gang Li, Linjun Zhang, Huaxiu Yao |
| 2024 | RWKV-CLIP: A Robust Vision-Language Representation Learner. Tiancheng Gu, Kaicheng Yang, Xiang An, Ziyong Feng, Dongnan Liu, Weidong Cai, Jiankang Deng |
| 2024 | RaTEScore: A Metric for Radiology Report Generation. Weike Zhao, Chaoyi Wu, Xiaoman Zhang, Ya Zhang, Yanfeng Wang, Weidi Xie |
| 2024 | Ranking Manipulation for Conversational Search Engines. Samuel Pfrommer, Yatong Bai, Tanmay Gautam, Somayeh Sojoudi |
| 2024 | Rationale-Aware Answer Verification by Pairwise Self-Evaluation. Akira Kawabata, Saku Sugawara |
| 2024 | Rationalizing Transformer Predictions via End-To-End Differentiable Self-Training. Marc Felix Brinner, Sina Zarrieß |
| 2024 | Re-Evaluating Evaluation for Multilingual Summarization. Jessica Forde, Ruochen Zhang, Lintang Sutawika, Alham Fikri Aji, Samuel Cahyawijaya, Genta Indra Winata, Minghao Wu, Carsten Eickhoff, Stella Biderman, Ellie Pavlick |
| 2024 | Re-ReST: Reflection-Reinforced Self-Training for Language Agents. Zi-Yi Dou, Cheng-Fu Yang, Xueqing Wu, Kai-Wei Chang, Nanyun Peng |
| 2024 | Re-Reading Improves Reasoning in Large Language Models. Xiaohan Xu, Chongyang Tao, Tao Shen, Can Xu, Hongbo Xu, Guodong Long, Jian-Guang Lou, Shuai Ma |
| 2024 | ReCaLL: Membership Inference via Relative Conditional Log-Likelihoods. Roy Xie, Junlin Wang, Ruomin Huang, Minxing Zhang, Rong Ge, Jian Pei, Neil Gong, Bhuwan Dhingra |
| 2024 | Read Anywhere Pointed: Layout-aware GUI Screen Reading with Tree-of-Lens Grounding. Yue Fan, Lei Ding, Ching-Chen Kuo, Shan Jiang, Yang Zhao, Xinze Guan, Jie Yang, Yi Zhang, Xin Wang |
| 2024 | ReadMe++: Benchmarking Multilingual Language Models for Multi-Domain Readability Assessment. Tarek Naous, Michael J. Ryan, Anton Lavrouk, Mohit Chandra, Wei Xu |
| 2024 | RealVul: Can We Detect Vulnerabilities in Web Applications with LLM? Di Cao, Yong Liao, Xiuwei Shang |
| 2024 | Reasoning Paths with Reference Objects Elicit Quantitative Spatial Reasoning in Large Vision-Language Models. Yuan-Hong Liao, Rafid Mahmood, Sanja Fidler, David Acuna |
| 2024 | Reasoning Robustness of LLMs to Adversarial Typographical Errors. Esther Gan, Yiran Zhao, Liying Cheng, Yancan Mao, Anirudh Goyal, Kenji Kawaguchi, Min-Yen Kan, Michael Shieh |
| 2024 | Reasoning in Token Economies: Budget-Aware Evaluation of LLM Reasoning Strategies. Junlin Wang, Siddhartha Jain, Dejiao Zhang, Baishakhi Ray, Varun Kumar, Ben Athiwaratkun |
| 2024 | Reasoning or a Semblance of it? A Diagnostic Study of Transitive Reasoning in LLMs. Houman Mehrafarin, Arash Eshghi, Ioannis Konstas |
| 2024 | Rebuilding ROME : Resolving Model Collapse during Sequential Model Editing. Akshat Gupta, Sidharth Baskaran, Gopala Anumanchipalli |
| 2024 | Reconsidering Sentence-Level Sign Language Translation. Garrett Tanzer, Maximus Shengelia, Ken Harrenstien, David Uthus |
| 2024 | Reconstruct Your Previous Conversations! Comprehensively Investigating Privacy Leakage Risks in Conversations with GPT Models. Junjie Chu, Zeyang Sha, Michael Backes, Yang Zhang |
| 2024 | Recurrent Alignment with Hard Attention for Hierarchical Text Rating. Chenxi Lin, Jiayu Ren, Guoxiu He, Zhuoren Jiang, Haiyan Yu, Xiaomin Zhu |
| 2024 | Red Teaming Language Models for Processing Contradictory Dialogues. Xiaofei Wen, Bangzheng Li, Tenghao Huang, Muhao Chen |
| 2024 | Related Work and Citation Text Generation: A Survey. Xiangci Li, Jessica Ouyang |
| 2024 | Relevance Is a Guiding Light: Relevance-aware Adaptive Learning for End-to-end Task-oriented Dialogue System. Zhanpeng Chen, Zhihong Zhu, Wanshi Xu, Xianwei Zhuang, Yuexian Zou |
| 2024 | RepEval: Effective Text Evaluation with LLM Representation. Shuqian Sheng, Yi Xu, Tianhang Zhang, Zanwei Shen, Luoyi Fu, Jiaxin Ding, Lei Zhou, Xiaoying Gan, Xinbing Wang, Chenghu Zhou |
| 2024 | RepMatch: Quantifying Cross-Instance Similarities in Representation Space. Mohammad Modarres, Sina Abbasi, Mohammad Taher Pilehvar |
| 2024 | Repairs in a Block World: A New Benchmark for Handling User Corrections with Multi-Modal Language Models. Francisco Javier Chiyah Garcia, Alessandro Suglia, Arash Eshghi |
| 2024 | Representational Analysis of Binding in Language Models. Qin Dai, Benjamin Heinzerling, Kentaro Inui |
| 2024 | Resampled Datasets Are Not Enough: Mitigating Societal Bias Beyond Single Attributes. Yusuke Hirota, Jerone Theodore Alexander Andrews, Dora Zhao, Orestis Papakyriakopoulos, Apostolos Modas, Yuta Nakashima, Alice Xiang |
| 2024 | Rethinking Pragmatics in Large Language Models: Towards Open-Ended Evaluation and Preference Tuning. Shengguang Wu, Shusheng Yang, Zhenglun Chen, Qi Su |
| 2024 | Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization. Sungbin Shin, Wonpyo Park, Jaeho Lee, Namhoon Lee |
| 2024 | Rethinking Token Reduction for State Space Models. Zheng Zhan, Yushu Wu, Zhenglun Kong, Changdi Yang, Yifan Gong, Xuan Shen, Xue Lin, Pu Zhao, Yanzhi Wang |
| 2024 | Rethinking the Evaluation of In-Context Learning for LLMs. Guoxin Yu, Lemao Liu, Mo Yu, Yue Yu, Xiang Ao |
| 2024 | Rethinking the Reversal Curse of LLMs: a Prescription from Human Knowledge Reversal. Zhicong Lu, Li Jin, Peiguang Li, Yu Tian, Linhao Zhang, Sirui Wang, Guangluan Xu, Changyuan Tian, Xunliang Cai |
| 2024 | Rethinking the Role of Proxy Rewards in Language Model Alignment. Sungdong Kim, Minjoon Seo |
| 2024 | Retrieval-enriched zero-shot image classification in low-resource domains. Nicola Dall'Asen, Yiming Wang, Enrico Fini, Elisa Ricci |
| 2024 | Retrieve-Plan-Generation: An Iterative Planning and Answering Framework for Knowledge-Intensive LLM Generation. Yuanjie Lyu, Zihan Niu, Zheyong Xie, Chao Zhang, Tong Xu, Yang Wang, Enhong Chen |
| 2024 | Retrieved In-Context Principles from Previous Mistakes. Hao Sun, Yong Jiang, Bo Wang, Yingyan Hou, Yan Zhang, Pengjun Xie, Fei Huang |
| 2024 | Retrieved Sequence Augmentation for Protein Representation Learning. Chang Ma, Haiteng Zhao, Lin Zheng, Jiayi Xin, Qintong Li, Lijun Wu, Zhihong Deng, Yang Lu, Qi Liu, Sheng Wang, Lingpeng Kong |
| 2024 | Retrospex: Language Agent Meets Offline Reinforcement Learning Critic. Yufei Xiang, Yiqun Shen, Yeqin Zhang, Cam-Tu Nguyen |
| 2024 | Reuse Your Rewards: Reward Model Transfer for Zero-Shot Cross-Lingual Alignment. Zhaofeng Wu, Ananth Balashankar, Yoon Kim, Jacob Eisenstein, Ahmad Beirami |
| 2024 | Reusing Transferable Weight Increments for Low-resource Style Generation. Chunzhen Jin, Eliot Huang, Heng Chang, Yaqi Wang, Peng Cao, Osmar R. Zaïane |
| 2024 | RevMUX: Data Multiplexing with Reversible Adapters for Efficient LLM Batch Inference. Yige Xu, Xu Guo, Zhiwei Zeng, Chunyan Miao |
| 2024 | Revealing Personality Traits: A New Benchmark Dataset for Explainable Personality Recognition on Dialogues. Lei Sun, Jinming Zhao, Qin Jin |
| 2024 | Revealing the Parallel Multilingual Learning within Large Language Models. Yongyu Mu, Peinan Feng, Zhiquan Cao, Yuzhang Wu, Bei Li, Chenglong Wang, Tong Xiao, Kai Song, Tongran Liu, Chunliang Zhang, Jingbo Zhu |
| 2024 | Reverse-Engineering the Reader. Samuel Kiegeland, Ethan Wilcox, Afra Amini, David Robert Reich, Ryan Cotterell |
| 2024 | Revisiting Automated Evaluation for Long-form Table Question Answering. Yuqi Wang, Lyuhao Chen, Songcheng Cai, Zhijian Xu, Yilun Zhao |
| 2024 | Revisiting Supertagging for faster HPSG parsing. Olga Zamaraeva, Carlos Gómez-Rodríguez |
| 2024 | Revisiting Supervised Contrastive Learning for Microblog Classification. Junbo Huang, Ricardo Usbeck |
| 2024 | Revisiting Who's Harry Potter: Towards Targeted Unlearning from a Causal Intervention Perspective. Yujian Liu, Yang Zhang, Tommi S. Jaakkola, Shiyu Chang |
| 2024 | Revisiting the Robustness of Watermarking to Paraphrasing Attacks. Saksham Rastogi, Danish Pruthi |
| 2024 | Right for Right Reasons: Large Language Models for Verifiable Commonsense Knowledge Graph Question Answering. Armin Toroghi, Willis Guo, Mohammad Mahdi Abdollah Pour, Scott Sanner |
| 2024 | RoCEL: Advancing Table Entity Linking through Distinctive Row and Column Contexts. Yuanzheng Wang, Yixing Fan, Jiafeng Guo, Ruqing Zhang, Xueqi Cheng |
| 2024 | RoTBench: A Multi-Level Benchmark for Evaluating the Robustness of Large Language Models in Tool Learning. Junjie Ye, Yilong Wu, Songyang Gao, Caishuang Huang, Sixian Li, Guanyu Li, Xiaoran Fan, Qi Zhang, Tao Gui, Xuanjing Huang |
| 2024 | Roleplay-doh: Enabling Domain-Experts to Create LLM-simulated Patients via Eliciting and Adhering to Principles. Ryan Louie, Ananjan Nandi, William Fang, Cheng Chang, Emma Brunskill, Diyi Yang |
| 2024 | RoseLoRA: Row and Column-wise Sparse Low-rank Adaptation of Pre-trained Language Model for Knowledge Editing and Fine-tuning. Haoyu Wang, Tianci Liu, Ruirui Li, Monica Xiao Cheng, Tuo Zhao, Jing Gao |
| 2024 | RuBLiMP: Russian Benchmark of Linguistic Minimal Pairs. Ekaterina Taktasheva, Maxim Bazhukov, Kirill Koncha, Alena Fenogenova, Ekaterina Artemova, Vladislav Mikhailov |
| 2024 | SCOI: Syntax-augmented Coverage-based In-context Example Selection for Machine Translation. Chenming Tang, Zhixiang Wang, Yunfang Wu |
| 2024 | SEACrowd: A Multilingual Multimodal Data Hub and Benchmark Suite for Southeast Asian Languages. Holy Lovenia, Rahmad Mahendra, Salsabil Maulana Akbar, Lester James V. Miranda, Jennifer Santoso, Elyanah Aco, Akhdan Fadhilah, Jonibek Mansurov, Joseph Marvin Imperial, Onno Kampman, Joel Ruben Antony Moniz, Muhammad Ravi Shulthan Habibi, Frederikus Hudi, Jann Railey Montalan, Ryan Hadiwijaya, Joanito Agili Lopo, William Nixon, Börje Karlsson, James Jaya, Ryandito Diandaru, Yuze Gao, Patrick Amadeus Irawan, Bin Wang, Jan Christian Blaise Cruz, Chenxi Whitehouse, Ivan Halim Parmonangan, Maria Khelli, Wenyu Zhang, Lucky Susanto, Reynard Adha Ryanda, Sonny Lazuardi Hermawan, Dan John Velasco, Muhammad Dehan Al Kautsar, Willy Fitra Hendria, Yasmin Moslem, Noah Flynn, Muhammad Farid Adilazuarda, Haochen Li, Johanes Lee, R. Damanhuri, Shuo Sun, Muhammad Reza Qorib, Amirbek Djanibekov, Wei Qi Leong, Quyet V. Do, Niklas Muennighoff, Tanrada Pansuwan, Ilham Firdausi Putra, Yan Xu, Ngee Tai Chia, Ayu Purwarianti, Sebastian Ruder, William-Chandra Tjhi, Peerat Limkonchotiwat, Alham Fikri Aji, Sedrick Keh, Genta Indra Winata, Ruochen Zhang, Fajri Koto, Zheng Xin Yong, Samuel Cahyawijaya |
| 2024 | SEEKR: Selective Attention-Guided Knowledge Retention for Continual Learning of Large Language Models. Jinghan He, Haiyun Guo, Kuan Zhu, Zihan Zhao, Ming Tang, Jinqiao Wang |
| 2024 | SEER: Self-Aligned Evidence Extraction for Retrieval-Augmented Generation. Xinping Zhao, Dongfang Li, Yan Zhong, Boren Hu, Yibin Chen, Baotian Hu, Min Zhang |
| 2024 | SEGMENT+: Long Text Processing with Short-Context Language Models. Wei Shi, Shuang Li, Kerun Yu, Jinglei Chen, Zujie Liang, Xinhui Wu, Yuxi Qian, Feng Wei, Bo Zheng, Jiaqing Liang, Jiangjie Chen, Yanghua Xiao |
| 2024 | SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation. Xiaoze Liu, Ting Sun, Tianyang Xu, Feijie Wu, Cunxiang Wang, Xiaoqian Wang, Jing Gao |
| 2024 | SLANG: New Concept Comprehension of Large Language Models. Lingrui Mei, Shenghua Liu, Yiwei Wang, Baolong Bi, Xueqi Cheng |
| 2024 | SOUL: Unlocking the Power of Second-Order Optimization for LLM Unlearning. Jinghan Jia, Yihua Zhang, Yimeng Zhang, Jiancheng Liu, Bharat Runwal, James Diffenderfer, Bhavya Kailkhura, Sijia Liu |
| 2024 | SPEED++: A Multilingual Event Extraction Framework for Epidemic Prediction and Preparedness. Tanmay Parekh, Jeffrey Kwan, Jiarui Yu, Sparsh Johri, Hyosang Ahn, Sreya Muppalla, Kai-Wei Chang, Wei Wang, Nanyun Peng |
| 2024 | SRF: Enhancing Document-Level Relation Extraction with a Novel Secondary Reasoning Framework. Fu Zhang, Qi Miao, Jingwei Cheng, Hongsen Yu, Yi Yan, Xin Li, Yongxue Wu |
| 2024 | STAR: SocioTechnical Approach to Red Teaming Language Models. Laura Weidinger, John Mellor, Bernat Guillen Pegueroles, Nahema Marchal, Ravin Kumar, Kristian Lum, Canfer Akbulut, Mark Diaz, A. Stevie Bergman, Mikel Rodriguez, Verena Rieser, William Isaac |
| 2024 | STOP! Benchmarking Large Language Models with Sensitivity Testing on Offensive Progressions. Robert Morabito, Sangmitra Madhusudan, Tyler McDonald, Ali Emami |
| 2024 | STORYSUMM: Evaluating Faithfulness in Story Summarization. Melanie Subbiah, Faisal Ladhak, Akankshya Mishra, Griffin Adams, Lydia B. Chilton, Kathleen R. McKeown |
| 2024 | SUPER: Evaluating Agents on Setting Up and Executing Tasks from Research Repositories. Ben Bogin, Kejuan Yang, Shashank Gupta, Kyle Richardson, Erin Bransom, Peter Clark, Ashish Sabharwal, Tushar Khot |
| 2024 | SURf: Teaching Large Vision-Language Models to Selectively Utilize Retrieved Information. Jiashuo Sun, Jihai Zhang, Yucheng Zhou, Zhaochen Su, Xiaoye Qu, Yu Cheng |
| 2024 | SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization. Prakamya Mishra, Zonghai Yao, Parth Vashisht, Feiyun Ouyang, Beining Wang, Vidhi Dhaval Mody, Hong Yu |
| 2024 | Safely Learning with Private Data: A Federated Learning Framework for Large Language Model. Jiaying Zheng, Hainan Zhang, Lingxiang Wang, Wangjie Qiu, Hongwei Zheng, Zhi Ming Zheng |
| 2024 | Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations. Rima Hazra, Sayan Layek, Somnath Banerjee, Soujanya Poria |
| 2024 | Satyrn: A Platform for Analytics Augmented Generation. Marko Sterbentz, Cameron Barrie, Shubham Shahi, Abhratanu Dutta, Donna Hooshmand, Harper Pack, Kristian J. Hammond |
| 2024 | SaySelf: Teaching LLMs to Express Confidence with Self-Reflective Rationales. Tianyang Xu, Shujin Wu, Shizhe Diao, Xiaoze Liu, Xingyao Wang, Yangyi Chen, Jing Gao |
| 2024 | Scalable Data Ablation Approximations for Language Models through Modular Training and Merging. Clara Na, Ian Magnusson, Ananya Harsh Jha, Tom Sherborne, Emma Strubell, Jesse Dodge, Pradeep Dasigi |
| 2024 | Scalable Efficient Training of Large Language Models with Low-dimensional Projected Attention. Xingtai Lv, Ning Ding, Kaiyan Zhang, Ermo Hua, Ganqu Cui, Bowen Zhou |
| 2024 | Scaling Laws Across Model Architectures: A Comparative Analysis of Dense and MoE Models in Large Language Models. Siqi Wang, Zhengyu Chen, Bei Li, Keqing He, Min Zhang, Jingang Wang |
| 2024 | Scaling Laws for Linear Complexity Language Models. Xuyang Shen, Dong Li, Ruitao Leng, Zhen Qin, Weigao Sun, Yiran Zhong |
| 2024 | Scaling Properties of Speech Language Models. Santiago Cuervo, Ricard Marxer |
| 2024 | Scaling Synthetic Logical Reasoning Datasets with Context-Sensitive Declarative Grammars. Damien Sileo |
| 2024 | ScalingFilter: Assessing Data Quality through Inverse Utilization of Scaling Laws. Ruihang Li, Yixuan Wei, Miaosen Zhang, Nenghai Yu, Han Hu, Houwen Peng |
| 2024 | SciAgent: Tool-augmented Language Models for Scientific Reasoning. Yubo Ma, Zhibin Gou, Junheng Hao, Ruochen Xu, Shuohang Wang, Liangming Pan, Yujiu Yang, Yixin Cao, Aixin Sun |
| 2024 | SciDQA: A Deep Reading Comprehension Dataset over Scientific Papers. Shruti Singh, Nandan Sarkar, Arman Cohan |
| 2024 | SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents. Qi Zhang, Zhijia Chen, Huitong Pan, Cornelia Caragea, Longin Jan Latecki, Eduard C. Dragut |
| 2024 | SciEx: Benchmarking Large Language Models on Scientific Exams with Human Expert Grading and Automatic Grading. Tu Anh Dinh, Carlos Mullov, Leonard Bärmann, Zhaolin Li, Danni Liu, Simon Reiß, Jueun Lee, Nathan Lerzer, Jianfeng Gao, Fabian Peller-Konrad, Tobias Röddiger, Alexander Waibel, Tamim Asfour, Michael Beigl, Rainer Stiefelhagen, Carsten Dachsbacher, Klemens Böhm, Jan Niehues |
| 2024 | SciPrompt: Knowledge-augmented Prompting for Fine-grained Categorization of Scientific Topics. Zhiwen You, Kanyao Han, Haotian Zhu, Bertram Ludäscher, Jana Diesner |
| 2024 | Scope-enhanced Compositional Semantic Parsing for DRT. Xiulin Yang, Jonas Groschwitz, Alexander Koller, Johan Bos |
| 2024 | Searching for Best Practices in Retrieval-Augmented Generation. Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang |
| 2024 | SecCoder: Towards Generalizable and Robust Secure Code Generation. Boyu Zhang, Tianyu Du, Junkai Tong, Xuhong Zhang, Kingsum Chow, Sheng Cheng, Xun Wang, Jianwei Yin |
| 2024 | Seeing the Forest through the Trees: Data Leakage from Partial Transformer Gradients. Weijun Li, Qiongkai Xu, Mark Dras |
| 2024 | Seemingly Plausible Distractors in Multi-Hop Reasoning: Are Large Language Models Attentive Readers? Neeladri Bhuiya, Viktor Schlegel, Stefan Winkler |
| 2024 | Seg2Act: Global Context-aware Action Generation for Document Logical Structuring. Zichao Li, Shaojie He, Meng Liao, Xuanang Chen, Yaojie Lu, Hongyu Lin, Yanxiong Lu, Xianpei Han, Le Sun |
| 2024 | Segment Any Text: A Universal Approach for Robust, Efficient and Adaptable Sentence Segmentation. Markus Frohmann, Igor Sterner, Ivan Vulic, Benjamin Minixhofer, Markus Schedl |
| 2024 | Selective Vision is the Challenge for Visual Reasoning: A Benchmark for Visual Argument Understanding. Jiwan Chung, Sungjae Lee, Minseo Kim, Seungju Han, Ashkan Yousefpour, Jack Hessel, Youngjae Yu |
| 2024 | Self-AMPLIFY: Improving Small Language Models with Self Post Hoc Explanations. Milan Bhan, Jean-Noël Vittaut, Nicolas Chesneau, Marie-Jeanne Lesot |
| 2024 | Self-Bootstrapped Visual-Language Model for Knowledge Selection and Question Answering. Dongze Hao, Qunbo Wang, Longteng Guo, Jie Jiang, Jing Liu |
| 2024 | Self-Powered LLM Modality Expansion for Large Speech-Text Models. Tengfei Yu, Xuebo Liu, Zhiyi Hou, Liang Ding, Dacheng Tao, Min Zhang |
| 2024 | Self-Refine Instruction-Tuning for Aligning Reasoning in Language Models. Leonardo Ranaldi, André Freitas |
| 2024 | Self-Training Large Language and Vision Assistant for Medical Question Answering. Guohao Sun, Can Qin, Huazhu Fu, Linwei Wang, Zhiqiang Tao |
| 2024 | Self-Training for Sample-Efficient Active Learning for Text Classification with Pre-Trained Language Models. Christopher Schröder, Gerhard Heyer |
| 2024 | Semantic Training Signals Promote Hierarchical Syntactic Generalization in Transformers. Aditya Yedetore, Najoung Kim |
| 2024 | Semantics and Sentiment: Cross-lingual Variations in Emoji Use. Giulio Zhou, Sydelle de Souza, Ella Markham, Oghenetekevwe Kwakpovwe, Sumin Zhao |
| 2024 | Semformer: Transformer Language Models with Semantic Planning. Yongjing Yin, Junran Ding, Kai Song, Yue Zhang |
| 2024 | Sequential API Function Calling Using GraphQL Schema. Avirup Saha, Lakshmi Mandal, Balaji Ganesan, Sambit Ghosh, Renuka Sindhgatta, Carlos Eberhardt, Dan Debrunner, Sameep Mehta |
| 2024 | ShadowLLM: Predictor-based Contextual Sparsity for Large Language Models. Yash Akhauri, Ahmed F. AbouElhamayed, Jordan Dotzel, Zhiru Zhang, Alexander M. Rush, Safeen Huda, Mohamed S. Abdelfattah |
| 2024 | Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling. Georgios Pantazopoulos, Malvina Nikandrou, Alessandro Suglia, Oliver Lemon, Arash Eshghi |
| 2024 | Shortcuts Arising from Contrast: Towards Effective and Lightweight Clean-Label Attacks in Prompt-Based Learning. Xiaopeng Xie, Ming Yan, Xiwen Zhou, Chenlong Zhao, Suli Wang, Yong Zhang, Joey Zhou |
| 2024 | Show and Guide: Instructional-Plan Grounded Vision and Language Model. Diogo Glória-Silva, David Semedo, João Magalhães |
| 2024 | SignCLIP: Connecting Text and Sign Language by Contrastive Learning. Zifan Jiang, Gerard Sant, Amit Moryossef, Mathias Müller, Rico Sennrich, Sarah Ebling |
| 2024 | SimLLM: Detecting Sentences Generated by Large Language Models Using Similarity between the Generation and its Re-generation. Hoang-Quoc Nguyen-Son, Minh-Son Dao, Koji Zettsu |
| 2024 | Simul-MuST-C: Simultaneous Multilingual Speech Translation Corpus Using Large Language Model. Mana Makinae, Yusuke Sakai, Hidetaka Kamigaito, Taro Watanabe |
| 2024 | Simultaneous Interpretation Corpus Construction by Large Language Models in Distant Language Pair. Yusuke Sakai, Mana Makinae, Hidetaka Kamigaito, Taro Watanabe |
| 2024 | Simultaneous Masking, Not Prompting Optimization: A Paradigm Shift in Fine-tuning LLMs for Simultaneous Translation. Matthew Raffel, Victor Agostinelli, Lizhong Chen |
| 2024 | Small Agent Can Also Rock! Empowering Small Language Models as Hallucination Detector. Xiaoxue Cheng, Junyi Li, Xin Zhao, Hongzhi Zhang, Fuzheng Zhang, Di Zhang, Kun Gai, Ji-Rong Wen |
| 2024 | Small LLMs Are Weak Tool Learners: A Multi-LLM Agent. Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang |
| 2024 | Social Bias Probing: Fairness Benchmarking for Language Models. Marta Marchiori Manerba, Karolina Stanczak, Riccardo Guidotti, Isabelle Augenstein |
| 2024 | SparseGrad: A Selective Method for Efficient Fine-tuning of MLP Layers. Viktoria Chekalina, Anna Rudenko, Gleb Mezentsev, Aleksandr Mikhalev, Alexander Panchenko, Ivan V. Oseledets |
| 2024 | Speaking in Wavelet Domain: A Simple and Efficient Approach to Speed up Speech Diffusion Model. Xiangyu Zhang, Daijiao Liu, Hexin Liu, Qiquan Zhang, Hanyu Meng, Leibny Paola García-Perera, Engsiong Chng, Lina Yao |
| 2024 | SpecHub: Provable Acceleration to Multi-Draft Speculative Decoding. Ryan Sun, Tianyi Zhou, Xun Chen, Lichao Sun |
| 2024 | SpeechQE: Estimating the Quality of Direct Speech Translation. HyoJung Han, Kevin Duh, Marine Carpuat |
| 2024 | Speechworthy Instruction-tuned Language Models. Hyundong Cho, Nicolaas Paul Jedema, Leonardo F. R. Ribeiro, Karishma Sharma, Pedro A. Szekely, Alessandro Moschitti, Ruben Janssen, Jonathan May |
| 2024 | Split and Merge: Aligning Position Biases in LLM-based Evaluators. Zongjie Li, Chaozheng Wang, Pingchuan Ma, Daoyuan Wu, Shuai Wang, Cuiyun Gao, Yang Liu |
| 2024 | Sprout: Green Generative AI with Carbon-Efficient LLM Inference. Baolin Li, Yankai Jiang, Vijay Gadepally, Devesh Tiwari |
| 2024 | Stable Language Model Pre-training by Reducing Embedding Variability. Woojin Chung, Jiwoo Hong, Na Min An, James Thorne, Se-Young Yun |
| 2024 | StablePrompt : Automatic Prompt Tuning using Reinforcement Learning for Large Language Model. Minchan Kwon, Gaeun Kim, Jongsuk Kim, Haeil Lee, Junmo Kim |
| 2024 | Standardize: Aligning Language Models with Expert-Defined Standards for Content Generation. Joseph Marvin Imperial, Gail Forey, Harish Tayyar Madabushi |
| 2024 | Statistical Uncertainty in Word Embeddings: GloVe-V. Andrea Vallebueno, Cassandra Handan-Nader, Christopher D. Manning, Daniel E. Ho |
| 2024 | Step-by-Step Reasoning to Solve Grid Puzzles: Where do LLMs Falter? Nemika Tyagi, Mihir Parmar, Mohith Kulkarni, Aswin RRV, Nisarg Patel, Mutsumi Nakamura, Arindam Mitra, Chitta Baral |
| 2024 | Stepwise Verification and Remediation of Student Reasoning Errors with Large Language Model Tutors. Nico Daheim, Jakub Macina, Manu Kapur, Iryna Gurevych, Mrinmaya Sachan |
| 2024 | Still Not Quite There! Evaluating Large Language Models for Comorbid Mental Health Diagnosis. Amey Hengle, Atharva Kulkarni, Shantanu Patankar, Madhumitha Chandrasekaran, Sneha D'Silva, Jemima Jacob, Rashmi Gupta |
| 2024 | Story Embeddings - Narrative-Focused Representations of Fictional Stories. Hans Ole Hatzel, Chris Biemann |
| 2024 | Story Morals: Surfacing value-driven narrative schemas using large language models. David G. Hobson, Haiqi Zhou, Derek Ruths, Andrew Piper |
| 2024 | StorySparkQA: Expert-Annotated QA Pairs with Real-World Knowledge for Children's Story-Based Learning. Jiaju Chen, Yuxuan Lu, Shao Zhang, Bingsheng Yao, Yuanzhe Dong, Ying Xu, Yunyao Li, Qianwen Wang, Dakuo Wang, Yuling Sun |
| 2024 | Strategic Demonstration Selection for Improved Fairness in LLM In-Context Learning. Jingyu Hu, Weiru Liu, Mengnan Du |
| 2024 | Strength Lies in Differences! Improving Strategy Planning for Non-collaborative Dialogues via Diversified User Simulation. Tong Zhang, Chen Huang, Yang Deng, Hongru Liang, Jia Liu, Zujie Wen, Wenqiang Lei, Tat-Seng Chua |
| 2024 | Strengthening Structural Inductive Biases by Pre-training to Perform Syntactic Transformations. Matthias Lindemann, Alexander Koller, Ivan Titov |
| 2024 | Structure Guided Prompt: Instructing Large Language Model in Multi-Step Reasoning by Exploring Graph Structure of the Text. Kewei Cheng, Nesreen K. Ahmed, Theodore L. Willke, Yizhou Sun |
| 2024 | Structured Optimal Brain Pruning for Large Language Models. Jiateng Wei, Quan Lu, Ning Jiang, Siqi Li, Jingyang Xiang, Jun Chen, Yong Liu |
| 2024 | Studying and Mitigating Biases in Sign Language Understanding Models. Katherine Atwell, Danielle Bragg, Malihe Alikhani |
| 2024 | Style-Shifting Behaviour of the Manosphere on Reddit. Jai Aggarwal, Suzanne Stevenson |
| 2024 | Style-Specific Neurons for Steering LLMs in Text Style Transfer. Wen Lai, Viktor Hangya, Alexander Fraser |
| 2024 | StyleRemix: Interpretable Authorship Obfuscation via Distillation and Perturbation of Style Elements. Jillian Fisher, Skyler Hallinan, Ximing Lu, Mitchell L. Gordon, Zaïd Harchaoui, Yejin Choi |
| 2024 | Subjective Topic meets LLMs: Unleashing Comprehensive, Reflective and Creative Thinking through the Negation of Negation. Fangrui Lv, Kaixiong Gong, Jian Liang, Xinyu Pang, Changshui Zhang |
| 2024 | Subword Segmentation in LLMs: Looking at Inflection and Consistency. Marion Di Marco, Alexander Fraser |
| 2024 | Successfully Guiding Humans with Imperfect Instructions by Highlighting Potential Errors and Suggesting Corrections. Lingjun Zhao, Khanh Nguyen, Hal Daumé III |
| 2024 | Summary of a Haystack: A Challenge to Long-Context LLMs and RAG Systems. Philippe Laban, Alexander R. Fabbri, Caiming Xiong, Chien-Sheng Wu |
| 2024 | Surprise! Uniform Information Density Isn't the Whole Story: Predicting Surprisal Contours in Long-form Discourse. Eleftheria Tsipidi, Franz Nowak, Ryan Cotterell, Ethan Wilcox, Mario Giulianelli, Alex Warstadt |
| 2024 | Surveying the Dead Minds: Historical-Psychological Text Analysis with Contextualized Construct Representation (CCR) for Classical Chinese. Yuqi Chen, Sixuan Li, Ying Li, Mohammad Atari |
| 2024 | Susu Box or Piggy Bank: Assessing Cultural Commonsense Knowledge between Ghana and the US. Christabel Acquaye, Haozhe An, Rachel Rudinger |
| 2024 | Symbolic Working Memory Enhances Language Models for Complex Rule Application. Siyuan Wang, Zhongyu Wei, Yejin Choi, Xiang Ren |
| 2024 | Synchronous Faithfulness Monitoring for Trustworthy Retrieval-Augmented Generation. Di Wu, Jia-Chen Gu, Fan Yin, Nanyun Peng, Kai-Wei Chang |
| 2024 | Synergizing In-context Learning with Hints for End-to-end Task-oriented Dialog Systems. Vishal Vivek Saley, Rocktim Jyoti Das, Dinesh Raghu, Mausam |
| 2024 | SynthesizRR: Generating Diverse Datasets with Retrieval Augmentation. Abhishek Divekar, Greg Durrett |
| 2024 | Synthetic Knowledge Ingestion: Towards Knowledge Refinement and Injection for Enhancing Large Language Models. Jiaxin Zhang, Wendi Cui, Yiran Huang, Kamalika Das, Kumar Sricharan |
| 2024 | Systematic Biases in LLM Simulations of Debates. Amir Taubenfeld, Yaniv Dover, Roi Reichart, Ariel Goldstein |
| 2024 | T-FREE: Subword Tokenizer-Free Generative LLMs via Sparse Representations for Memory-Efficient Embeddings. Björn Deiseroth, Manuel Brack, Patrick Schramowski, Kristian Kersting, Samuel Weinbach |
| 2024 | TCSinger: Zero-Shot Singing Voice Synthesis with Style Transfer and Multi-Level Style Control. Yu Zhang, Ziyue Jiang, Ruiqi Li, Changhao Pan, Jinzheng He, Rongjie Huang, Chuxin Wang, Zhou Zhao |
| 2024 | TEMA: Token Embeddings Mapping for Enriching Low-Resource Language Models. Rodolfo Zevallos, Núria Bel, Mireia Farrús |
| 2024 | TKGT: Redefinition and A New Way of Text-to-Table Tasks Based on Real World Demands and Knowledge Graphs Augmented LLMs. Peiwen Jiang, Xinbo Lin, Zibo Zhao, Ruhui Ma, Yvonne Jie Chen, Jinhua Cheng |
| 2024 | TL-CL: Task And Language Incremental Continual Learning. Shrey Satapara, P. K. Srijith |
| 2024 | TRoTR: A Framework for Evaluating the Re-contextualization of Text Reuse. Francesco Periti, Pierluigi Cassotti, Stefano Montanelli, Nina Tahmasebi, Dominik Schlechtweg |
| 2024 | TV-TREES: Multimodal Entailment Trees for Neuro-Symbolic Video Reasoning. Kate Sanders, Nathaniel Weir, Benjamin Van Durme |
| 2024 | Table Question Answering for Low-resourced Indic Languages. Vaishali Pal, Evangelos Kanoulas, Andrew Yates, Maarten de Rijke |
| 2024 | Tag-grounded Visual Instruction Tuning with Retrieval Augmentation. Daiqing Qi, Handong Zhao, Zijun Wei, Sheng Li |
| 2024 | Take Off the Training Wheels! Progressive In-Context Learning for Effective Alignment. Zhenyu Liu, Dongfang Li, Xinshuo Hu, Xinping Zhao, Yibin Chen, Baotian Hu, Min Zhang |
| 2024 | Target-Aware Language Modeling via Granular Data Sampling. Ernie Chang, Pin-Jie Lin, Yang Li, Changsheng Zhao, Daeil Kim, Rastislav Rabatin, Zechun Liu, Yangyang Shi, Vikas Chandra |
| 2024 | Task Arithmetic can Mitigate Synthetic-to-Real Gap in Automatic Speech Recognition. Hsuan Su, Hua Farn, Fan-Yun Sun, Shang-Tse Chen, Hung-yi Lee |
| 2024 | Task Oriented In-Domain Data Augmentation. Xiao Liang, Xinyu Hu, Simiao Zuo, Yeyun Gong, Qiang Lou, Yi Liu, Shao-Lun Huang, Jian Jiao |
| 2024 | Taxonomy-guided Semantic Indexing for Academic Paper Search. SeongKu Kang, Yunyi Zhang, Pengcheng Jiang, Dongha Lee, Jiawei Han, Hwanjo Yu |
| 2024 | Taylor Unswift: Secured Weight Release for Large Language Models via Taylor Expansion. Guanchu Wang, Yu-Neng Chuang, Ruixiang Tang, Shaochen Zhong, Jiayi Yuan, Hongye Jin, Zirui Liu, Vipin Chaudhary, Shuai Xu, James Caverlee, Xia Ben Hu |
| 2024 | Teaching Embodied Reinforcement Learning Agents: Informativeness and Diversity of Language Use. Jiajun Xi, Yinong He, Jianing Yang, Yinpei Dai, Joyce Chai |
| 2024 | Teaching LLMs to Abstain across Languages via Multilingual Feedback. Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov |
| 2024 | Teaching Small Language Models Reasoning through Counterfactual Distillation. Tao Feng, Yicheng Li, Chenglin Li, Hao Chen, Fei Yu, Yin Zhang |
| 2024 | TempoFormer: A Transformer for Temporally-aware Representations in Change Detection. Talia Tseriotou, Adam Tsakalidis, Maria Liakata |
| 2024 | Temporally Consistent Factuality Probing for Large Language Models. Ashutosh Bajpai, Aaryan Goyal, Atif Anwer, Tanmoy Chakraborty |
| 2024 | Text Fluoroscopy: Detecting LLM-Generated Text through Intrinsic Features. Xiao Yu, Kejiang Chen, Qi Yang, Weiming Zhang, Nenghai Yu |
| 2024 | Text Grafting: Near-Distribution Weak Supervision for Minority Classes in Text Classification. Letian Peng, Yi Gu, Chengyu Dong, Zihan Wang, Jingbo Shang |
| 2024 | Text-Tuple-Table: Towards Information Integration in Text-to-Table Generation via Global Tuple Extraction. Zheye Deng, Chunkit Chan, Weiqi Wang, Yuxi Sun, Wei Fan, Tianshi Zheng, Yauwai Yim, Yangqiu Song |
| 2024 | Text2Chart31: Instruction Tuning for Chart Generation with Automatic Feedback. Fatemeh Pesaran Zadeh, Juyeon Kim, Jin-Hwa Kim, Gunhee Kim |
| 2024 | The Accuracy Paradox in RLHF: When Better Reward Models Don't Yield Better Language Models. Yanjun Chen, Dawei Zhu, Yirong Sun, Xinghao Chen, Wei Zhang, Xiaoyu Shen |
| 2024 | The Best Defense is Attack: Repairing Semantics in Textual Adversarial Examples. Heng Yang, Ke Li |
| 2024 | The Computational Anatomy of Humility: Modeling Intellectual Humility in Online Public Discourse. Xiaobo Guo, Neil Potnis, Melody Yu, Nabeel Gillani, Soroush Vosoughi |
| 2024 | The Death and Life of Great Prompts: Analyzing the Evolution of LLM Prompts from the Structural Perspective. Yihan Ma, Xinyue Shen, Yixin Wu, Boyang Zhang, Michael Backes, Yang Zhang |
| 2024 | The Emergence of Compositional Languages in Multi-entity Referential Games: from Image to Graph Representations. Daniel Akkerman, Phong Le, Raquel G. Alhama |
| 2024 | The Empirical Variability of Narrative Perceptions of Social Media Texts. Joel Mire, Maria Antoniak, Elliott Ash, Andrew Piper, Maarten Sap |
| 2024 | The Factuality Tax of Diversity-Intervened Text-to-Image Generation: Benchmark and Fact-Augmented Intervention. Yixin Wan, Di Wu, Haoran Wang, Kai-Wei Chang |
| 2024 | The Generation Gap: Exploring Age Bias in the Value Systems of Large Language Models. Siyang Liu, Trisha Maturi, Bowen Yi, Siqi Shen, Rada Mihalcea |
| 2024 | The Greatest Good Benchmark: Measuring LLMs' Alignment with Utilitarian Moral Dilemmas. Giovanni Marraffini, Andrés Cotton, Noe Hsueh, Axel Fridman, Juan Wisznia, Luciano Del Corro |
| 2024 | The Illusion of Competence: Evaluating the Effect of Explanations on Users' Mental Models of Visual Question Answering Systems. Judith Sieker, Simeon Junker, Ronja Utescher, Nazia Attari, Heiko Wersing, Hendrik Buschmeier, Sina Zarrieß |
| 2024 | The Instinctive Bias: Spurious Images lead to Illusion in MLLMs. Tianyang Han, Qing Lian, Rui Pan, Renjie Pi, Jipeng Zhang, Shizhe Diao, Yong Lin, Tong Zhang |
| 2024 | The LLM Effect: Are Humans Truly Using LLMs, or Are They Being Influenced By Them Instead? Alexander S. Choi, Syeda Sabrina Akter, JP Singh, Antonios Anastasopoulos |
| 2024 | The Lou Dataset - Exploring the Impact of Gender-Fair Language in German Text Classification. Andreas Waldis, Joel Birrer, Anne Lauscher, Iryna Gurevych |
| 2024 | The Multilingual Alignment Prism: Aligning Global and Local Preferences to Reduce Harm. Aakanksha, Arash Ahmadian, Beyza Ermis, Seraphina Goldfarb-Tarrant, Julia Kreutzer, Marzieh Fadaee, Sara Hooker |
| 2024 | The Mystery of In-Context Learning: A Comprehensive Survey on Interpretation and Analysis. Yuxiang Zhou, Jiazheng Li, Yanzheng Xiang, Hanqi Yan, Lin Gui, Yulan He |
| 2024 | The Mystery of the Pathological Path-star Task for Language Models. Arvid Frydenlund |
| 2024 | The Odyssey of Commonsense Causality: From Foundational Benchmarks to Cutting-Edge Reasoning. Shaobo Cui, Zhijing Jin, Bernhard Schölkopf, Boi Faltings |
| 2024 | The Zeno's Paradox of 'Low-Resource' Languages. Hellina Hailu Nigatu, Atnafu Lambebo Tonja, Benjamin Rosman, Thamar Solorio, Monojit Choudhury |
| 2024 | The effects of distance on NPI illusive effects in BERT. So Lee, Mai Vu |
| 2024 | Themis: A Reference-free NLG Evaluation Language Model with Flexibility and Interpretability. Xinyu Hu, Li Lin, Mingqi Gao, Xunjian Yin, Xiaojun Wan |
| 2024 | TheoremLlama: Transforming General-Purpose LLMs into Lean4 Experts. Ruida Wang, Jipeng Zhang, Yizhen Jia, Rui Pan, Shizhe Diao, Renjie Pi, Tong Zhang |
| 2024 | Thinking Outside of the Differential Privacy Box: A Case Study in Text Privatization with Language Model Prompting. Stephen Meisenbacher, Florian Matthes |
| 2024 | Thoughts to Target: Enhance Planning for Target-driven Conversation. Zhonghua Zheng, Lizi Liao, Yang Deng, Ee-Peng Lim, Minlie Huang, Liqiang Nie |
| 2024 | Threshold-driven Pruning with Segmented Maximum Term Weights for Approximate Cluster-based Sparse Retrieval. Yifan Qiao, Parker Carlson, Shanxiu He, Yingrui Yang, Tao Yang |
| 2024 | TimeR⁴ : Time-aware Retrieval-Augmented Large Language Models for Temporal Knowledge Graph Question Answering. Xinying Qian, Ying Zhang, Yu Zhao, Baohang Zhou, Xuhui Sui, Li Zhang, Kehui Song |
| 2024 | TinyChart: Efficient Chart Understanding with Program-of-Thoughts Learning and Visual Token Merging. Liang Zhang, Anwen Hu, Haiyang Xu, Ming Yan, Yichen Xu, Qin Jin, Ji Zhang, Fei Huang |
| 2024 | To Preserve or To Compress: An In-Depth Study of Connector Selection in Multimodal Large Language Models. Junyan Lin, Haoran Chen, Dawei Zhu, Xiaoyu Shen |
| 2024 | To Word Senses and Beyond: Inducing Concepts with Contextualized Language Models. Bastien Liétard, Pascal Denis, Mikaela Keller |
| 2024 | Token Erasure as a Footprint of Implicit Vocabulary Items in LLMs. Sheridan Feucht, David Atkinson, Byron C. Wallace, David Bau |
| 2024 | TokenVerse: Towards Unifying Speech and NLP Tasks via Transducer-based ASR. Shashi Kumar, Srikanth R. Madikeri, Juan Pablo Zuluaga-Gomez, Iuliia Thorbecke, Esaú Villatoro-Tello, Sergio Burdisso, Petr Motlícek, Karthik S, Aravind Ganapathiraju |
| 2024 | Tokenization Is More Than Compression. Craig W. Schmidt, Varshini Reddy, Haoran Zhang, Alec Alameddine, Omri Uzan, Yuval Pinter, Chris Tanner |
| 2024 | ToolBeHonest: A Multi-level Hallucination Diagnostic Benchmark for Tool-Augmented Large Language Models. Yuxiang Zhang, Jing Chen, Junjie Wang, Yaxin Liu, Cheng Yang, Chufan Shi, Xinyu Zhu, Zihao Lin, Hanwen Wan, Yujiu Yang, Tetsuya Sakai, Tian Feng, Hayato Yamana |
| 2024 | ToolPlanner: A Tool Augmented LLM for Multi Granularity Instructions with Path Planning and Feedback. Qinzhuo Wu, Wei Liu, Jian Luan, Bin Wang |
| 2024 | Tools Fail: Detecting Silent Errors in Faulty Tools. Jimin Sun, So Yeon Min, Yingshan Chang, Yonatan Bisk |
| 2024 | TopViewRS: Vision-Language Models as Top-View Spatial Reasoners. Chengzu Li, Caiqi Zhang, Han Zhou, Nigel Collier, Anna Korhonen, Ivan Vulic |
| 2024 | Topic-Oriented Open Relation Extraction with A Priori Seed Generation. Linyi Ding, Jinfeng Xiao, Sizhe Zhou, Chaoqi Yang, Jiawei Han |
| 2024 | Toward Compositional Behavior in Neural Models: A Survey of Current Views. Kate McCurdy, Paul Soulos, Paul Smolensky, Roland Fernandez, Jianfeng Gao |
| 2024 | Towards Aligning Language Models with Textual Feedback. Saüc Abadal Lloret, Shehzaad Dhuliawala, Keerthiram Murugesan, Mrinmaya Sachan |
| 2024 | Towards Cross-Cultural Machine Translation with Retrieval-Augmented Generation from Multilingual Knowledge Graphs. Simone Conia, Daniel Lee, Min Li, Umar Farooq Minhas, Saloni Potdar, Yunyao Li |
| 2024 | Towards Difficulty-Agnostic Efficient Transfer Learning for Vision-Language Models. Yongjin Yang, Jongwoo Ko, Se-Young Yun |
| 2024 | Towards Enhancing Coherence in Extractive Summarization: Dataset and Experiments with LLMs. Mihir Parmar, Hanieh Deilamsalehy, Franck Dernoncourt, Seunghyun Yoon, Ryan A. Rossi, Trung Bui |
| 2024 | Towards Faithful Knowledge Graph Explanation Through Deep Alignment in Commonsense Question Answering. Weihe Zhai, Arkaitz Zubiaga, Bingquan Liu, Chengjie Sun, Yalong Zhao |
| 2024 | Towards Fast Multilingual LLM Inference: Speculative Decoding and Specialized Drafters. Euiin Yi, Taehyeon Kim, Hongseok Jeung, Du-Seong Chang, Se-Young Yun |
| 2024 | Towards Injecting Medical Visual Knowledge into Multimodal LLMs at Scale. Junying Chen, Chi Gui, Ruyi Ouyang, Anningzhe Gao, Shunian Chen, Guiming Chen, Xidong Wang, Zhenyang Cai, Ke Ji, Xiang Wan, Benyou Wang |
| 2024 | Towards Interpretable Sequence Continuation: Analyzing Shared Circuits in Large Language Models. Michael Lan, Philip Torr, Fazl Barez |
| 2024 | Towards Low-Resource Harmful Meme Detection with LMM Agents. Jianzhao Huang, Hongzhan Lin, Ziyan Liu, Ziyang Luo, Guang Chen, Jing Ma |
| 2024 | Towards Measuring and Modeling "Culture" in LLMs: A Survey. Muhammad Farid Adilazuarda, Sagnik Mukherjee, Pradhyumna Lavania, Siddhant Singh, Alham Fikri Aji, Jacki O'Neill, Ashutosh Modi, Monojit Choudhury |
| 2024 | Towards Online Continuous Sign Language Recognition and Translation. Ronglai Zuo, Fangyun Wei, Brian Mak |
| 2024 | Towards Probing Speech-Specific Risks in Large Multimodal Models: A Taxonomy, Benchmark, and Insights. Hao Yang, Lizhen Qu, Ehsan Shareghi, Reza Haf |
| 2024 | Towards Robust Speech Representation Learning for Thousands of Languages. William Chen, Wangyou Zhang, Yifan Peng, Xinjian Li, Jinchuan Tian, Jiatong Shi, Xuankai Chang, Soumi Maiti, Karen Livescu, Shinji Watanabe |
| 2024 | Towards Tool Use Alignment of Large Language Models. Zhiyuan Chen, Shiqi Shen, Guangyao Shen, Gong Zhi, Xu Chen, Yankai Lin |
| 2024 | Towards Understanding Jailbreak Attacks in LLMs: A Representation Space Analysis. Yuping Lin, Pengfei He, Han Xu, Yue Xing, Makoto Yamada, Hui Liu, Jiliang Tang |
| 2024 | Towards Verifiable Text Generation with Evolving Memory and Self-Reflection. Hao Sun, Hengyi Cai, Bo Wang, Yingyan Hou, Xiaochi Wei, Shuaiqiang Wang, Yan Zhang, Dawei Yin |
| 2024 | Towards a Greek Proverb Atlas: Computational Spatial Exploration and Attribution of Greek Proverbs. John Pavlopoulos, Panos Louridas, Panagiotis Filos |
| 2024 | Towards a Similarity-adjusted Surprisal Theory. Clara Meister, Mario Giulianelli, Tiago Pimentel |
| 2024 | ToxiCloakCN: Evaluating Robustness of Offensive Language Detection in Chinese with Cloaking Perturbations. Yunze Xiao, Yujia Hu, Kenny T. W. Choo, Roy Ka-Wei Lee |
| 2024 | Toxicity Detection is NOT all you Need: Measuring the Gaps to Supporting Volunteer Content Moderators through a User-Centric Method. Yang Trista Cao, Lovely-Frances Domingo, Sarah A. Gilbert, Michelle L. Mazurek, Katie Shilton, Hal Daumé III |
| 2024 | Tracking the perspectives of interacting language models. Hayden S. Helm, Brandon Duderstadt, Youngser Park, Carey E. Priebe |
| 2024 | Training-free Deep Concept Injection Enables Language Models for Video Question Answering. Xudong Lin, Manling Li, Richard S. Zemel, Heng Ji, Shih-Fu Chang |
| 2024 | TransferTOD: A Generalizable Chinese Multi-Domain Task-Oriented Dialogue System with Transfer Capabilities. Ming Zhang, Caishuang Huang, Yilong Wu, Shichun Liu, Huiyuan Zheng, Yurui Dong, Yujiong Shen, Shihan Dou, Jun Zhao, Junjie Ye, Qi Zhang, Tao Gui, Xuanjing Huang |
| 2024 | Transformers are Multi-State RNNs. Matanel Oren, Michael Hassid, Yarden Nir, Yossi Adi, Roy Schwartz |
| 2024 | TraveLER: A Modular Multi-LMM Agent Framework for Video Question-Answering. Chuyi Shang, Amos You, Sanjay Subramanian, Trevor Darrell, Roei Herzig |
| 2024 | Tree of Problems: Improving structured problem solving with compositionality. Armel Zebaze, Benoît Sagot, Rachel Bawden |
| 2024 | Triad: A Framework Leveraging a Multi-Role LLM-based Agent to Solve Knowledge Base Question Answering. Chang Zong, Yuchen Yan, Weiming Lu, Jian Shao, Yongfeng Huang, Heng Chang, Yueting Zhuang |
| 2024 | TroL: Traversal of Layers for Large Language and Vision Models. Byung-Kwan Lee, Sangyun Chung, Chae Won Kim, Beomchan Park, Yong Man Ro |
| 2024 | Turn Waste into Worth: Rectifying Top-k Router of MoE. Zhiyuan Zeng, Qipeng Guo, Zhaoye Fei, Zhangyue Yin, Yunhua Zhou, Linyang Li, Tianxiang Sun, Hang Yan, Dahua Lin, Xipeng Qiu |
| 2024 | Twists, Humps, and Pebbles: Multilingual Speech Recognition Models Exhibit Gender Performance Gaps. Giuseppe Attanasio, Beatrice Savoldi, Dennis Fucci, Dirk Hovy |
| 2024 | UNICORN: A Unified Causal Video-Oriented Language-Modeling Framework for Temporal Video-Language Tasks. Yuanhao Xiong, Yixin Nie, Haotian Liu, Boxin Wang, Jun Chen, Rong Jin, Cho-Jui Hsieh, Lorenzo Torresani, Jie Lei |
| 2024 | UNO Arena for Evaluating Sequential Decision-Making Capability of Large Language Models. Zhanyue Qin, Haochuan Wang, Deyuan Liu, Ziyang Song, Cunhang Fan, Zhao Lv, Jinlin Wu, Zhen Lei, Zhiying Tu, Dianhui Chu, Xiaoyan Yu, Dianbo Sui |
| 2024 | UOUO: Uncontextualized Uncommon Objects for Measuring Knowledge Horizons of Vision Language Models. Xinyu Pi, Mingyuan Wu, Jize Jiang, Haozhen Zheng, Beitong Tian, ChengXiang Zhai, Klara Nahrstedt, Zhiting Hu |
| 2024 | Uncertainty in Language Models: Assessment through Rank-Calibration. Xinmeng Huang, Shuo Li, Mengxin Yu, Matteo Sesia, Hamed Hassani, Insup Lee, Osbert Bastani, Edgar Dobriban |
| 2024 | Understanding "Democratization" in NLP and ML Research. Arjun Subramonian, Vagrant Gautam, Dietrich Klakow, Zeerak Talat |
| 2024 | Understanding Higher-Order Correlations Among Semantic Components in Embeddings. Momose Oyama, Hiroaki Yamagiwa, Hidetoshi Shimodaira |
| 2024 | Understanding Slang with LLMs: Modelling Cross-Cultural Nuances through Paraphrasing. Ifeoluwa Wuraola, Nina Dethlefs, Daniel Marciniak |
| 2024 | Understanding and Mitigating Language Confusion in LLMs. Kelly Marchisio, Wei-Yin Ko, Alexandre Berard, Théo Dehaze, Sebastian Ruder |
| 2024 | UniFashion: A Unified Vision-Language Model for Multimodal Fashion Retrieval and Generation. Xiangyu Zhao, Yuehan Zhang, Wenlong Zhang, Xiao-Ming Wu |
| 2024 | UniGen: Universal Domain Generalization for Sentiment Classification via Zero-shot Dataset Generation. Juhwan Choi, Yeonghwa Kim, Seunguk Yu, Jungmin Yun, YoungBin Kim |
| 2024 | Unifying Multimodal Retrieval via Document Screenshot Embedding. Xueguang Ma, Sheng-Chieh Lin, Minghan Li, Wenhu Chen, Jimmy Lin |
| 2024 | Universal Vulnerabilities in Large Language Models: Backdoor Attacks for In-context Learning. Shuai Zhao, Meihuizi Jia, Anh Tuan Luu, Fengjun Pan, Jinming Wen |
| 2024 | Unknown Claims: Generation of Fact-Checking Training Examples from Unstructured and Structured Data. Jean-Flavien Bussotti, Luca Ragazzi, Giacomo Frisoni, Gianluca Moro, Paolo Papotti |
| 2024 | Unlabeled Debiasing in Downstream Tasks via Class-wise Low Variance Regularization. Shahed Masoudian, Markus Frohmann, Navid Rekabsaz, Markus Schedl |
| 2024 | Unleashing the Power of Emojis in Texts via Self-supervised Graph Pre-Training. Zhou Zhang, Dongzeng Tan, Jiaan Wang, Yilong Chen, Jiarong Xu |
| 2024 | Unlocking Anticipatory Text Generation: A Constrained Approach for Large Language Models Decoding. Lifu Tu, Semih Yavuz, Jin Qu, Jiacheng Xu, Rui Meng, Caiming Xiong, Yingbo Zhou |
| 2024 | Unlocking Markets: A Multilingual Benchmark to Cross-Market Question Answering. Yifei Yuan, Yang Deng, Anders Søgaard, Mohammad Aliannejadi |
| 2024 | Unlocking Memorization in Large Language Models with Dynamic Soft Prompting. Zhepeng Wang, Runxue Bao, Yawen Wu, Jackson Taylor, Cao Xiao, Feng Zheng, Weiwen Jiang, Shangqian Gao, Yanfu Zhang |
| 2024 | Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models. Tianyi Men, Pengfei Cao, Zhuoran Jin, Yubo Chen, Kang Liu, Jun Zhao |
| 2024 | Unraveling Babel: Exploring Multilingual Activation Patterns of LLMs and Their Applications. Weize Liu, Yinlong Xu, Hongxia Xu, Jintai Chen, Xuming Hu, Jian Wu |
| 2024 | Unsupervised Discrete Representations of American Sign Language. Artem Abzaliev, Rada Mihalcea |
| 2024 | Unsupervised End-to-End Task-Oriented Dialogue with LLMs: The Power of the Noisy Channel. Brendan King, Jeffrey Flanigan |
| 2024 | Unsupervised Extraction of Dialogue Policies from Conversations. Makesh Narsimhan Sreedhar, Traian Rebedea, Christopher Parisien |
| 2024 | Unsupervised Human Preference Learning. Sumuk Shashidhar, Abhinav Chinta, Vaibhav Sahai, Dilek Hakkani-Tür |
| 2024 | Unsupervised Named Entity Disambiguation for Low Resource Domains. Debarghya Datta, Soumajit Pramanik |
| 2024 | Unveiling Factual Recall Behaviors of Large Language Models through Knowledge Neurons. Yifei Wang, Yuheng Chen, Wanting Wen, Yu Sheng, Linjing Li, Daniel Zeng |
| 2024 | Unveiling In-Context Learning: A Coordinate System to Understand Its Working Mechanism. Anhao Zhao, Fanghua Ye, Jinlan Fu, Xiaoyu Shen |
| 2024 | Unveiling Multi-level and Multi-modal Semantic Representations in the Human Brain using Large Language Models. Yuko Nakagi, Takuya Matsuyama, Naoko Koide-Majima, Hiroto Yamaguchi, Rieko Kubo, Shinji Nishimoto, Yu Takagi |
| 2024 | Unveiling and Consulting Core Experts in Retrieval-Augmented MoE-based LLMs. Xin Zhou, Ping Nie, Yiwen Guo, Haojie Wei, Zhanqiu Zhang, Pasquale Minervini, Ruotian Ma, Tao Gui, Qi Zhang, Xuanjing Huang |
| 2024 | Unveiling the Lexical Sensitivity of LLMs: Combinatorial Optimization for Prompt Enhancement. Pengwei Zhan, Zhen Xu, Qian Tan, Jie Song, Ru Xie |
| 2024 | Unveiling the Role of Pretraining in Direct Speech Translation. Belen Alastruey, Gerard I. Gállego, Marta R. Costa-jussà |
| 2024 | Unveiling the mystery of visual attributes of concrete and abstract concepts: Variability, nearest neighbors, and challenging categories. Tarun Tater, Sabine Schulte im Walde, Diego Frassinelli |
| 2024 | Updating CLIP to Prefer Descriptions Over Captions. Amir Zur, Elisa Kreiss, Karel D'Oosterlinck, Christopher Potts, Atticus Geiger |
| 2024 | User Inference Attacks on Large Language Models. Nikhil Kandpal, Krishna Pillutla, Alina Oprea, Peter Kairouz, Christopher A. Choquette-Choo, Zheng Xu |
| 2024 | Using Language Models to Disambiguate Lexical Choices in Translation. Josh Barua, Sanjay Subramanian, Kayo Yin, Alane Suhr |
| 2024 | VGBench: Evaluating Large Language Models on Vector Graphics Understanding and Generation. Bocheng Zou, Mu Cai, Jianrui Zhang, Yong Jae Lee |
| 2024 | VHASR: A Multimodal Speech Recognition System With Vision Hotwords. Jiliang Hu, Zuchao Li, Ping Wang, Haojun Ai, Lefei Zhang, Hai Zhao |
| 2024 | VIEWS: Entity-Aware News Video Captioning. Hammad A. Ayyubi, Tianqi Liu, Arsha Nagrani, Xudong Lin, Mingda Zhang, Anurag Arnab, Feng Han, Yukun Zhu, Xuande Feng, Kevin Zhang, Jialu Liu, Shih-Fu Chang |
| 2024 | VIMI: Grounding Video Generation through Multi-modal Instruction. Yuwei Fang, Willi Menapace, Aliaksandr Siarohin, Tsai-Shien Chen, Kuan-Chieh Wang, Ivan Skorokhodov, Graham Neubig, Sergey Tulyakov |
| 2024 | VIVA: A Benchmark for Vision-Grounded Decision-Making with Human Values. Zhe Hu, Yixiao Ren, Jing Li, Yu Yin |
| 2024 | VLEU: a Method for Automatic Evaluation for Generalizability of Text-to-Image Models. Jingtao Cao, Zhang Zheng, Hongru Wang, Kam-Fai Wong |
| 2024 | VLFeedback: A Large-Scale AI Feedback Dataset for Large Vision-Language Models Alignment. Lei Li, Zhihui Xie, Mukai Li, Shunian Chen, Peiyi Wang, Liang Chen, Yazheng Yang, Benyou Wang, Lingpeng Kong, Qi Liu |
| 2024 | VPTQ: Extreme Low-bit Vector Post-Training Quantization for Large Language Models. Yifei Liu, Jicheng Wen, Yang Wang, Shengyu Ye, Li Lyna Zhang, Ting Cao, Cheng Li, Mao Yang |
| 2024 | Varying Sentence Representations via Condition-Specified Routers. Ziyong Lin, Quansen Wang, Zixia Jia, Zilong Zheng |
| 2024 | Verba volant, scripta volant? Don't worry! There are computational solutions for protoword reconstruction. Liviu P. Dinu, Ana Sabina Uban, Alina Maria Cristea, Ioan-Bogdan Iordache, Teodor-George Marchitan, Simona Georgescu, Laurentiu Zoicas |
| 2024 | Verifiable, Debuggable, and Repairable Commonsense Logical Reasoning via LLM-based Theory Resolution. Armin Toroghi, Willis Guo, Ali Pesaranghader, Scott Sanner |
| 2024 | Verification and Refinement of Natural Language Explanations through LLM-Symbolic Theorem Proving. Xin Quan, Marco Valentino, Louise A. Dennis, André Freitas |
| 2024 | VerifyMatch: A Semi-Supervised Learning Paradigm for Natural Language Inference with Confidence-Aware MixUp. Seoyeon Park, Cornelia Caragea |
| 2024 | Video-LLaVA: Learning United Visual Representation by Alignment Before Projection. Bin Lin, Yang Ye, Bin Zhu, Jiaxi Cui, Munan Ning, Peng Jin, Li Yuan |
| 2024 | Video-Text Prompting for Weakly Supervised Spatio-Temporal Video Grounding. Heng Zhao, Yinjie Zhao, Bihan Wen, Yew-Soon Ong, Joey Zhou |
| 2024 | VideoCLIP-XL: Advancing Long Description Understanding for Video CLIP Models. Jiapeng Wang, Chengyu Wang, Kunzhe Huang, Jun Huang, Lianwen Jin |
| 2024 | VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation. Xuan He, Dongfu Jiang, Ge Zhang, Max Ku, Achint Soni, Sherman Siu, Haonan Chen, Abhranil Chandra, Ziyan Jiang, Aaran Arulraj, Kai Wang, Quy Duc Do, Yuansheng Ni, Bohan Lyu, Yaswanth Narsupalli, Rongqi Fan, Zhiheng Lyu, Bill Yuchen Lin, Wenhu Chen |
| 2024 | Virtual Personas for Language Models via an Anthology of Backstories. Suhong Moon, Marwa Abdulhai, Minwoo Kang, Joseph Suh, Widyadewi Soedarmadji, Eran Kohen Behar, David M. Chan |
| 2024 | Vision-Language Model Fine-Tuning via Simple Parameter-Efficient Modification. Ming Li, Jike Zhong, Chenxin Li, Liuzhuozheng Li, Nie Lin, Masashi Sugiyama |
| 2024 | Visual Prompting in LLMs for Enhancing Emotion Recognition. Qixuan Zhang, Zhifeng Wang, Dylan Zhang, Wenjia Niu, Sabrina B. Caldwell, Tom Gedeon, Yang Liu, Zhenyue Qin |
| 2024 | Visual Text Matters: Improving Text-KVQA with Visual Text Entity Knowledge-aware Large Multimodal Assistant. Abhirama Subramanyam Penamakuri, Anand Mishra |
| 2024 | Voices Unheard: NLP Resources and Models for Yorùbá Regional Dialects. Orevaoghene Ahia, Anuoluwapo Aremu, Diana Abagyan, Hila Gonen, David Ifeoluwa Adelani, Daud Abolade, Noah A. Smith, Yulia Tsvetkov |
| 2024 | Voices in a Crowd: Searching for clusters of unique perspectives. Nikolas Vitsakis, Amit Parekh, Ioannis Konstas |
| 2024 | WPO: Enhancing RLHF with Weighted Preference Optimization. Wenxuan Zhou, Ravi Agrawal, Shujian Zhang, Sathish Reddy Indurthi, Sanqiang Zhao, Kaiqiang Song, Silei Xu, Chenguang Zhu |
| 2024 | Walking in Others' Shoes: How Perspective-Taking Guides Large Language Models in Reducing Toxicity and Bias. Rongwu Xu, Zi'an Zhou, Tianwei Zhang, Zehan Qi, Su Yao, Ke Xu, Wei Xu, Han Qiu |
| 2024 | Watch Every Step! LLM Agent Learning via Iterative Step-level Process Refinement. Weimin Xiong, Yifan Song, Xiutian Zhao, Wenhao Wu, Xun Wang, Ke Wang, Cheng Li, Wei Peng, Sujian Li |
| 2024 | Waterfall: Scalable Framework for Robust Text Watermarking and Provenance for LLMs. Gregory Kang Ruey Lau, Xinyuan Niu, Hieu Dao, Jiangwei Chen, Chuan-Sheng Foo, Bryan Kian Hsiang Low |
| 2024 | Weak Reward Model Transforms Generative Models into Robust Causal Event Extraction Systems. Italo Luis da Silva, Hanqi Yan, Lin Gui, Yulan He |
| 2024 | What Are the Odds? Language Models Are Capable of Probabilistic Reasoning. Akshay Paruchuri, Jake Garrison, Shun Liao, John Hernandez, Jacob E. Sunshine, Tim Althoff, Xin Liu, Daniel McDuff |
| 2024 | What are the Generator Preferences for End-to-end Task-Oriented Dialog System? Wanshi Xu, Xianwei Zhuang, Zhanpeng Chen, Zhihong Zhu, Xuxin Cheng, Yuexian Zou |
| 2024 | What do Large Language Models Need for Machine Translation Evaluation? Shenbin Qian, Archchana Sindhujan, Minnie Kabra, Diptesh Kanojia, Constantin Orasan, Tharindu Ranasinghe, Frédéric Blain |
| 2024 | What is "Typological Diversity" in NLP? Esther Ploeger, Wessel Poelman, Miryam de Lhoneux, Johannes Bjerva |
| 2024 | What is lost in Normalization? Exploring Pitfalls in Multilingual ASR Model Evaluations. Kavya Manohar, Leena G. Pillai |
| 2024 | What the Harm? Quantifying the Tangible Impact of Gender Bias in Machine Translation with a Human-centered Study. Beatrice Savoldi, Sara Papi, Matteo Negri, Ana Guerberof Arenas, Luisa Bentivogli |
| 2024 | What's Mine becomes Yours: Defining, Annotating and Detecting Context-Dependent Paraphrases in News Interview Dialogs. Anna Wegmann, Tijs A. van den Broek, Dong Nguyen |
| 2024 | When Context Leads but Parametric Memory Follows in Large Language Models. Yufei Tao, Adam Hiatt, Erik Haake, Antonie J. Jetter, Ameeta Agrawal |
| 2024 | When Generative Adversarial Networks Meet Sequence Labeling Challenges. Yu Tong, Ge Chen, Guokai Zheng, Rui Li, Jiang Dazhi |
| 2024 | When Is Multilinguality a Curse? Language Modeling for 250 High- and Low-Resource Languages. Tyler A. Chang, Catherine Arnett, Zhuowen Tu, Ben Bergen |
| 2024 | When LLMs Meets Acoustic Landmarks: An Efficient Approach to Integrate Speech into Large Language Models for Depression Detection. Xiangyu Zhang, Hexin Liu, Kaishuai Xu, Qiquan Zhang, Daijiao Liu, Beena Ahmed, Julien Epps |
| 2024 | When Parts Are Greater Than Sums: Individual LLM Components Can Outperform Full Models. Ting-Yun Chang, Jesse Thomason, Robin Jia |
| 2024 | When Reasoning Meets Information Aggregation: A Case Study with Sports Narratives. Yebowen Hu, Kaiqiang Song, Sangwoo Cho, Xiaoyang Wang, Wenlin Yao, Hassan Foroosh, Dong Yu, Fei Liu |
| 2024 | Where Am I From? Identifying Origin of LLM-generated Content. Liying Li, Yihan Bai, Minhao Cheng |
| 2024 | Where am I? Large Language Models Wandering between Semantics and Structures in Long Contexts. Seonmin Koo, Jinsung Kim, Youngjoon Jang, Chanjun Park, Heuiseok Lim |
| 2024 | Where is the signal in tokenization space? Renato Lui Geh, Honghua Zhang, Kareem Ahmed, Benjie Wang, Guy Van den Broeck |
| 2024 | Which Programming Language and What Features at Pre-training Stage Affect Downstream Logical Inference Performance? Fumiya Uchiyama, Takeshi Kojima, Andrew Gambardella, Qi Cao, Yusuke Iwasawa, Yutaka Matsuo |
| 2024 | Which questions should I answer? Salience Prediction of Inquisitive Questions. Yating Wu, Ritika Mangla, Alex Dimakis, Greg Durrett, Junyi Jessy Li |
| 2024 | Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models. Hongbang Yuan, Pengfei Cao, Zhuoran Jin, Yubo Chen, Daojian Zeng, Kang Liu, Jun Zhao |
| 2024 | Whiteboard-of-Thought: Thinking Step-by-Step Across Modalities. Sachit Menon, Richard S. Zemel, Carl Vondrick |
| 2024 | Who is better at math, Jenny or Jingzhen? Uncovering Stereotypes in Large Language Models. Zara Siddique, Liam D. Turner, Luis Espinosa Anke |
| 2024 | Why Does New Knowledge Create Messy Ripple Effects in LLMs? Jiaxin Qin, Zixuan Zhang, Chi Han, Pengfei Yu, Manling Li, Heng Ji |
| 2024 | Why do objects have many names? A study on word informativeness in language use and lexical systems. Eleonora Gualdoni, Gemma Boleda |
| 2024 | Will LLMs Replace the Encoder-Only Models in Temporal Relation Classification? Gabriel Roccabruna, Massimo Rizzoli, Giuseppe Riccardi |
| 2024 | With Ears to See and Eyes to Hear: Sound Symbolism Experiments with Multimodal Large Language Models. Tyler Loakman, Yucheng Li, Chenghua Lin |
| 2024 | Word Alignment as Preference for Machine Translation. Qiyu Wu, Masaaki Nagata, Zhongtao Miao, Yoshimasa Tsuruoka |
| 2024 | Words Matter: Reducing Stigma in Online Conversations about Substance Use with Large Language Models. Layla Bouzoubaa, Elham Aghakhani, Rezvaneh Rezapour |
| 2024 | Words Worth a Thousand Pictures: Measuring and Understanding Perceptual Variability in Text-to-Image Generation. Raphael Tang, Xinyu Zhang, Lixinyu Xu, Yao Lu, Wenyan Li, Pontus Stenetorp, Jimmy Lin, Ferhan Ture |
| 2024 | Working Memory Identifies Reasoning Limits in Language Models. Chunhui Zhang, Yiren Jian, Zhongyu Ouyang, Soroush Vosoughi |
| 2024 | World to Code: Multi-modal Data Generation via Self-Instructed Compositional Captioning and Filtering. Jiacong Wang, Bohong Wu, Haiyong Jiang, Xun Zhou, Xin Xiao, Haoyuan Guo, Jun Xiao |
| 2024 | WorryWords: Norms of Anxiety Association for over 44k English Words. Saif Mohammad |
| 2024 | XDetox: Text Detoxification with Token-Level Toxicity Explanations. Beomseok Lee, Hyunwoo Kim, Keon Kim, Yong Suk Choi |
| 2024 | XplainLLM: A Knowledge-Augmented Dataset for Reliable Grounded Explanations in LLMs. Zichen Chen, Jianda Chen, Ambuj K. Singh, Misha Sra |
| 2024 | You Make me Feel like a Natural Question: Training QA Systems on Transformed Trivia Questions. Tasnim Kabir, Yoo Yeon Sung, Saptarashmi Bandyopadhyay, Hao Zou, Abhranil Chandra, Jordan L. Boyd-Graber |
| 2024 | ZEBRA: Zero-Shot Example-Based Retrieval Augmentation for Commonsense Question Answering. Francesco Molfese, Simone Conia, Riccardo Orlando, Roberto Navigli |
| 2024 | Zero-Shot Cross-Lingual NER Using Phonemic Representations for Low-Resource Languages. Jimin Sohn, Haeji Jung, Alex Cheng, Jooeon Kang, Yilin Du, David R. Mortensen |
| 2024 | Zero-Shot Detection of LLM-Generated Text using Token Cohesiveness. Shixuan Ma, Quan Wang |
| 2024 | Zero-shot Cross-Lingual Transfer for Synthetic Data Generation in Grammatical Error Detection. Gaetan Latouche, Marc-André Carbonneau, Benjamin Swanson |
| 2024 | Zero-shot Cross-domain Dialogue State Tracking via Context-aware Auto-prompting and Instruction-following Contrastive Decoding. Xiaoyu Dong, Yujie Feng, Zexin Lu, Guangyuan Shi, Xiao-Ming Wu |
| 2024 | mDPO: Conditional Preference Optimization for Multimodal Large Language Models. Fei Wang, Wenxuan Zhou, James Y. Huang, Nan Xu, Sheng Zhang, Hoifung Poon, Muhao Chen |
| 2024 | xCOMET-lite: Bridging the Gap Between Efficiency and Quality in Learned MT Evaluation Metrics. Daniil Larionov, Mikhail Seleznyov, Vasiliy Viskov, Alexander Panchenko, Steffen Eger |