- Frontmatter
- AdapLeR: Speeding up Inference by Adaptive Length Reduction. Ali Modarressi, Hosein Mohebbi, Mohammad Taher Pilehvar. 1-15
- Quantified Reproducibility Assessment of NLP Results. Anya Belz, Maja Popovic, Simon Mille. 16-28
- Rare Tokens Degenerate All Tokens: Improving Neural Text Generation via Adaptive Gradient Gating for Rare Token Embeddings. Sangwon Yu, Jongyoon Song, Heeseung Kim, Seongmin Lee 0005, Woo-Jong Ryu, Sungroh Yoon. 29-45
- AlephBERT: Language Model Pre-training and Evaluation from Sub-Word to Sentence Level. Amit Seker, Elron Bandel, Dan Bareket, Idan Brusilovsky, Refael Shaked Greenfeld, Reut Tsarfaty. 46-56
- Learning to Imagine: Integrating Counterfactual Thinking in Neural Discrete Reasoning. Moxin Li, Fuli Feng, Hanwang Zhang, Xiangnan He 0001, Fengbin Zhu, Tat-Seng Chua. 57-69
- Domain Adaptation in Multilingual and Multi-Domain Monolingual Settings for Complex Word Identification. George-Eduard Zaharia, Razvan-Alexandru Smadu, Dumitru-Clementin Cercel, Mihai Dascalu. 70-80
- JointCL: A Joint Contrastive Learning Framework for Zero-Shot Stance Detection. Bin Liang, Qinglin Zhu, Xiang Li, Min Yang 0007, Lin Gui 0003, Yulan He 0001, Ruifeng Xu. 81-91
- [CASPI] Causal-aware Safe Policy Improvement for Task-oriented Dialogue. Govardana Sachithanandam Ramachandran, Kazuma Hashimoto, Caiming Xiong. 92-102
- UniTranSeR: A Unified Transformer Semantic Representation Framework for Multimodal Task-Oriented Dialog System. Zhiyuan Ma, Jianjun Li, Guohui Li 0001, Yongjing Cheng. 103-114
- Dynamic Schema Graph Fusion Network for Multi-Domain Dialogue State Tracking. Yue Feng, Aldo Lipani, Fanghua Ye 0001, Qiang Zhang, Emine Yilmaz. 115-126
- Attention Temperature Matters in Abstractive Summarization Distillation. Shengqiang Zhang, Xingxing Zhang, Hangbo Bao, Furu Wei. 127-141
- Towards Making the Most of Cross-Lingual Transfer for Zero-Shot Neural Machine Translation. Guanhua Chen, Shuming Ma, Yun Chen, Dongdong Zhang 0001, Jia Pan, Wenping Wang, Furu Wei. 142-157
- TopWORDS-Seg: Simultaneous Text Segmentation and Word Discovery for Open-Domain Chinese Texts via Bayesian Inference. Changzai Pan, Maosong Sun, Ke Deng. 158-169
- An Unsupervised Multiple-Task and Multiple-Teacher Model for Cross-lingual Named Entity Recognition. Zhuoran Li, Chunming Hu, Xiaohui Guo, Junfan Chen, Wenyi Qin, Richong Zhang. 170-179
- Discriminative Marginalized Probabilistic Neural Method for Multi-Document Summarization of Medical Literature. Gianluca Moro, Luca Ragazzi, Lorenzo Valgimigli, Davide Freddi. 180-189
- Sparse Progressive Distillation: Resolving Overfitting under Pretrain-and-Finetune Paradigm. Shaoyi Huang, Dongkuan Xu, Ian En-Hsu Yen, Yijue Wang, Sung-En Chang, Bingbing Li, Shiyang Chen, Mimi Xie, Sanguthevar Rajasekaran, Hang Liu, Caiwen Ding. 190-200
- CipherDAug: Ciphertext based Data Augmentation for Neural Machine Translation. Nishant Kambhatla, Logan Born, Anoop Sarkar. 201-218
- Overlap-based Vocabulary Generation Improves Cross-lingual Transfer Among Related Languages. Vaidehi Patil, Partha P. Talukdar, Sunita Sarawagi. 219-233
- Long-range Sequence Modeling with Predictable Sparse Attention. Yimeng Zhuang, Jing Zhang, Mei Tu. 234-243
- Improving Personalized Explanation Generation through Visualization. Shijie Geng, Zuohui Fu, Yingqiang Ge, Lei Li, Gerard de Melo, Yongfeng Zhang. 244-255
- New Intent Discovery with Pre-training and Contrastive Learning. Yuwei Zhang, Haode Zhang, Li-Ming Zhan, Xiao-Ming Wu 0003, Albert Y. S. Lam. 256-269
- Modeling U.S. State-Level Policies by Extracting Winners and Losers from Legislative Texts. Maryam Davoodi, Eric Waltenburg, Dan Goldwasser. 270-284
- Structural Characterization for Dialogue Disentanglement. Xinbei Ma, Zhuosheng Zhang 0001, Hai Zhao. 285-297
- Multi-Party Empathetic Dialogue Generation: A New Task for Dialog Systems. Lingyu Zhu 0007, Zhengkun Zhang, Jun Wang 0023, Hongbin Wang, Haiying Wu, Zhenglu Yang. 298-307
- MISC: A Mixed Strategy-Aware Model integrating COMET for Emotional Support Conversation. Quan Tu, Yanran Li, Jianwei Cui, Bin Wang 0004, Ji-Rong Wen, Rui Yan 0001. 308-319
- GLM: General Language Model Pretraining with Autoregressive Blank Infilling. Zhengxiao Du, Yujie Qian, Xiao Liu, Ming Ding, Jiezhong Qiu, Zhilin Yang, Jie Tang 0001. 320-335
- QuoteR: A Benchmark of Quote Recommendation for Writing. Fanchao Qi, Yanhui Yang, Jing Yi, Zhili Cheng, Zhiyuan Liu, Maosong Sun. 336-348
- Towards Comprehensive Patent Approval Predictions: Beyond Traditional Document Classification. Xiaochen Gao, Zhaoyi Hou, Yifei Ning, Kewen Zhao, Beilei He, Jingbo Shang, Vish Krishnan. 349-372
- Hypergraph Transformer: Weakly-Supervised Multi-hop Reasoning for Knowledge-based Visual Question Answering. Yu-Jung Heo, Eun-Sol Kim, Woo Suk Choi, Byoung-Tak Zhang. 373-390
- Cross-Utterance Conditioned VAE for Non-Autoregressive Text-to-Speech. Yang Li, Cheng Yu, Guangzhi Sun, Hua Jiang, Fanglei Sun, Weiqin Zu, Ying Wen, Yang Yang 0001, Jun Wang 0012. 391-400
- Mix and Match: Learning-free Controllable Text Generation using Energy Language Models. Fatemehsadat Mireshghallah, Kartik Goyal, Taylor Berg-Kirkpatrick. 401-415
- So Different Yet So Alike! Constrained Unsupervised Text Style Transfer. Abhinav Ramesh Kashyap, Devamanyu Hazarika, Min-Yen Kan, Roger Zimmermann, Soujanya Poria. 416-431
- e-CARE: a New Dataset for Exploring Explainable Causal Reasoning. Li Du, Xiao Ding, Kai Xiong, Ting Liu 0001, Bing Qin 0001. 432-446
- Fantastic Questions and Where to Find Them: FairytaleQA - An Authentic Dataset for Narrative Comprehension. Ying Xu, Dakuo Wang, Mo Yu, Daniel Ritchie, Bingsheng Yao, Tongshuang Wu, Zheng Zhang, Toby Jia-Jun Li, Nora Bradford, Branda Sun, Tran-Hoang, Yisi Sang, Yufang Hou 0001, Xiaojuan Ma, Diyi Yang, Nanyun Peng, Zhou Yu, Mark Warschauer. 447-460
- KaFSP: Knowledge-Aware Fuzzy Semantic Parsing for Conversational Question Answering over a Large-Scale Knowledge Base. Junzhuo Li, Deyi Xiong. 461-473
- Multilingual Knowledge Graph Completion with Self-Supervised Adaptive Graph Alignment. Zijie Huang 0002, Zheng Li, Haoming Jiang, Tianyu Cao, Hanqing Lu, Bing Yin, Karthik Subbian, Yizhou Sun, Wei Wang. 474-485
- Modeling Hierarchical Syntax Structure with Triplet Position for Source Code Summarization. Juncai Guo 0003, Jin Liu 0016, Yao Wan, Li Li 0029, Pingyi Zhou. 486-500
- FewNLU: Benchmarking State-of-the-Art Methods for Few-Shot Natural Language Understanding. Yanan Zheng, Jing Zhou, Yujie Qian, Ming Ding, Chonghua Liao, Li Jian, Ruslan Salakhutdinov, Jie Tang, Sebastian Ruder, Zhilin Yang. 501-516
- Learn to Adapt for Generalized Zero-Shot Text Classification. Yiwen Zhang, Caixia Yuan, Xiaojie Wang 0006, Ziwei Bai, Yongbin Liu. 517-527
- TableFormer: Robust Transformer Modeling for Table-Text Encoding. Jingfeng Yang, Aditya Gupta, Shyam Upadhyay, Luheng He, Rahul Goel, Shachi Paul. 528-537
- Perceiving the World: Question-guided Reinforcement Learning for Text-based Games. Yunqiu Xu, Meng Fang, Ling Chen, Yali Du, Joey Zhou, Chengqi Zhang. 538-560
- Neural Label Search for Zero-Shot Multi-Lingual Extractive Summarization. Ruipeng Jia, Xingxing Zhang, Yanan Cao, Zheng Lin 0001, Shi Wang, Furu Wei. 561-570
- Few-Shot Class-Incremental Learning for Named Entity Recognition. Rui Wang, Tong Yu 0001, Handong Zhao, SungChul Kim, Subrata Mitra, Ruiyi Zhang, Ricardo Henao. 571-582
- Improving Meta-learning for Low-resource Text Classification and Generation via Memory Imitation. Yingxiu Zhao, Zhiliang Tian, Huaxiu Yao, Yinhe Zheng, Dongkyu Lee, Yiping Song, Jian Sun, Nevin L. Zhang. 583-595
- Quality Controlled Paraphrase Generation. Elron Bandel, Ranit Aharonov, Michal Shmueli-Scheuer, Ilya Shnayderman, Noam Slonim, Liat Ein-Dor. 596-609
- Controllable Dictionary Example Generation: Generating Example Sentences for Specific Targeted Audiences. Xingwei He 0003, Siu-Ming Yiu. 610-627
- AraT5: Text-to-Text Transformers for Arabic Language Generation. El Moatez Billah Nagoudi, AbdelRahim A. Elmadany, Muhammad Abdul-Mageed. 628-647
- Legal Judgment Prediction via Event Extraction with Constraints. Yi Feng, Chuanyi Li, Vincent Ng 0001. 648-664
- Answer-level Calibration for Free-form Multiple Choice Question Answering. Sawan Kumar. 665-679
- Learning When to Translate for Streaming Speech. Qian Dong, Yaoming Zhu, Mingxuan Wang, Lei Li 0005. 680-694
- Compact Token Representations with Contextual Quantization for Efficient Document Re-ranking. Yingrui Yang, Yifan Qiao, Tao Yang. 695-707
- Early Stopping Based on Unlabeled Samples in Text Classification. Hongseok Choi, Dongha Choi, Hyunju Lee. 708-718
- Meta-learning via Language Model In-context Tuning. Yanda Chen, Ruiqi Zhong, Sheng Zha, George Karypis, He He 0001. 719-730
- It is AI's Turn to Ask Humans a Question: Question-Answer Pair Generation for Children's Story Books. Bingsheng Yao, Dakuo Wang, Tongshuang Wu, Zheng Zhang, Toby Jia-Jun Li, Mo Yu, Ying Xu. 731-744
- Prompt-Based Rule Discovery and Boosting for Interactive Weakly-Supervised Learning. Rongzhi Zhang, Yue Yu, Pranav Shetty, Le Song, Chao Zhang. 745-758
- Constrained Multi-Task Learning for Bridging Resolution. Hideo Kobayashi, Yufang Hou 0001, Vincent Ng 0001. 759-770
- DEAM: Dialogue Coherence Evaluation using AMR-based Semantic Manipulations. Sarik Ghazarian, Nuan Wen, Aram Galstyan, Nanyun Peng. 771-785
- HIBRIDS: Attention with Hierarchical Biases for Structure-aware Long Document Summarization. Shuyang Cao, Lu Wang 0008. 786-807
- De-Bias for Generative Extraction in Unified NER Task. Shuai Zhang, Yongliang Shen 0001, Zeqi Tan, Yiquan Wu, Weiming Lu 0001. 808-818
- An Information-theoretic Approach to Prompt Engineering Without Ground Truth Labels. Taylor Sorensen, Joshua Robinson, Christopher Michael Rytting, Alexander Glenn Shaw, Kyle Jeffrey Rogers, Alexia Pauline Delorey, Mahmoud Khalil, Nancy Fulda, David Wingate. 819-862
- Expanding Pretrained Models to Thousands More Languages via Lexicon-based Adaptation. Xinyi Wang, Sebastian Ruder, Graham Neubig. 863-877
- Language-agnostic BERT Sentence Embedding. Fangxiaoyu Feng, Yinfei Yang, Daniel Cer, Naveen Arivazhagan, Wei Wang 0236. 878-891
- Nested Named Entity Recognition with Span-level Graphs. Juncheng Wan, Dongyu Ru, Weinan Zhang 0001, Yong Yu 0001. 892-903
- CogTaskonomy: Cognitively Inspired Task Taxonomy Is Beneficial to Transfer Learning in NLP. Yifei Luo, Minghui Xu, Deyi Xiong. 904-920
- RoCBert: Robust Chinese Bert with Multimodal Contrastive Pretraining. Hui Su, Weiwei Shi, Xiaoyu Shen 0001, Zhou Xiao, Tuo Ji, Jiarui Fang, Jie Zhou 0016. 921-931
- Premise-based Multimodal Reasoning: Conditional Inference on Joint Textual and Visual Clues. Qingxiu Dong, Ziwei Qin, Heming Xia, Tian Feng, Shoujie Tong, Haoran Meng, Lin Xu, Zhongyu Wei, Weidong Zhan, Baobao Chang, Sujian Li, Tianyu Liu 0001, Zhifang Sui. 932-946
- Parallel Instance Query Network for Named Entity Recognition. Yongliang Shen 0001, XiaoBin Wang, Zeqi Tan, Guangwei Xu, Pengjun Xie, Fei Huang, Weiming Lu 0001, Yueting Zhuang. 947-961
- ProphetChat: Enhancing Dialogue Generation with Simulation of Future Conversation. Chang Liu 0030, Xu Tan 0003, Chongyang Tao, Zhenxin Fu, Dongyan Zhao 0001, Tie-Yan Liu, Rui Yan. 962-973
- Modeling Multi-hop Question Answering as Single Sequence Prediction. Semih Yavuz, Kazuma Hashimoto, Yingbo Zhou, Nitish Shirish Keskar, Caiming Xiong. 974-990
- Learning Disentangled Semantic Representations for Zero-Shot Cross-Lingual Transfer in Multilingual Machine Reading Comprehension. Linjuan Wu, Shaojuan Wu, Xiaowang Zhang, Deyi Xiong, Shizhan Chen, Zhiqiang Zhuang, Zhiyong Feng 0001. 991-1000
- Multi-Granularity Structural Knowledge Distillation for Language Model Compression. Chang Liu, Chongyang Tao, Jiazhan Feng, Dongyan Zhao 0001. 1001-1011
- Auto-Debias: Debiasing Masked Language Models with Automated Biased Prompts. Yue Guo, Yi Yang, Ahmed Abbasi. 1012-1023
- Where to Go for the Holidays: Towards Mixed-Type Dialogs for Clarification of User Goals. Zeming Liu, Jun Xu 0027, Zeyang Lei, Haifeng Wang 0001, Zheng-Yu Niu, Hua Wu. 1024-1034
- Semi-supervised Domain Adaptation for Dependency Parsing with Dynamic Matching Network. Ying Li, Shuaike Li, Min Zhang 0005. 1035-1045
- A Closer Look at How Fine-tuning Changes BERT. Yichu Zhou, Vivek Srikumar. 1046-1061
- Sentence-aware Contrastive Learning for Open-Domain Passage Retrieval. Wu Hong, Zhuosheng Zhang 0001, Jinyuan Wang, Hai Zhao. 1062-1074
- FaiRR: Faithful and Robust Deductive Reasoning over Natural Language. Soumya Sanyal, Harman Singh, Xiang Ren 0001. 1075-1093
- HiTab: A Hierarchical Table Dataset for Question Answering and Natural Language Generation. Zhoujun Cheng, Haoyu Dong 0001, Zhiruo Wang, Ran Jia, Jiaqi Guo, Yan Gao 0002, Shi Han, Jian-Guang Lou, Dongmei Zhang. 1094-1110
- Doctor Recommendation in Online Health Forums via Expertise Learning. Xiaoxin Lu, Yubo Zhang, Jing Li 0049, Shi Zong. 1111-1123
- Continual Prompt Tuning for Dialog State Tracking. Qi Zhu 0007, Bing Li, Fei Mi, Xiaoyan Zhu 0001, Minlie Huang. 1124-1137
- There's a Time and Place for Reasoning Beyond the Image. Xingyu Fu, Ben Zhou, Ishaan Preetam Chandratreya, Carl Vondrick, Dan Roth. 1138-1149
- FORTAP: Using Formulas for Numerical-Reasoning-Aware Table Pretraining. Zhoujun Cheng, Haoyu Dong 0001, Ran Jia, Pengfei Wu, Shi Han, Fan Cheng, Dongmei Zhang. 1150-1166
- Multimodal fusion via cortical network inspired losses. Shiv Shankar. 1167-1178
- Modeling Temporal-Modal Entity Graph for Procedural Multimodal Machine Comprehension. Huibin Zhang, Zhengkun Zhang, Yao Zhang, Jun Wang, Yufan Li, Ning Jiang, Xin Wei, Zhenglu Yang. 1179-1189
- Explanation Graph Generation via Pre-trained Language Models: An Empirical Study with Contrastive Learning. Swarnadeep Saha, Prateek Yadav, Mohit Bansal. 1190-1208
- Unsupervised Extractive Opinion Summarization Using Sparse Coding. Somnath Basu Roy Chowdhury, Chao Zhao, Snigdha Chaturvedi. 1209-1225
- LexSubCon: Integrating Knowledge from Lexical Resources into Contextual Embeddings for Lexical Substitution. George Michalopoulos, Ian McKillop, Alexander Wong, Helen H. Chen. 1226-1236
- Think Before You Speak: Explicitly Generating Implicit Commonsense Knowledge for Response Generation. Pei Zhou, Karthik Gopalakrishnan 0001, Behnam Hedayatnia, Seokhwan Kim, Jay Pujara, Xiang Ren 0001, Yang Liu, Dilek Hakkani-Tur. 1237-1252
- Flow-Adapter Architecture for Unsupervised Machine Translation. Yihong Liu, Haris Jabbar, Hinrich Schütze. 1253-1266
- Efficient Unsupervised Sentence Compression by Fine-tuning Transformers with Reinforcement Learning. Demian Gholipour Ghalandari, Chris Hokamp, Georgiana Ifrim. 1267-1280
- Tracing Origins: Coreference-aware Machine Reading Comprehension. Zhuosheng Zhang 0001, Hai Zhao. 1281-1292
- WatClaimCheck: A new Dataset for Claim Entailment and Inference. Kashif Khan, Ruizhe Wang, Pascal Poupart. 1293-1304
- FrugalScore: Learning Cheaper, Lighter and Faster Evaluation Metrics for Automatic Text Generation. Moussa Kamal Eddine, Guokan Shang, Antoine J.-P. Tixier, Michalis Vazirgiannis. 1305-1318
- A Well-Composed Text is Half Done! Composition Sampling for Diverse Conditional Generation. Shashi Narayan, Gonçalo Simões, Yao Zhao, Joshua Maynez, Dipanjan Das 0001, Michael Collins 0001, Mirella Lapata. 1319-1339
- Synthetic Question Value Estimation for Domain Adaptation of Question Answering. Xiang Yue, Ziyu Yao, Huan Sun. 1340-1351
- Better Language Model with Hypernym Class Prediction. He Bai, Tong Wang, Alessandro Sordoni, Peng Shi 0010. 1352-1362
- Tackling Fake News Detection by Continually Improving Social Context Representations using Graph Neural Networks. Nikhil Mehta, Maria Pacheco, Dan Goldwasser. 1363-1380
- Understanding Gender Bias in Knowledge Base Embeddings. Yupei Du, Qi Zheng, Yuanbin Wu, Man Lan, Yan Yang, Meirong Ma. 1381-1395
- Computational Historical Linguistics and Language Diversity in South Asia. Aryaman Arora, Adam Farris, Samopriya Basu, Suresh Kolichala. 1396-1409
- Faithful or Extractive? On Mitigating the Faithfulness-Abstractiveness Trade-off in Abstractive Summarization. Faisal Ladhak, Esin Durmus, He He 0001, Claire Cardie, Kathleen R. McKeown. 1410-1421
- Slangvolution: A Causal Analysis of Semantic Change and Frequency Dynamics in Slang. Daphna Keidar, Andreas Opedal, Zhijing Jin, Mrinmaya Sachan. 1422-1442
- Spurious Correlations in Reference-Free Evaluation of Text Generation. Esin Durmus, Faisal Ladhak, Tatsunori Hashimoto. 1443-1454
- On The Ingredients of an Effective Zero-shot Semantic Parser. Pengcheng Yin, John Wieting, Avirup Sil, Graham Neubig. 1455-1474
- Bias Mitigation in Machine Translation Quality Estimation. Hanna Behnke, Marina Fomicheva, Lucia Specia. 1475-1487
- Unified Speech-Text Pre-training for Speech Translation and Recognition. Yun Tang, Hongyu Gong, Ning Dong, Changhan Wang, Wei-Ning Hsu, Jiatao Gu, Alexei Baevski, Xian Li, Abdelrahman Mohamed, Michael Auli, Juan Pino. 1488-1499
- Match the Script, Adapt if Multilingual: Analyzing the Effect of Multilingual Pretraining on Cross-lingual Transferability. Yoshinari Fujinuma, Jordan L. Boyd-Graber, Katharina Kann. 1500-1512
- Structured Pruning Learns Compact and Accurate Models. Mengzhou Xia, Zexuan Zhong, Danqi Chen. 1513-1528
- How can NLP Help Revitalize Endangered Languages? A Case Study and Roadmap for the Cherokee Language. Shiyue Zhang, Benjamin Frey, Mohit Bansal. 1529-1541
- Differentiable Multi-Agent Actor-Critic for Multi-Step Radiology Report Summarization. Sanjeev Kumar Karn, Ning Liu, Hinrich Schütze, Oladimeji Farri. 1542-1553
- Online Semantic Parsing for Latency Reduction in Task-Oriented Dialogue. Jiawei Zhou, Jason Eisner, Michael Newman, Emmanouil Antonios Platanios, Sam Thomson. 1554-1576
- Few-Shot Tabular Data Enrichment Using Fine-Tuned Transformer Architectures. Asaf Harari, Gilad Katz. 1577-1591
- Summ$^N$: A Multi-Stage Summarization Framework for Long Input Dialogues and Documents. Yusen Zhang 0001, Ansong Ni, Ziming Mao, Chen Henry Wu, Chenguang Zhu, Budhaditya Deb, Ahmed Hassan Awadallah, Dragomir R. Radev, Rui Zhang. 1592-1604
- Open Domain Question Answering with A Unified Knowledge Interface. Kaixin Ma, Hao Cheng 0002, Xiaodong Liu, Eric Nyberg, Jianfeng Gao. 1605-1620
- Principled Paraphrase Generation with Parallel Corpora. Aitor Ormazabal, Mikel Artetxe, Aitor Soroa, Gorka Labaka, Eneko Agirre. 1621-1638
- GlobalWoZ: Globalizing MultiWoZ to Develop Multilingual Task-Oriented Dialogue Systems. Bosheng Ding, Junjie Hu, Lidong Bing, Mahani Aljunied Mahani, Shafiq R. Joty, Luo Si, Chunyan Miao. 1639-1657
- Domain Knowledge Transferring for Pre-trained Language Model via Calibrated Activation Boundary Distillation. Dongha Choi, Hongseok Choi, Hyunju Lee. 1658-1669
- Retrieval-guided Counterfactual Generation for QA. Bhargavi Paranjape, Matthew Lamm, Ian Tenney. 1670-1686
- DYLE: Dynamic Latent Extraction for Abstractive Long-Input Summarization. Ziming Mao, Chen Henry Wu, Ansong Ni, Yusen Zhang 0001, Rui Zhang, Tao Yu 0009, Budhaditya Deb, Chenguang Zhu, Ahmed Hassan Awadallah, Dragomir R. Radev. 1687-1698
- Searching for fingerspelled content in American Sign Language. Bowen Shi, Diane Brentari, Greg Shakhnarovich, Karen Livescu. 1699-1712
- Skill Induction and Planning with Latent Language. Pratyusha Sharma, Antonio Torralba 0001, Jacob Andreas. 1713-1726
- Fully-Semantic Parsing and Generation: the BabelNet Meaning Representation. Abelardo Carlos Martinez Lorenzo, Marco Maru, Roberto Navigli. 1727-1741
- Leveraging Similar Users for Personalized Language Modeling with Limited Data. Charles Welch, Chenxi Gu, Jonathan K. Kummerfeld, Verónica Pérez-Rosas, Rada Mihalcea. 1742-1752
- DEEP: DEnoising Entity Pre-training for Neural Machine Translation. Junjie Hu 0001, Hiroaki Hayashi, KyungHyun Cho, Graham Neubig. 1753-1766
- Multi-Modal Sarcasm Detection via Cross-Modal Graph Convolutional Network. Bin Liang, Chenwei Lou, Xiang Li, Min Yang, Lin Gui 0003, Yulan He 0001, Wenjie Pei, Ruifeng Xu. 1767-1777
- Composable Sparse Fine-Tuning for Cross-Lingual Transfer. Alan Ansell, Edoardo Maria Ponti, Anna Korhonen, Ivan Vulic. 1778-1796
- Toward Annotator Group Bias in Crowdsourcing. Haochen Liu, Joseph Thekinen, Sinem Mollaoglu, Da Tang, Ji Yang, Youlong Cheng, Hui Liu 0031, Jiliang Tang. 1797-1806
- Under the Morphosyntactic Lens: A Multifaceted Evaluation of Gender Bias in Speech Translation. Beatrice Savoldi, Marco Gaido, Luisa Bentivogli, Matteo Negri, Marco Turchi. 1807-1824
- Answering Open-Domain Multi-Answer Questions via a Recall-then-Verify Framework. Zhihong Shao, Minlie Huang. 1825-1838
- Probing as Quantifying Inductive Bias. Alexander Immer, Lucas Torroba Hennigen, Vincent Fortuin, Ryan Cotterell. 1839-1851
- Probing Structured Pruning on Multilingual Pre-trained Models: Settings, Algorithms, and Efficiency. Yanyang Li, Fuli Luo, Runxin Xu, Songfang Huang, Fei Huang, Liwei Wang. 1852-1865
- GPT-D: Inducing Dementia-related Linguistic Anomalies by Deliberate Degradation of Artificial Neural Language Models. Changye Li, David S. Knopman, Weizhe Xu, Trevor Cohen, Serguei Pakhomov. 1866-1877
- An Empirical Survey of the Effectiveness of Debiasing Techniques for Pre-trained Language Models. Nicholas Meade, Elinor Poole-Dayan, Siva Reddy. 1878-1898
- Exploring and Adapting Chinese GPT to Pinyin Input Method. Minghuan Tan, Yong Dai, Duyu Tang, Zhangyin Feng, Guoping Huang, Jing Jiang, Jiwei Li, Shuming Shi. 1899-1909
- Enhancing Cross-lingual Natural Language Inference by Prompt-learning from Cross-lingual Templates. Kunxun Qi, Hai Wan, Jianfeng Du, Haolan Chen. 1910-1923
- Sense Embeddings are also Biased - Evaluating Social Biases in Static and Contextualised Sense Embeddings. Yi Zhou 0019, Masahiro Kaneko, Danushka Bollegala. 1924-1935
- Hybrid Semantics for Goal-Directed Natural Language Generation. Connor Baumler, Soumya Ray. 1936-1946
- Predicting Intervention Approval in Clinical Trials through Multi-Document Summarization. Georgios Katsimpras, Georgios Paliouras. 1947-1957
- BiTIIMT: A Bilingual Text-infilling Method for Interactive Machine Translation. Yanling Xiao, Lemao Liu, Guoping Huang, Qu Cui, Shujian Huang, Shuming Shi 0001, Jiajun Chen. 1958-1969
- Distributionally Robust Finetuning BERT for Covariate Drift in Spoken Language Understanding. Samuel Broscheit, Quynh Do, Judith Gaspers. 1970-1985
- Enhancing Chinese Pre-trained Language Model via Heterogeneous Linguistics Graph. Yanzeng Li, Jiangxia Cao, Xin Cong, Zhenyu Zhang 0006, Bowen Yu 0002, Hongsong Zhu, Tingwen Liu. 1986-1996
- Divide and Denoise: Learning from Noisy Labels in Fine-Grained Entity Typing with Cluster-Wise Loss Correction. Kunyuan Pang, Haoyu Zhang, Jie Zhou 0013, Ting Wang. 1997-2006
- Towards Robustness of Text-to-SQL Models Against Natural and Realistic Adversarial Table Perturbation. Xinyu Pi, Bing Wang, Yan Gao 0002, Jiaqi Guo, Zhoujun Li, Jian-Guang Lou. 2007-2022
- Overcoming Catastrophic Forgetting beyond Continual Learning: Balanced Training for Neural Machine Translation. Chenze Shao, Yang Feng 0004. 2023-2036
- Metaphors in Pre-Trained Language Models: Probing and Generalization Across Datasets and Languages. Ehsan Aghazadeh, Mohsen Fayyaz, Yadollah Yaghoobzadeh. 2037-2050
- Discrete Opinion Tree Induction for Aspect-based Sentiment Analysis. Chenhua Chen, Zhiyang Teng, Zhongqing Wang, Yue Zhang 0004. 2051-2064
- Investigating Non-local Features for Neural Constituency Parsing. Leyang Cui, Sen Yang, Yue Zhang 0004. 2065-2075
- Learning from Sibling Mentions with Scalable Graph Inference in Fine-Grained Entity Typing. Yi Chen 0019, Jiayang Cheng, Haiyun Jiang, Lemao Liu, Haisong Zhang, Shuming Shi 0001, Ruifeng Xu. 2076-2087
- A Variational Hierarchical Model for Neural Cross-Lingual Summarization. Yunlong Liang, Fandong Meng, Chulun Zhou, Jinan Xu, Yufeng Chen, Jinsong Su, Jie Zhou. 2088-2099
- On the Robustness of Question Rewriting Systems to Questions of Varying Hardness. Hai Ye, Hwee Tou Ng, Wenjuan Han. 2100-2113
- OpenHands: Making Sign Language Recognition Accessible with Pose-based Pretrained Models across Languages. Prem Selvaraj, Gokul Nc, Pratyush Kumar, Mitesh M. Khapra. 2114-2133
- bert2BERT: Towards Reusable Pretrained Language Models. Cheng Chen, Yichun Yin, Lifeng Shang, Xin Jiang 0002, Yujia Qin, Fengyu Wang, Zhi Wang, Xiao Chen, Zhiyuan Liu, Qun Liu 0001. 2134-2148
- Vision-Language Pre-Training for Multimodal Aspect-Based Sentiment Analysis. Yan Ling, Jianfei Yu, Rui Xia. 2149-2159
- "You might think about slightly revising the title": Identifying Hedges in Peer-tutoring Interactions. Yann Raphalen, Chloé Clavel, Justine Cassell. 2160-2174
- Efficient Cluster-Based $k$-Nearest-Neighbor Machine Translation. Dexin Wang, Kai Fan, Boxing Chen, Deyi Xiong. 2175-2187
- Headed-Span-Based Projective Dependency Parsing. Songlin Yang, Kewei Tu. 2188-2200
- Decoding Part-of-Speech from Human EEG Signals. Alex Murphy, Bernd Bohnet, Ryan T. McDonald, Uta Noppeney. 2201-2210
- Robust Lottery Tickets for Pre-trained Language Models. Rui Zheng, Bao Rong, Yuhao Zhou, Di Liang, Sirui Wang, Wei Wu, Tao Gui, Qi Zhang, Xuanjing Huang. 2211-2224
- Knowledgeable Prompt-tuning: Incorporating Knowledge into Prompt Verbalizer for Text Classification. Shengding Hu, Ning Ding, Huadong Wang, Zhiyuan Liu, Jingang Wang, Juanzi Li, Wei Wu, Maosong Sun. 2225-2240
- Cross-Lingual Contrastive Learning for Fine-Grained Entity Typing for Low-Resource Languages. Xu Han, Yuqi Luo, Weize Chen, Zhiyuan Liu, Maosong Sun, Botong Zhou, Fei Hao, Suncong Zheng. 2241-2250
- MELM: Data Augmentation with Masked Entity Language Modeling for Low-Resource NER. Ran Zhou, Xin Li, Ruidan He, Lidong Bing, Erik Cambria, Luo Si, Chunyan Miao. 2251-2262
- Word2Box: Capturing Set-Theoretic Semantics of Words using Box Embeddings. Shib Sankar Dasgupta, Michael Boratko, Siddhartha Mishra, Shriya Atmakuri, Dhruvesh Patel, Xiang Li 0069, Andrew McCallum. 2263-2276
- IAM: A Comprehensive and Large-Scale Dataset for Integrated Argument Mining Tasks. LiYing Cheng, Lidong Bing, Ruidan He, Qian Yu, Yan Zhang, Luo Si. 2277-2287
- PLANET: Dynamic Content Planning in Autoregressive Transformers for Long-form Text Generation. Zhe Hu, Hou Pong Chan, Jiachen Liu, Xinyan Xiao, Hua Wu, Lifu Huang. 2288-2305
- CTRLEval: An Unsupervised Reference-Free Metric for Evaluating Controlled Text Generation. Pei Ke, Hao Zhou, Yankai Lin, Peng Li, Jie Zhou, Xiaoyan Zhu, Minlie Huang. 2306-2319
- Beyond the Granularity: Multi-Perspective Dialogue Collaborative Selection for Dialogue State Tracking. Jinyu Guo, Kai Shuang, Jijie Li, Zihan Wang, Yixuan Liu. 2320-2332
- Are Prompt-based Models Clueless? Pride Kavumba, Ryo Takahashi, Yusuke Oda. 2333-2352
- Learning Confidence for Transformer-based Neural Machine Translation. Yu Lu, Jiali Zeng, Jiajun Zhang, Shuangzhi Wu, Mu Li 0001. 2353-2364
- Things not Written in Text: Exploring Spatial Commonsense from Visual Signals. Xiao Liu 0032, Da Yin, Yansong Feng, Dongyan Zhao 0001. 2365-2376
- Conditional Bilingual Mutual Information Based Adaptive Training for Neural Machine Translation. Songming Zhang, Yijin Liu, Fandong Meng, Yufeng Chen, Jinan Xu, Jian Liu, Jie Zhou. 2377-2389
- ClusterFormer: Neural Clustering Attention for Efficient and Effective Transformer. Ningning Wang, Guobing Gan, Peng Zhang 0002, Shuai Zhang, Victor Junqiu Wei, Qun Liu, Xin Jiang 0002. 2390-2402
- Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks. Songlin Yang, Kewei Tu. 2403-2416
- Redistributing Low-Frequency Words: Making the Most of Monolingual Data in Non-Autoregressive Translation. Liang Ding 0006, Longyue Wang, Shuming Shi 0001, Dacheng Tao, Zhaopeng Tu. 2417-2426
- Dependency Parsing as MRC-based Span-Span Prediction. Leilei Gan, Yuxian Meng, Kun Kuang, Xiaofei Sun, Chun Fan, Fei Wu 0001, Jiwei Li. 2427-2437
- Adversarial Soft Prompt Tuning for Cross-Domain Sentiment Analysis. Hui Wu, Xiaodong Shi. 2438-2447
- Generating Scientific Claims for Zero-Shot Scientific Fact Checking. Dustin Wright 0001, David Wadden, Kyle Lo, Bailey Kuehl, Arman Cohan, Isabelle Augenstein, Lucy Lu Wang. 2448-2460
- Modeling Dual Read/Write Paths for Simultaneous Machine Translation. Shaolei Zhang, Yang Feng. 2461-2477
- ExtEnD: Extractive Entity Disambiguation. Edoardo Barba, Luigi Procopio, Roberto Navigli. 2478-2488
- Hierarchical Sketch Induction for Paraphrase Generation. Tom Hosking, Hao Tang, Mirella Lapata. 2489-2501
- Alignment-Augmented Consistent Translation for Multilingual Open Information Extraction. Keshav Kolluru, Muqeeth Mohammed, Shubham Mittal, Soumen Chakrabarti, Mausam. 2502-2517
- Text-to-Table: A New Way of Information Extraction. Xueqing Wu 0001, Jiacheng Zhang, Hang Li. 2518-2533
- Accelerating Code Search with Deep Hashing and Code Classification. Wenchao Gu, Yanlin Wang, Lun Du, Hongyu Zhang 0002, Shi Han, Dongmei Zhang, Michael R. Lyu. 2534-2544
- Other Roles Matter! Enhancing Role-Oriented Dialogue Summarization via Role Interactions. Haitao Lin, Junnan Zhu, Lu Xiang, Yu Zhou 0001, Jiajun Zhang, Chengqing Zong. 2545-2558
- ClarET: Pre-training a Correlation-Aware Context-To-Event Transformer for Event-Centric Generation and Classification. Yucheng Zhou, Tao Shen, Xiubo Geng, Guodong Long, Daxin Jiang. 2559-2575
- Measuring and Mitigating Name Biases in Neural Machine Translation. Jun Wang, Benjamin I. P. Rubinstein, Trevor Cohn. 2576-2590
- Understanding and Improving Sequence-to-Sequence Pretraining for Neural Machine Translation. Wenxuan Wang, Wenxiang Jiao, Yongchang Hao, Xing Wang 0007, Shuming Shi, Zhaopeng Tu, Michael R. Lyu. 2591-2600
- MSCTD: A Multimodal Sentiment Chat Translation Dataset. Yunlong Liang, Fandong Meng, Jinan Xu, Yufeng Chen, Jie Zhou. 2601-2613
- Learning Disentangled Textual Representations via Statistical Measures of Similarity. Pierre Colombo, Guillaume Staerman, Nathan Noiry, Pablo Piantanida. 2614-2630
- On the Sensitivity and Stability of Model Interpretations in NLP. Fan Yin, Zhouxing Shi, Cho-Jui Hsieh, Kai-Wei Chang. 2631-2647
- Down and Across: Introducing Crossword-Solving as a New NLP Benchmark. Saurabh Kulshreshtha, Olga Kovaleva, Namrata Shivagunde, Anna Rumshisky. 2648-2659
- Generating Data to Mitigate Spurious Correlations in Natural Language Inference Datasets. Yuxiang Wu, Matt Gardner 0001, Pontus Stenetorp, Pradeep Dasigi. 2660-2676
- GL-CLeF: A Global-Local Contrastive Learning Framework for Cross-lingual Spoken Language Understanding. Libo Qin, Qiguang Chen, Tianbao Xie, Qixin Li, Jian-Guang Lou, Wanxiang Che, Min-Yen Kan. 2677-2686
- Good Examples Make A Faster Learner: Simple Demonstration-based Learning for Low-resource NER. Dong-Ho Lee, Akshen Kadakia, Kangmin Tan, Mahak Agarwal, Xinyu Feng, Takashi Shibuya 0001, Ryosuke Mitani, Toshiyuki Sekiya, Jay Pujara, Xiang Ren. 2687-2700
- Contextual Representation Learning beyond Masked Language Modeling. Zhiyi Fu, Wangchunshu Zhou, Jingjing Xu, Hao Zhou 0012, Lei Li 0005. 2701-2714
- Efficient Hyper-parameter Search for Knowledge Graph Embedding. Yongqi Zhang, Zhanke Zhou, Quanming Yao, Yong Li. 2715-2735
- A Meta-framework for Spatiotemporal Quantity Extraction from Text. Qiang Ning, Ben Zhou, Hao Wu 0034, Haoruo Peng, Chuchu Fan, Matt Gardner 0001. 2736-2749
- Leveraging Visual Knowledge in Language Tasks: An Empirical Study on Intermediate Pre-training for Cross-Modal Knowledge Transfer. Woojeong Jin, Dong-Ho Lee, Chenguang Zhu, Jay Pujara, Xiang Ren. 2750-2762
- A Good Prompt Is Worth Millions of Parameters: Low-resource Prompt-based Learning for Vision-Language Models. Woojeong Jin, Yu Cheng 0001, Yelong Shen, Weizhu Chen, Xiang Ren 0001. 2763-2775
- Continual Few-shot Relation Learning via Embedding Space Regularization and Data Augmentation. Chengwei Qin, Shafiq Joty. 2776-2789
- Variational Graph Autoencoding as Cheap Supervision for AMR Coreference Resolution. Irene Li, Linfeng Song, Kun Xu 0005, Dong Yu 0001. 2790-2800
- Identifying Chinese Opinion Expressions with Extremely-Noisy Crowdsourcing Annotations. Xin Zhang, Guangwei Xu, Yueheng Sun, Meishan Zhang, XiaoBin Wang, Min Zhang. 2801-2813
- Sequence-to-Sequence Knowledge Graph Completion and Question Answering. Apoorv Saxena, Adrian Kochsiek, Rainer Gemulla. 2814-2828
- Learning to Mediate Disparities Towards Pragmatic Communication. Yuwei Bao, Sayan Ghosh, Joyce Chai. 2829-2842
- Unsupervised Corpus Aware Language Model Pre-training for Dense Passage Retrieval. Luyu Gao, Jamie Callan. 2843-2853
- Multimodal Dialogue Response Generation. Qingfeng Sun, Yujing Wang, Can Xu, Kai Zheng, Yaming Yang 0001, Huang Hu, Fei Xu, Jessica Zhang, Xiubo Geng, Daxin Jiang. 2854-2866
- CAKE: A Scalable Commonsense-Aware Framework For Multi-View Knowledge Graph Completion. Guanglin Niu, Bo Li 0006, Yongfei Zhang, Shiliang Pu. 2867-2877
- Confidence Based Bidirectional Global Context Aware Training Framework for Neural Machine Translation. Chulun Zhou, Fandong Meng, Jie Zhou, Min Zhang, Hongji Wang, Jinsong Su. 2878-2889
- BRIO: Bringing Order to Abstractive Summarization. Yixin Liu, Pengfei Liu, Dragomir R. Radev, Graham Neubig. 2890-2903
- Leveraging Relaxed Equilibrium by Lazy Transition for Sequence Modeling. Xi Ai, Bin Fang. 2904-2924
- FIBER: Fill-in-the-Blanks as a Challenging Video Understanding Evaluation FrameworkSantiago Castro, Ruoyao Wang, Pingxuan Huang, Ian Stewart, Oana Ignat, Nan Liu, Jonathan C. Stroud, Rada Mihalcea. 2925-2940 [doi]
- KenMeSH: Knowledge-enhanced End-to-end Biomedical Text LabellingXindi Wang, Robert E. Mercer, Frank Rudzicz. 2941-2951 [doi]
- A Taxonomy of Empathetic Questions in Social DialogsEkaterina Svikhnushina, Iuliana Voinea, Anuradha Welivita, Pearl Pu. 2952-2973 [doi]
- Enhanced Multi-Channel Graph Convolutional Network for Aspect Sentiment Triplet ExtractionHao Chen, Zepeng Zhai, Fangxiang Feng, Ruifan Li, Xiaojie Wang. 2974-2985 [doi]
- ProtoTEx: Explaining Model Decisions with Prototype TensorsAnubrata Das 0001, Chitrank Gupta, Venelin Kovatchev, Matthew Lease, Junyi Jessy Li. 2986-2997 [doi]
- Show Me More Details: Discovering Hierarchies of Procedures from Semi-structured Web DataShuyan Zhou, Li Zhang, Yue Yang, Qing Lyu 0001, Pengcheng Yin, Chris Callison-Burch, Graham Neubig. 2998-3012 [doi]
- Cross-Modal Discrete Representation LearningAlexander H. Liu, SouYoung Jin, Cheng-I Lai, Andrew Rouditchenko, Aude Oliva, James R. Glass. 3013-3035 [doi]
- Improving Event Representation via Simultaneous Weakly Supervised Contrastive Learning and ClusteringJun Gao, Wei Wang, Changlong Yu, Huan Zhao, Wilfred Ng, Ruifeng Xu. 3036-3049 [doi]
- Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language RepresentationsRobert Wolfe, Aylin Caliskan. 3050-3061 [doi]
- ConTinTin: Continual Learning from Task InstructionsWenpeng Yin 0001, Jia Li, Caiming Xiong. 3062-3072 [doi]
- Automated Crossword SolvingEric Wallace, Nicholas Tomlin, Albert Xu, Kevin Yang, Eshaan Pathak, Matthew Ginsberg, Dan Klein. 3073-3085 [doi]
- Learned Incremental Representations for ParsingNikita Kitaev, Thomas Lu, Dan Klein. 3086-3095 [doi]
- Knowledge Enhanced Reflection Generation for Counseling DialoguesSiqi Shen, Verónica Pérez-Rosas, Charles Welch, Soujanya Poria, Rada Mihalcea. 3096-3107 [doi]
- Misinfo Reaction Frames: Reasoning about Readers' Reactions to News HeadlinesSaadia Gabriel, Skyler Hallinan, Maarten Sap, Pemi Nguyen, Franziska Roesner, Eunsol Choi, Yejin Choi. 3108-3127 [doi]
- On Continual Model Refinement in Out-of-Distribution Data StreamsBill Yuchen Lin, Sida Wang, Xi Victoria Lin, Robin Jia, Lin Xiao, Xiang Ren, Scott Yih. 3128-3139 [doi]
- Achieving Conversational Goals with Unsupervised Post-hoc Knowledge InjectionBodhisattwa Prasad Majumder, Harsh Jhamtani, Taylor Berg-Kirkpatrick, Julian J. McAuley. 3140-3153 [doi]
- Generated Knowledge Prompting for Commonsense ReasoningJiacheng Liu 0010, Alisa Liu, Ximing Lu, Sean Welleck, Peter West, Ronan Le Bras, Yejin Choi, Hannaneh Hajishirzi. 3154-3169 [doi]
- Training Data is More Valuable than You Think: A Simple and Effective Method by Retrieving from Training DataShuohang Wang, Yichong Xu, Yuwei Fang, Yang Liu, Siqi Sun, Ruochen Xu, Chenguang Zhu 0001, Michael Zeng 0001. 3170-3179 [doi]
- Life after BERT: What do Other Muppets Understand about Language?Vladislav Lialin, Kevin Zhao, Namrata Shivagunde, Anna Rumshisky. 3180-3193 [doi]
- Tailor: Generating and Perturbing Text with Semantic ControlsAlexis Ross, Tongshuang Wu, Hao Peng, Matthew E. Peters, Matt Gardner 0001. 3194-3213 [doi]
- TruthfulQA: Measuring How Models Mimic Human FalsehoodsStephanie Lin, Jacob Hilton, Owain Evans. 3214-3252 [doi]
- Adaptive Testing and Debugging of NLP ModelsMarco Túlio Ribeiro, Scott M. Lundberg. 3253-3267 [doi]
- Right for the Right Reason: Evidence Extraction for Trustworthy Tabular ReasoningVivek Gupta 0001, Shuo Zhang, Alakananda Vempala, Yujie He, Temma Choji, Vivek Srikumar. 3268-3283 [doi]
- Interactive Word Completion for Plains CreeWilliam Lane, Atticus Harrigan, Antti Arppe. 3284-3294 [doi]
- LAGr: Label Aligned Graphs for Better Systematic Generalization in Semantic ParsingDora Jambor, Dzmitry Bahdanau. 3295-3308 [doi]
- ToxiGen: A Large-Scale Machine-Generated Dataset for Adversarial and Implicit Hate Speech DetectionThomas Hartvigsen, Saadia Gabriel, Hamid Palangi, Maarten Sap, Dipankar Ray, Ece Kamar. 3309-3326 [doi]
- Direct Speech-to-Speech Translation With Discrete UnitsAnn Lee, Peng-Jen Chen, Changhan Wang, Jiatao Gu, Sravya Popuri, Xutai Ma, Adam Polyak, Yossi Adi, Qing He, Yun Tang, Juan Pino, Wei-Ning Hsu. 3327-3339 [doi]
- Hallucinated but Factual! Inspecting the Factuality of Hallucinations in Abstractive SummarizationMeng Cao, Yue Dong, Jackie Chi Kit Cheung. 3340-3354 [doi]
- EntSUM: A Data Set for Entity-Centric Extractive SummarizationMounica Maddela, Mayank Kulkarni, Daniel Preotiuc-Pietro. 3355-3366 [doi]
- Sentence-level Privacy for Document EmbeddingsCasey Meehan, Khalil Mrini, Kamalika Chaudhuri. 3367-3380 [doi]
- Dataset Geography: Mapping Language Data to Language UsersFahim Faisal, Yinkai Wang, Antonios Anastasopoulos. 3381-3411 [doi]
- ILDAE: Instance-Level Difficulty Analysis of Evaluation DataNeeraj Varshney, Swaroop Mishra, Chitta Baral. 3412-3425 [doi]
- Image Retrieval from Contextual DescriptionsBenno Krojer, Vaibhav Adlakha, Vibhav Vineet, Yash Goyal, Edoardo Maria Ponti, Siva Reddy. 3426-3440 [doi]
- Multilingual Molecular Representation Learning via Contrastive Pre-trainingZhihui Guo, Pramod Kumar Sharma 0003, Andy Martinez, Liang Du, Robin Abraham. 3441-3453 [doi]
- Investigating Failures of Automatic Translation in the Case of Unambiguous GenderAdi Renduchintala, Adina Williams. 3454-3469 [doi]
- Cross-Task Generalization via Natural Language Crowdsourcing InstructionsSwaroop Mishra, Daniel Khashabi, Chitta Baral, Hannaneh Hajishirzi. 3470-3487 [doi]
- Imputing Out-of-Vocabulary Embeddings with LOVE Makes Language Models Robust with Little CostLihu Chen, Gaël Varoquaux, Fabian M. Suchanek. 3488-3504 [doi]
- NumGLUE: A Suite of Fundamental yet Challenging Mathematical Reasoning TasksSwaroop Mishra, Arindam Mitra, Neeraj Varshney, Bhavdeep Singh Sachdeva, Peter Clark, Chitta Baral, Ashwin Kalyan. 3505-3523 [doi]
- Upstream Mitigation Is Not All You Need: Testing the Bias Transfer Hypothesis in Pre-Trained Language ModelsRyan Steed, Swetasudha Panda, Ari Kobren, Michael L. Wick. 3524-3542 [doi]
- Improving Multi-label Malevolence Detection in Dialogues through Multi-faceted Label Correlation EnhancementYangjun Zhang, Pengjie Ren, Wentao Deng, Zhumin Chen, Maarten de Rijke. 3543-3555 [doi]
- How Do We Answer Complex Questions: Discourse Structure of Long-form AnswersFangyuan Xu, Junyi Jessy Li, Eunsol Choi. 3556-3572 [doi]
- Understanding Iterative Revision from Human-Written TextWanyu Du, Vipul Raheja, Dhruv Kumar 0005, Zae Myung Kim, Melissa Lopez, Dongyeop Kang. 3573-3590 [doi]
- Making Transformers Solve Compositional TasksSantiago Ontañón, Joshua Ainslie, Zachary Fisher, Vaclav Cvicek. 3591-3607 [doi]
- Can Transformer be Too Compositional? Analysing Idiom Processing in Neural Machine TranslationVerna Dankers, Christopher G. Lucas, Ivan Titov. 3608-3626 [doi]
- ConditionalQA: A Complex Reading Comprehension Dataset with Conditional AnswersHaitian Sun, William W. Cohen, Ruslan Salakhutdinov. 3627-3637 [doi]
- Prompt-free and Efficient Few-shot Learning with Language ModelsRabeeh Karimi Mahabadi, Luke Zettlemoyer, James Henderson 0001, Lambert Mathias, Marzieh Saeidi, Veselin Stoyanov, Majid Yazdani. 3638-3652 [doi]
- Continual Sequence Generation with Adaptive Compositional ModulesYanzhe Zhang, Xuezhi Wang 0002, Diyi Yang. 3653-3667 [doi]
- An Investigation of the (In)effectiveness of Counterfactually Augmented DataNitish Joshi, He He. 3668-3681 [doi]
- Inducing Positive Perspectives with Text ReframingCaleb Ziems, Minzhi Li, Anthony Zhang, Diyi Yang. 3682-3700 [doi]
- VALUE: Understanding Dialect Disparity in NLUCaleb Ziems, Jiaao Chen, Camille Harris, Jessica Anderson, Diyi Yang. 3701-3720 [doi]
- From the Detection of Toxic Spans in Online Discussions to the Analysis of Toxic-to-Civil TransferJohn Pavlopoulos, Leo Laugier, Alexandros Xenos, Jeffrey Sorensen, Ion Androutsopoulos. 3721-3734 [doi]
- FormNet: Structural Encoding beyond Sequential Modeling in Form Document Information ExtractionChen-Yu Lee, Chun-Liang Li, Timothy Dozat, Vincent Perot, Guolong Su, Nan Hua, Joshua Ainslie, Renshen Wang, Yasuhisa Fujii, Tomas Pfister. 3735-3754 [doi]
- The Moral Integrity Corpus: A Benchmark for Ethical Dialogue SystemsCaleb Ziems, Jane A. Yu, Yi-Chia Wang, Alon Y. Halevy, Diyi Yang. 3755-3773 [doi]
- Token Dropping for Efficient BERT PretrainingLe Hou, Richard Yuanzhe Pang, Tianyi Zhou, Yuexin Wu, Xinying Song, Xiaodan Song, Denny Zhou. 3774-3784 [doi]
- DialFact: A Benchmark for Fact-Checking in DialoguePrakhar Gupta, Chien-Sheng Wu, Wenhao Liu, Caiming Xiong. 3785-3801 [doi]
- The Trade-offs of Domain Adaptation for Neural Language ModelsDavid Grangier, Dan Iter. 3802-3813 [doi]
- Towards Afrocentric NLP for African Languages: Where We Are and Where We Can GoIfe Adebara, Muhammad Abdul-Mageed. 3814-3841 [doi]
- Ensembling and Knowledge Distilling of Large Sequence Taggers for Grammatical Error CorrectionMaksym Tarnavskyi, Artem N. Chernodub, Kostiantyn Omelianchuk. 3842-3852 [doi]
- Speaker Information Can Guide Models to Better Inductive Biases: A Case Study On Predicting Code-SwitchingAlissa Ostapenko, Shuly Wintner, Melinda Fricke, Yulia Tsvetkov. 3853-3867 [doi]
- Detecting Unassimilated Borrowings in Spanish: An Annotated Corpus and Approaches to ModelingElena Álvarez Mellado, Constantine Lignos. 3868-3888 [doi]
- Is Attention Explanation? An Introduction to the DebateAdrien Bibal, Rémi Cardon, David Alfter, Rodrigo Wilkens, Xiaoou Wang, Thomas François, Patrick Watrin. 3889-3900 [doi]
- There Are a Thousand Hamlets in a Thousand People's Eyes: Enhancing Knowledge-grounded Dialogue with Personal MemoryTingchen Fu, Xueliang Zhao, Chongyang Tao, Ji-Rong Wen, Rui Yan 0001. 3901-3913 [doi]
- Neural Pipeline for Zero-Shot Data-to-Text GenerationZdenek Kasner, Ondrej Dusek. 3914-3932 [doi]
- Not always about you: Prioritizing community needs when developing endangered language technologyZoey Liu, Crystal Richardson, Richard J. Hatcher, Emily Prud'hommeaux. 3933-3944 [doi]
- Automatic Identification and Classification of Bragging in Social MediaMali Jin, Daniel Preotiuc-Pietro, A. Seza Dogruöz, Nikolaos Aletras. 3945-3959 [doi]
- Automatic Error Analysis for Document-level Information ExtractionAliva Das, Xinya Du, Barry Wang, Kejian Shi, Jiayuan Gu, Thomas Porter, Claire Cardie. 3960-3975 [doi]
- Learning Functional Distributional Semantics with Visual DataYinhong Liu, Guy Emerson. 3976-3988 [doi]
- ePiC: Employing Proverbs in Context as a Benchmark for Abstract Language UnderstandingSayan Ghosh, Shashank Srivastava. 3989-4004 [doi]
- Chart-to-Text: A Large-Scale Benchmark for Chart SummarizationShankar Kantharaj, Rixie Tiffany Ko Leong, Xiang Lin, Ahmed Masry, Megh Thakkar, Enamul Hoque, Shafiq Joty. 4005-4023 [doi]
- Characterizing Idioms: Conventionality and ContingencyMichaela Socolof, Jackie Chi Kit Cheung, Michael Wagner 0019, Timothy J. O'Donnell. 4024-4037 [doi]
- Bag-of-Words vs. Graph vs. Sequence in Text Classification: Questioning the Necessity of Text-Graphs and the Surprising Strength of a Wide MLPLukas Galke, Ansgar Scherp. 4038-4051 [doi]
- Generative Pretraining for Paraphrase EvaluationJack Weston, Raphael Lenain, Udeepa Meepegama, Emil Fristed. 4052-4073 [doi]
- Incorporating Stock Market Signals for Twitter Stance DetectionCostanza Conforti, Jakob Berndt, Mohammad Taher Pilehvar, Chryssi Giannitsarou, Flavio Toxvaerd, Nigel Collier. 4074-4091 [doi]
- Multilingual Mix: Example Interpolation Improves Multilingual Neural Machine TranslationYong Cheng, Ankur Bapna, Orhan Firat, Yuan Cao 0007, Pidong Wang, Wolfgang Macherey. 4092-4102 [doi]
- Word Segmentation as Unsupervised Constituency ParsingRaquel G. Alhama. 4103-4112 [doi]
- SafetyKit: First Aid for Measuring Safety in Open-domain Conversational SystemsEmily Dinan, Gavin Abercrombie, A. Stevie Bergman, Shannon L. Spruit, Dirk Hovy, Y-Lan Boureau, Verena Rieser. 4113-4133 [doi]
- Zero-Shot Cross-lingual Semantic ParsingTom Sherborne, Mirella Lapata. 4134-4153 [doi]
- The Paradox of the Compositionality of Natural Language: A Neural Machine Translation Case StudyVerna Dankers, Elia Bruni, Dieuwke Hupkes. 4154-4175 [doi]
- Multilingual Document-Level Translation Enables Zero-Shot Transfer From Sentences to DocumentsBiao Zhang, Ankur Bapna, Melvin Johnson, Ali Dabirmoghaddam, Naveen Arivazhagan, Orhan Firat. 4176-4192 [doi]
- Cross-Lingual Phrase RetrievalHeqi Zheng, Xiao Zhang, Zewen Chi, Heyan Huang, Yan Tan, Tian Lan, Wei Wei 0002, Xian-Ling Mao. 4193-4204 [doi]
- Improving Compositional Generalization with Self-Training for Data-to-Text GenerationSanket Vaibhav Mehta, Jinfeng Rao, Yi Tay, Mihir Kale, Ankur Parikh, Emma Strubell. 4205-4219 [doi]
- MMCoQA: Conversational Question Answering over Text, Tables, and ImagesYongqi Li, Wenjie Li, Liqiang Nie. 4220-4231 [doi]
- Effective Token Graph Modeling using a Novel Labeling Strategy for Structured Sentiment AnalysisWenxuan Shi, Fei Li, Jingye Li, Hao Fei 0001, Donghong Ji. 4232-4241 [doi]
- PromDA: Prompt-based Data Augmentation for Low-Resource NLU TasksYufei Wang 0003, Can Xu, Qingfeng Sun, Huang Hu, Chongyang Tao, Xiubo Geng, Daxin Jiang. 4242-4255 [doi]
- Disentangled Sequence to Sequence Learning for Compositional GeneralizationHao Zheng, Mirella Lapata. 4256-4268 [doi]
- RST Discourse Parsing with Second-Stage EDU-Level Pre-trainingNan Yu, Meishan Zhang, Guohong Fu, Min Zhang. 4269-4280 [doi]
- SimKGC: Simple Contrastive Knowledge Graph Completion with Pre-trained Language ModelsLiang Wang 0046, Wei Zhao, Zhuoyu Wei, Jingming Liu. 4281-4294 [doi]
- Do Transformer Models Show Similar Attention Patterns to Task-Specific Human Gaze?Oliver Eberle, Stephanie Brandl, Jonas Pilot, Anders Søgaard. 4295-4309 [doi]
- LexGLUE: A Benchmark Dataset for Legal Language Understanding in EnglishIlias Chalkidis, Abhik Jana, Dirk Hartung, Michael J. Bommarito II, Ion Androutsopoulos, Daniel Martin Katz, Nikolaos Aletras. 4310-4330 [doi]
- DiBiMT: A Novel Benchmark for Measuring Word Sense Disambiguation Biases in Machine TranslationNiccolò Campolungo, Federico Martelli, Francesco Saina, Roberto Navigli. 4331-4352 [doi]
- Improving Word Translation via Two-Stage Contrastive LearningYaoyiran Li, Fangyu Liu 0001, Nigel Collier, Anna Korhonen, Ivan Vulic. 4353-4374 [doi]
- Scheduled Multi-task Learning for Neural Chat TranslationYunlong Liang, Fandong Meng, Jinan Xu, Yufeng Chen 0005, Jie Zhou 0016. 4375-4388 [doi]
- FairLex: A Multilingual Benchmark for Evaluating Fairness in Legal Text ProcessingIlias Chalkidis, Tommaso Pasini, Sheng Zhang 0022, Letizia Tomada, Sebastian Felix Schwemer, Anders Søgaard. 4389-4406 [doi]
- Towards Abstractive Grounded Summarization of Podcast TranscriptsKaiqiang Song, Chen Li, Xiaoyang Wang, Dong Yu, Fei Liu 0004. 4407-4418 [doi]
- FiNER: Financial Numeric Entity Recognition for XBRL TaggingLefteris Loukas, Manos Fergadiotis, Ilias Chalkidis, Eirini Spyropoulou, Prodromos Malakasiotis, Ion Androutsopoulos, Georgios Paliouras. 4419-4431 [doi]
- Keywords and Instances: A Hierarchical Contrastive Learning Framework Unifying Hybrid Granularities for Text GenerationMingzhe Li, Xiexiong Lin, Xiuying Chen, Jinxiong Chang, Qishen Zhang, Feng Wang, Taifeng Wang, Zhongyi Liu, Wei Chu, Dongyan Zhao 0001, Rui Yan 0001. 4432-4441 [doi]
- EPT-X: An Expression-Pointer Transformer model that generates eXplanations for numbersBugeun Kim, Kyung Seo Ki, Sangkyu Rhim, Gahgene Gweon. 4442-4458 [doi]
- Identifying the Human Values behind ArgumentsJohannes Kiesel, Milad Alshomary, Nicolas Handke, Xiaoni Cai, Henning Wachsmuth, Benno Stein 0001. 4459-4471 [doi]
- BenchIE: A Framework for Multi-Faceted Fact-Based Open Information Extraction EvaluationKiril Gashteovski, Mingying Yu, Bhushan Kotnis, Carolin Lawrence, Mathias Niepert, Goran Glavas. 4472-4490 [doi]
- Leveraging Unimodal Self-Supervised Learning for Multimodal Audio-Visual Speech RecognitionXichen Pan, Peiyu Chen, Yichen Gong, Helong Zhou, Xinbing Wang, Zhouhan Lin. 4491-4503 [doi]
- SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive SummarizationMathieu Ravaut, Shafiq Joty, Nancy F. Chen. 4504-4524 [doi]
- Understanding Multimodal Procedural Knowledge by Sequencing Multimodal Instructional ManualsTe-Lin Wu, Alexander Spangher, Pegah Alipoormolabashi, Marjorie Freedman, Ralph M. Weischedel, Nanyun Peng. 4525-4542 [doi]
- Zoom Out and Observe: News Environment Perception for Fake News DetectionQiang Sheng, Juan Cao, Xueyao Zhang, Rundong Li, Danding Wang, Yongchun Zhu. 4543-4556 [doi]
- Divide and Rule: Effective Pre-Training for Context-Aware Multi-Encoder Translation ModelsLorenzo Lupo, Marco Dinarelli, Laurent Besacier. 4557-4572 [doi]
- Saliency as Evidence: Event Detection with Trigger Saliency AttributionJian Liu, Yufeng Chen, Jinan Xu. 4573-4585 [doi]
- SRL4E - Semantic Role Labeling for Emotions: A Unified Evaluation FrameworkCesare Campagnano, Simone Conia, Roberto Navigli. 4586-4601 [doi]
- Context Matters: A Pragmatic Study of PLMs' Negation UnderstandingReto Gubelmann, Siegfried Handschuh. 4602-4621 [doi]
- Probing for Predicate Argument Structures in Pretrained Language ModelsSimone Conia, Roberto Navigli. 4622-4632 [doi]
- Multilingual Generative Language Models for Zero-Shot Cross-Lingual Event Argument ExtractionKuan-Hao Huang, I-Hung Hsu, Prem Natarajan, Kai-Wei Chang, Nanyun Peng. 4633-4646 [doi]
- Identifying Moments of Change from Longitudinal User TextAdam Tsakalidis, Federico Nanni, Anthony Hills, Jenny Chim, Jiayu Song, Maria Liakata. 4647-4660 [doi]
- Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue SystemYixuan Su, Lei Shu, Elman Mansimov, Arshit Gupta, Deng Cai 0002, Yi-An Lai, Yi Zhang. 4661-4676 [doi]
- Graph Enhanced Contrastive Learning for Radiology Findings SummarizationJinpeng Hu, Zhuo Li, Zhihong Chen, Zhen Li, Xiang Wan, Tsung-Hui Chang. 4677-4688 [doi]
- Semi-Supervised Formality Style Transfer with Consistency TrainingAo Liu, An Wang, Naoaki Okazaki. 4689-4701 [doi]
- Cross-Lingual Ability of Multilingual Masked Language Models: A Study of Language StructureYuan Chai, Yaobo Liang, Nan Duan. 4702-4712 [doi]
- Rare and Zero-shot Word Sense Disambiguation using Z-ReweightingYing Su, Hongming Zhang, Yangqiu Song, Tong Zhang. 4713-4723 [doi]
- Nibbling at the Hard Core of Word Sense DisambiguationMarco Maru, Simone Conia, Michele Bevilacqua, Roberto Navigli. 4724-4737 [doi]
- Large Scale Substitution-based Word Sense InductionMatan Eyal, Shoval Sadde, Hillel Taub-Tabib, Yoav Goldberg. 4738-4752 [doi]
- Can Synthetic Translations Improve Bitext Quality?Eleftheria Briakou, Marine Carpuat. 4753-4766 [doi]
- Unsupervised Dependency Graph NetworkYikang Shen, Shawn Tan, Alessandro Sordoni, Peng Li, Jie Zhou, Aaron C. Courville. 4767-4784 [doi]
- WikiDiverse: A Multimodal Entity Linking Dataset with Diversified Contextual Topics and Entity TypesXuwu Wang, Junfeng Tian, Min Gui, Zhixu Li, Rui Wang, Ming Yan, Lihan Chen, Yanghua Xiao. 4785-4797 [doi]
- Rewire-then-Probe: A Contrastive Recipe for Probing Biomedical Knowledge of Pre-trained Language ModelsZaiqiao Meng, Fangyu Liu 0001, Ehsan Shareghi, Yixuan Su, Charlotte Collins, Nigel Collier. 4798-4810 [doi]
- Fine- and Coarse-Granularity Hybrid Self-Attention for Efficient BERTJing Zhao, Yifan Wang, Junwei Bao 0001, Youzheng Wu, Xiaodong He 0002. 4811-4820 [doi]
- Compression of Generative Pre-trained Language Models via QuantizationChaofan Tao, Lu Hou, Wei Zhang, Lifeng Shang, Xin Jiang, Qun Liu, Ping Luo, Ngai Wong. 4821-4836 [doi]
- Visual-Language Navigation Pretraining via Prompt-based Environmental Self-explorationXiwen Liang, Fengda Zhu, Lingling Li, Hang Xu, Xiaodan Liang. 4837-4851 [doi]
- DialogVED: A Pre-trained Latent Variable Encoder-Decoder Model for Dialog Response GenerationWei Chen 0088, Yeyun Gong, Song Wang, Bolun Yao, Weizhen Qi, Zhongyu Wei, Xiaowu Hu, Bartuer Zhou, Yi Mao, Weizhu Chen, Biao Cheng, Nan Duan. 4852-4864 [doi]
- Contextual Fine-to-Coarse Distillation for Coarse-grained Response Selection in Open-Domain ConversationsWei Chen, Yeyun Gong, Can Xu, Huang Hu, Bolun Yao, Zhongyu Wei, Zhihao Fan, Xiaowu Hu, Bartuer Zhou, Biao Cheng, Daxin Jiang, Nan Duan. 4865-4877 [doi]
- Textomics: A Dataset for Genomics Data Summary GenerationMu-Chun Wang, Zixuan Liu, Sheng Wang. 4878-4891 [doi]
- A Contrastive Framework for Learning Sentence Representations from Pairwise and Triple-wise Perspective in Angular SpaceYuhao Zhang, Hongji Zhu, Yongliang Wang, Nan Xu, Xiaobo Li, Binqiang Zhao. 4892-4903 [doi]
- Packed Levitated Marker for Entity and Relation ExtractionDeming Ye, Yankai Lin, Peng Li 0030, Maosong Sun. 4904-4917 [doi]
- An Interpretable Neuro-Symbolic Reasoning Framework for Task-Oriented Dialogue GenerationShiquan Yang, Rui Zhang, Sarah M. Erfani, Jey Han Lau. 4918-4935 [doi]
- Impact of Evaluation Methodologies on Code SummarizationPengyu Nie, Jiyang Zhang, Junyi Jessy Li, Raymond J. Mooney, Milos Gligoric. 4936-4960 [doi]
- KG-FiD: Infusing Knowledge Graph in Fusion-in-Decoder for Open-Domain Question AnsweringDonghan Yu, Chenguang Zhu 0001, Yuwei Fang, Wenhao Yu 0002, Shuohang Wang, Yichong Xu, Xiang Ren, Yiming Yang, Michael Zeng 0001. 4961-4974 [doi]
- Which side are you on? Insider-Outsider classification in conspiracy-theoretic social mediaPavan Holur, Tianyi Wang, Shadi Shahsavari, Timothy R. Tangherlini, Vwani Roychowdhury. 4975-4987 [doi]
- Learning From Failure: Data Capture in an Australian Aboriginal CommunityÉric Le Ferrand, Steven Bird, Laurent Besacier. 4988-4998 [doi]
- Deep Inductive Logic Reasoning for Multi-Hop Reading ComprehensionWenya Wang, Sinno Jialin Pan. 4999-5009 [doi]
- CICERO: A Dataset for Contextualized Commonsense Inference in DialoguesDeepanway Ghosal, Siqi Shen, Navonil Majumder, Rada Mihalcea, Soujanya Poria. 5010-5028 [doi]
- A Comparative Study of Faithfulness Metrics for Model Interpretability MethodsChun Sik Chan, Huanqi Kong, Guanqing Liang. 5029-5038 [doi]
- SPoT: Better Frozen Model Adaptation through Soft Prompt TransferTu Vu, Brian Lester, Noah Constant, Rami Al-Rfou', Daniel Cer. 5039-5059 [doi]
- Pass off Fish Eyes for Pearls: Attacking Model Selection of Pre-trained ModelsBiru Zhu, Yujia Qin, Fanchao Qi, Yangdong Deng, Zhiyuan Liu, Maosong Sun, Ming Gu 0001. 5060-5072 [doi]
- Educational Question Generation of Children Storybooks via Question Type Distribution Learning and Event-centric SummarizationZhenjie Zhao, Yufang Hou 0001, Dakuo Wang, Mo Yu, Chengzhong Liu, Xiaojuan Ma. 5073-5085 [doi]
- HeterMPC: A Heterogeneous Graph Neural Network for Response Generation in Multi-Party ConversationsJia-Chen Gu, Chao-Hong Tan, Chongyang Tao, Zhen-Hua Ling, Huang Hu, Xiubo Geng, Daxin Jiang. 5086-5097 [doi]
- The patient is more dead than alive: exploring the current state of the multi-document summarisation of the biomedical literatureYulia Otmakhova 0001, Karin Verspoor, Timothy Baldwin, Jey Han Lau. 5098-5111 [doi]
- A Multi-Document Coverage Reward for RELAXed Multi-Document SummarizationJacob Parnell, Inigo Jauregi Unanue, Massimo Piccardi. 5112-5128 [doi]
- KNN-Contrastive Learning for Out-of-Domain Intent ClassificationYunhua Zhou, Peiju Liu, Xipeng Qiu. 5129-5141 [doi]
- A Neural Network Architecture for Program Understanding Inspired by Human BehaviorsRenyu Zhu, Lei Yuan, Xiang Li, Ming Gao 0001, Wenyuan Cai. 5142-5153 [doi]
- FaVIQ: FAct Verification from Information-seeking QuestionsJungsoo Park, Sewon Min, Jaewoo Kang, Luke Zettlemoyer, Hannaneh Hajishirzi. 5154-5166 [doi]
- Simulating Bandit Learning from User Feedback for Extractive Question AnsweringGe Gao, Eunsol Choi, Yoav Artzi. 5167-5179 [doi]
- Beyond Goldfish Memory: Long-Term Open-Domain ConversationJing Xu, Arthur Szlam, Jason Weston. 5180-5197 [doi]
- ReCLIP: A Strong Zero-Shot Baseline for Referring Expression ComprehensionSanjay Subramanian, William Merrill, Trevor Darrell, Matt Gardner 0001, Sameer Singh 0001, Anna Rohrbach. 5198-5215 [doi]
- Dynamic Prefix-Tuning for Generative Template-based Event ExtractionXiao Liu 0029, Heyan Huang, Ge Shi, Bo Wang. 5216-5228 [doi]
- E-LANG: Energy-Based Joint Inferencing of Super and Swift Language ModelsMohammad Akbari, Amin Banitalebi-Dehkordi, Yong Zhang. 5229-5244 [doi]
- PRIMERA: Pyramid-based Masked Sentence Pre-training for Multi-document SummarizationWen Xiao, Iz Beltagy, Giuseppe Carenini, Arman Cohan. 5245-5263 [doi]
- Dynamic Global Memory for Document-level Argument ExtractionXinya Du, Sha Li, Heng Ji. 5264-5275 [doi]
- Measuring the Impact of (Psycho-)Linguistic and Readability Features and Their Spill Over Effects on the Prediction of Eye Movement PatternsDaniel Wiechmann, Elma Kerz. 5276-5290 [doi]
- Alternative Input Signals Ease Transfer in Multilingual Machine TranslationSimeng Sun, Angela Fan, James Cross, Vishrav Chaudhary, Chau Tran, Philipp Koehn, Francisco Guzmán. 5291-5305 [doi]
- Phone-ing it in: Towards Flexible Multi-Modal Language Model Training by Phonetic Representations of DataColin Leong, Daniel Whitenack. 5306-5315 [doi]
- Noisy Channel Language Model Prompting for Few-Shot Text ClassificationSewon Min, Mike Lewis, Hannaneh Hajishirzi, Luke Zettlemoyer. 5316-5330 [doi]
- Multilingual unsupervised sequence segmentation transfers to extremely low-resource languagesC. M. Downey, Shannon Drizin, Levon Haroutunian, Shivin Thukral. 5331-5346 [doi]
- KinyaBERT: a Morphology-aware Kinyarwanda Language ModelAntoine Nzeyimana, Andre Niyongabo Rubungo. 5347-5363 [doi]
- On the Calibration of Pre-trained Language Models using Mixup Guided by Area Under the Margin and SaliencySeoyeon Park, Cornelia Caragea. 5364-5374 [doi]
- IMPLI: Investigating NLI Models' Performance on Figurative LanguageKevin Stowe, Prasetya Utama, Iryna Gurevych. 5375-5388 [doi]
- QAConv: Question Answering on Informative ConversationsChien-Sheng Wu, Andrea Madotto, Wenhao Liu, Pascale Fung, Caiming Xiong. 5389-5411 [doi]
- Prix-LM: Pretraining for Multilingual Knowledge Base ConstructionWenxuan Zhou, Fangyu Liu 0001, Ivan Vulic, Nigel Collier, Muhao Chen. 5412-5424 [doi]
- Semantic Composition with PSHRG for Derivation Tree Reconstruction from Graph-Based Meaning RepresentationsChun Hei Lo, Wai Lam, Hong Cheng 0001. 5425-5439 [doi]
- HOLM: Hallucinating Objects with Language Models for Referring Expression Recognition in Partially-Observed ScenesVolkan Cirik, Louis-Philippe Morency, Taylor Berg-Kirkpatrick. 5440-5453 [doi]
- Multi Task Learning For Zero Shot Performance Prediction of Multilingual ModelsKabir Ahuja, Shanu Kumar, Sandipan Dandapat, Monojit Choudhury. 5454-5467 [doi]
- ∞-former: Infinite Memory TransformerPedro Henrique Martins, Zita Marinho, André F. T. Martins. 5468-5485 [doi]
- Systematic Inequalities in Language Technology Performance across the World's LanguagesDamián E. Blasi, Antonios Anastasopoulos, Graham Neubig. 5486-5505 [doi]
- CaMEL: Case Marker Extraction without LabelsLeonie Weissweiler, Valentin Hofmann, Masoud Jalili Sabet, Hinrich Schütze. 5506-5516 [doi]
- Improving Generalizability in Implicitly Abusive Language Detection with Concept Activation VectorsIsar Nejadgholi, Kathleen C. Fraser, Svetlana Kiritchenko. 5517-5529 [doi]
- Reports of personal experiences and stories in argumentation: datasets and analysisNeele Falk, Gabriella Lapesa. 5530-5553 [doi]
- Non-neural Models Matter: a Re-evaluation of Neural Referring Expression Generation SystemsFahime Same, Guanyi Chen, Kees van Deemter. 5554-5567 [doi]
- Bridging the Generalization Gap in Text-to-SQL Parsing with Schema ExpansionChen Zhao, Yu Su 0001, Adam Pauls, Emmanouil Antonios Platanios. 5568-5578 [doi]
- Predicate-Argument Based Bi-Encoder for Paraphrase IdentificationQiwei Peng 0002, David J. Weir, Julie Weeds, Yekun Chai. 5579-5589 [doi]
- MINER: Improving Out-of-Vocabulary Named Entity Recognition from an Information Theoretic PerspectiveXiao Wang, Shihan Dou, Limao Xiong, Yicheng Zou, Qi Zhang, Tao Gui, Liang Qiao 0001, Zhanzhan Cheng, Xuanjing Huang. 5590-5600 [doi]
- Leveraging Wikipedia article evolution for promotional tone detectionChristine de Kock, Andreas Vlachos 0001. 5601-5613 [doi]
- From text to talk: Harnessing conversational corpora for humane and diversity-aware language technologyMark Dingemanse, Andreas Liesenfeld. 5614-5633 [doi]
- Flooding-X: Improving BERT's Resistance to Adversarial Attacks via Loss-Restricted Fine-TuningQin Liu, Rui Zheng, Bao Rong, Jingyi Liu, Zhihua Liu, Zhanzhan Cheng, Liang Qiao 0001, Tao Gui, Qi Zhang 0001, Xuanjing Huang. 5634-5644 [doi]
- RoMe: A Robust Metric for Evaluating Natural Language GenerationMd. Rashad Al Hasan Rony, Liubov Kovriguina, Debanjan Chaudhuri, Ricardo Usbeck, Jens Lehmann 0001. 5645-5657 [doi]
- Finding Structural Knowledge in Multimodal-BERTVictor Milewski, Miryam de Lhoneux, Marie-Francine Moens. 5658-5671 [doi]
- Fully Hyperbolic Neural NetworksWeize Chen, Xu Han, Yankai Lin, Hexu Zhao, Zhiyuan Liu, Peng Li, Maosong Sun, Jie Zhou. 5672-5686 [doi]
- Neural Machine Translation with Phrase-Level Universal Visual RepresentationsQingkai Fang, Yang Feng. 5687-5698 [doi]
- M3ED: Multi-modal Multi-scene Multi-label Emotional Dialogue DatabaseJinming Zhao, Tenggan Zhang, Jingwen Hu 0003, Yuchen Liu, Qin Jin, Xinchao Wang, Haizhou Li 0001. 5699-5710 [doi]
- Few-shot Named Entity Recognition with Self-describing NetworksJiawei Chen, Qing Liu, Hongyu Lin, Xianpei Han, Le Sun 0001. 5711-5722 [doi]
- SpeechT5: Unified-Modal Encoder-Decoder Pre-Training for Spoken Language ProcessingJunyi Ao, Rui Wang, Long Zhou, Chengyi Wang 0002, Shuo Ren, Yu Wu, Shujie Liu 0001, Tom Ko, Qing Li, Yu Zhang, Zhihua Wei, Yao Qian, Jinyu Li 0001, Furu Wei. 5723-5738 [doi]
- Human Evaluation and Correlation with Automatic Metrics in Consultation Note GenerationFrancesco Moramarco, Alex Papadopoulos-Korfiatis, Mark Perera, Damir Juric, Jack Flann, Ehud Reiter, Anya Belz, Aleksandar Savkov. 5739-5754 [doi]
- Unified Structure Generation for Universal Information ExtractionYaojie Lu 0001, Qing Liu, Dai Dai, Xinyan Xiao, Hongyu Lin, Xianpei Han, Le Sun 0001, Hua Wu. 5755-5772 [doi]
- Subgraph Retrieval Enhanced Model for Multi-hop Knowledge Base Question AnsweringJing Zhang, Xiaokang Zhang, Jifan Yu, Jian Tang 0005, Jie Tang 0005, Cuiping Li 0001, Hong Chen. 5773-5784 [doi]
- Pre-training to Match for Unified Low-shot Relation ExtractionFangchao Liu, Hongyu Lin, Xianpei Han, Boxi Cao, Le Sun 0001. 5785-5795 [doi]
- Can Prompt Probe Pretrained Language Models? Understanding the Invisible Risks from a Causal ViewBoxi Cao, Hongyu Lin, Xianpei Han, Fangchao Liu, Le Sun 0001. 5796-5808 [doi]
- Evaluating Extreme Hierarchical Multi-label ClassificationEnrique Amigó, Agustín D. Delgado. 5809-5819 [doi]
- What does the sea say to the shore? A BERT based DST style approach for speaker to dialogue attribution in novelsCarolina Cuesta-Lázaro, Animesh Prasad, Trevor Wood. 5820-5829 [doi]
- Measuring Fairness of Text Classifiers via Prediction SensitivitySatyapriya Krishna, Rahul Gupta, Apurv Verma, Jwala Dhamala, Yada Pruksachatkun, Kai-Wei Chang. 5830-5842 [doi]
- RotateQVS: Representing Temporal Information as Rotations in Quaternion Vector Space for Temporal Knowledge Graph CompletionKai Chen, Ye Wang, Yitong Li, Aiping Li. 5843-5857 [doi]
- Feeding What You Need by Understanding What You LearnedXiaoqiang Wang, Bang Liu, Fangli Xu, Bo Long, Siliang Tang, Lingfei Wu. 5858-5874 [doi]
- Probing Simile Knowledge from Pre-trained Language ModelsWeijie Chen, Yongzhu Chang, Rongsheng Zhang, Jiashu Pu, Guandan Chen, Le Zhang, Yadong Xi, Yijiang Chen, Chang Su. 5875-5887 [doi]
- An Effective and Efficient Entity Alignment Decoding Algorithm via Third-Order Tensor IsomorphismXin Mao, Meirong Ma, Hao Yuan, Jianchao Zhu, Zongyu Wang, Rui Xie, Wei Wu, Man Lan. 5888-5898 [doi]
- Entailment Graph Learning with Textual Entailment and Soft TransitivityZhibin Chen, Yansong Feng, Dongyan Zhao 0001. 5899-5910 [doi]
- Logic Traps in Evaluating Attribution ScoresYiming Ju, Yuanzhe Zhang, Zhao Yang, Zhongtao Jiang, Kang Liu 0001, Jun Zhao 0001. 5911-5922 [doi]
- Continual Pre-training of Language Models for Math Problem Understanding with Syntax-Aware Memory NetworkZheng Gong, Kun Zhou, Xin Zhao, Jing Sha, Shijin Wang, Ji-Rong Wen. 5923-5933 [doi]
- Multitasking Framework for Unsupervised Simple Definition GenerationCunliang Kong, Yun Chen, Hengyuan Zhang, Liner Yang, Erhong Yang. 5934-5943 [doi]
- Learning to Reason Deductively: Math Word Problem Solving as Complex Relation ExtractionZhanming Jie, Jierui Li, Wei Lu. 5944-5955 [doi]
- When did you become so smart, oh wise one?! Sarcasm Explanation in Multi-modal Multi-party DialoguesShivani Kumar, Atharva Kulkarni, Md. Shad Akhtar, Tanmoy Chakraborty 0002. 5956-5968 [doi]
- Toward Interpretable Semantic Textual Similarity via Optimal Transport-based Contrastive Sentence LearningSeonghyeon Lee, Dongha Lee, Seongbo Jang, Hwanjo Yu. 5969-5979 [doi]
- Pre-training and Fine-tuning Neural Topic Model: A Simple yet Effective Approach to Incorporating External KnowledgeLinhai Zhang, Xuemeng Hu, Boyu Wang, Deyu Zhou, Qian-Wen Zhang, Yunbo Cao. 5980-5989 [doi]
- Multi-View Document Representation Learning for Open-Domain Dense RetrievalShunyu Zhang, Yaobo Liang, Ming Gong, Daxin Jiang, Nan Duan. 5990-6000 [doi]
- Graph Pre-training for AMR Parsing and GenerationXuefeng Bai 0001, Yulong Chen 0001, Yue Zhang 0004. 6001-6015 [doi]
- Turning Tables: Generating Examples from Semi-structured Tables for Endowing Language Models with Reasoning SkillsOri Yoran, Alon Talmor, Jonathan Berant. 6016-6031 [doi]
- RNG-KBQA: Generation Augmented Iterative Ranking for Knowledge Base Question AnsweringXi Ye, Semih Yavuz, Kazuma Hashimoto, Yingbo Zhou, Caiming Xiong. 6032-6043 [doi]
- Rethinking Self-Supervision Objectives for Generalizable Coherence ModelingPrathyusha Jwalapuram, Shafiq Joty, Xiang Lin. 6044-6059 [doi]
- Just Rank: Rethinking Evaluation with Word and Sentence SimilaritiesBin Wang, C.-C. Kuo, Haizhou Li 0001. 6060-6077 [doi]
- MarkupLM: Pre-training of Text and Markup Language for Visually Rich Document UnderstandingJunlong Li, Yiheng Xu, Lei Cui 0001, Furu Wei. 6078-6087 [doi]
- CLIP Models are Few-Shot Learners: Empirical Studies on VQA and Visual EntailmentHaoyu Song 0002, Li Dong 0004, Weinan Zhang 0003, Ting Liu 0001, Furu Wei. 6088-6100 [doi]
- KQA Pro: A Dataset with Explicit Compositional Programs for Complex Question Answering over Knowledge BaseShulin Cao, Jiaxin Shi, Liangming Pan, Lunyiu Nie, Yutong Xiang, Lei Hou 0001, Juanzi Li, Bin He, Hanwang Zhang. 6101-6119 [doi]
- Debiased Contrastive Learning of Unsupervised Sentence RepresentationsKun Zhou, Beichen Zhang, Xin Zhao, Ji-Rong Wen. 6120-6130 [doi]
- MSP: Multi-Stage Prompting for Making Pre-trained Language Models Better TranslatorsZhixing Tan, Xiangwen Zhang, Shuo Wang, Yang Liu 0005. 6131-6142 [doi]
- SalesBot: Transitioning from Chit-Chat to Task-Oriented DialoguesSsu Chiu, Maolin Li, Yen-Ting Lin, Yun-Nung Chen. 6143-6158 [doi]
- UCTopic: Unsupervised Contrastive Learning for Phrase Representations and Topic MiningJiacheng Li, Jingbo Shang, Julian McAuley. 6159-6169 [doi]
- XLM-E: Cross-lingual Language Model Pre-training via ELECTRA. Zewen Chi, Shaohan Huang, Li Dong 0004, Shuming Ma, Bo Zheng, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei. 6170-6182 [doi]
- Nested Named Entity Recognition as Latent Lexicalized Constituency ParsingChao Lou, Songlin Yang, Kewei Tu. 6183-6198 [doi]
- Can Explanations Be Useful for Calibrating Black Box Models? Xi Ye, Greg Durrett. 6199-6212 [doi]
- OIE@OIA: an Adaptable and Efficient Open Information Extraction FrameworkXin Wang, Minlong Peng, Mingming Sun, Ping Li. 6213-6226 [doi]
- ReACC: A Retrieval-Augmented Code Completion FrameworkShuai Lu, Nan Duan, Hojae Han, Daya Guo, Seung-won Hwang, Alexey Svyatkovskiy. 6227-6240 [doi]
- Does Recommend-Revise Produce Reliable Annotations? An Analysis on Missing Instances in DocRED. Quzhe Huang, Shibo Hao, Yuan Ye 0001, Shengqi Zhu, Yansong Feng, Dongyan Zhao 0001. 6241-6252 [doi]
- UniPELT: A Unified Framework for Parameter-Efficient Language Model TuningYuning Mao, Lambert Mathias, Rui Hou, Amjad Almahairi, Hao Ma, Jiawei Han 0001, Scott Yih, Madian Khabsa. 6253-6264 [doi]
- An Empirical Study of Memorization in NLP. Xiaosen Zheng, Jing Jiang. 6265-6278 [doi]
- AmericasNLI: Evaluating Zero-shot Natural Language Understanding of Pretrained Multilingual Models in Truly Low-resource LanguagesAbteen Ebrahimi, Manuel Mager, Arturo Oncevay, Vishrav Chaudhary, Luis Chiruzzo, Angela Fan, John Ortega, Ricardo Ramos, Annette Rios, Iván Vladimir Meza Ruíz, Gustavo Giménez Lugo, Elisabeth Mager, Graham Neubig, Alexis Palmer, Rolando A. Coto Solano, Ngoc Thang Vu, Katharina Kann. 6279-6299 [doi]
- Towards Learning (Dis)-Similarity of Source Code from Program ContrastsYangruibo Ding, Luca Buratti, Saurabh Pujar, Alessandro Morari, Baishakhi Ray, Saikat Chakraborty. 6300-6312 [doi]
- Guided Attention Multimodal Multitask Financial Forecasting with Inter-Company Relationships and Global and Local NewsGary Ang, Ee-Peng Lim. 6313-6326 [doi]
- On Vision Features in Multimodal Machine TranslationBei Li, Chuanhao Lv, Zefan Zhou, Tao Zhou, Tong Xiao, Anxiang Ma, Jingbo Zhu. 6327-6337 [doi]
- CONTaiNER: Few-Shot Named Entity Recognition via Contrastive LearningSarkar Snigdha Sarathi Das, Arzoo Katiyar, Rebecca J. Passonneau, Rui Zhang. 6338-6353 [doi]
- Cree Corpus: A Collection of nêhiyawêwin ResourcesDaniela Teodorescu, Josie Matalski, Delaney Lothian, Denilson Barbosa, Carrie Demmans Epp. 6354-6364 [doi]
- Learning to Rank Visual Stories From Human Ranking DataChi-Yang Hsu, Yun-Wei Chu, Vincent Chen, Kuan-Chieh Lo, Chacha Chen, Ting-Hao Huang, Lun-Wei Ku. 6365-6378 [doi]
- Universal Conditional Masked Language Pre-training for Neural Machine TranslationPengfei Li, Liangyou Li, Meng Zhang, Minghao Wu, Qun Liu. 6379-6391 [doi]
- CARETS: A Consistency And Robustness Evaluative Test Suite for VQA. Carlos E. Jimenez, Olga Russakovsky, Karthik Narasimhan. 6392-6405 [doi]
- Phrase-aware Unsupervised Constituency ParsingXiaotao Gu, Yikang Shen, Jiaming Shen, Jingbo Shang, Jiawei Han 0001. 6406-6415 [doi]
- Achieving Reliable Human Assessment of Open-Domain Dialogue SystemsTianbo Ji, Yvette Graham, Gareth J. F. Jones, Chenyang Lyu, Qun Liu 0001. 6416-6437 [doi]
- Updated Headline Generation: Creating Updated Summaries for Evolving News StoriesSheena Panthaplackel, Adrian Benton, Mark Dredze. 6438-6461 [doi]
- SaFeRDialogues: Taking Feedback Gracefully after Conversational Safety FailuresMegan Ung, Jing Xu, Y.-Lan Boureau. 6462-6481 [doi]
- Compositional Generalization in Dependency ParsingEmily Goodwin, Siva Reddy, Timothy O'Donnell, Dzmitry Bahdanau. 6482-6493 [doi]
- ASPECTNEWS: Aspect-Oriented Summarization of News DocumentsOjas Ahuja, Jiacheng Xu, Akshay Gupta, Kevin Horecka, Greg Durrett. 6494-6506 [doi]
- MemSum: Extractive Summarization of Long Documents Using Multi-Step Episodic Markov Decision ProcessesNianlong Gu, Elliott Ash, Richard H. R. Hahnloser. 6507-6522 [doi]
- CLUES: A Benchmark for Learning Classifiers using Natural Language ExplanationsRakesh R. Menon, Sayan Ghosh, Shashank Srivastava. 6523-6546 [doi]
- Substructure Distribution Projection for Zero-Shot Cross-Lingual Dependency ParsingFreda Shi, Kevin Gimpel, Karen Livescu. 6547-6563 [doi]
- Multilingual Detection of Personal Employment Status on TwitterManuel Tonneau, Dhaval Adjodah, João Palotti, Nir Grinberg, Samuel Fraiberger. 6564-6587 [doi]
- MultiHiertt: Numerical Reasoning over Multi Hierarchical Tabular and Textual DataYilun Zhao, Yunxiang Li, Chenying Li, Rui Zhang. 6588-6600 [doi]
- Transformers in the loop: Polarity in neural models of languageLisa Bylinina, Alexey Tikhonov. 6601-6610 [doi]
- Bridging the Data Gap between Training and Inference for Unsupervised Neural Machine TranslationZhiwei He 0002, Xing Wang 0007, Rui Wang, Shuming Shi, Zhaopeng Tu. 6611-6623 [doi]
- SDR: Efficient Neural Re-ranking using Succinct Document RepresentationNachshon Cohen, Amit Portnoy, Besnik Fetahu, Amir Ingber. 6624-6637 [doi]
- The AI Doctor Is In: A Survey of Task-Oriented Dialogue Systems for Healthcare ApplicationsMina Valizadeh, Natalie Parde. 6638-6660 [doi]
- SHIELD: Defending Textual Neural Networks against Multiple Black-Box Adversarial Attacks with Stochastic Multi-Expert PatcherThai Le, Noseong Park, Dongwon Lee 0001. 6661-6674 [doi]
- Accurate Online Posterior Alignments for Principled Lexically-Constrained DecodingSoumya Chatterjee, Sunita Sarawagi, Preethi Jyothi. 6675-6689 [doi]
- Leveraging Task Transferability to Meta-learning for Clinical Section Classification with Limited DataZhuohao Chen, Jangwon Kim, Ram Bhakta, Mustafa Y. Sir. 6690-6702 [doi]
- Reinforcement Guided Multi-Task Learning Framework for Low-Resource Stereotype DetectionRajkumar Pujari, Erik Oveson, Priyanka Kulkarni, Elnaz Nouri. 6703-6712 [doi]
- Letters From the Past: Modeling Historical Sound Change Through Diachronic Character EmbeddingsSidsel Boldsen, Patrizia Paggio. 6713-6722 [doi]
- A Token-level Reference-free Hallucination Detection Benchmark for Free-form Text GenerationTianyu Liu, Yizhe Zhang, Chris Brockett, Yi Mao, Zhifang Sui, Weizhu Chen, Bill Dolan. 6723-6737 [doi]
- Low-Rank Softmax Can Have Unargmaxable Classes in Theory but Rarely in PracticeAndreas Grivas, Nikolay Bogoychev, Adam Lopez. 6738-6758 [doi]
- Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument ExtractionYubo Ma, Zehao Wang, Yixin Cao 0005, Mukai Li, Meiqi Chen, Kun Wang, Jing Shao. 6759-6774 [doi]
- Reducing Position Bias in Simultaneous Machine Translation with Length-Aware FrameworkShaolei Zhang, Yang Feng. 6775-6788 [doi]
- A Statutory Article Retrieval Dataset in FrenchAntoine Louis, Gerasimos Spanakis. 6789-6803 [doi]
- ParaDetox: Detoxification with Parallel DataVarvara Logacheva, Daryna Dementieva, Sergey Ustyantsev, Daniil Moskovskiy, David Dale, Irina Krotova, Nikita Semenov, Alexander Panchenko. 6804-6818 [doi]
- Interpreting Character Embeddings With Perceptual Representations: The Case of Shape, Sound, and ColorSidsel Boldsen, Manex Agirrezabal, Nora Hollenstein. 6819-6836 [doi]
- Fine-Grained Controllable Text Generation Using Non-Residual PromptingFredrik Carlsson, Joey Öhman, Fangyu Liu 0001, Severine Verlinden, Joakim Nivre, Magnus Sahlgren. 6837-6857 [doi]
- Language-Agnostic Meta-Learning for Low-Resource Text-to-Speech with Articulatory FeaturesFlorian Lux, Ngoc Thang Vu. 6858-6868 [doi]
- TwittIrish: A Universal Dependencies Treebank of Tweets in Modern IrishLauren Cassidy, Teresa Lynn, James Barry, Jennifer Foster. 6869-6884 [doi]
- Length Control in Abstractive Summarization by Pretraining Information SelectionYizhu Liu, Qi Jia 0003, Kenny Q. Zhu. 6885-6895 [doi]
- CQG: A Simple and Effective Controlled Generation Framework for Multi-hop Question GenerationZichu Fei, Qi Zhang, Tao Gui, Di Liang, Sirui Wang, Wei Wu, Xuanjing Huang. 6896-6906 [doi]
- Word Order Does Matter and Shuffled Language Models Know ItMostafa Abdou, Vinit Ravishankar, Artur Kulmizev, Anders Søgaard. 6907-6919 [doi]
- An Empirical Study on Explanations in Out-of-Domain SettingsGeorge Chrysostomou, Nikolaos Aletras. 6920-6938 [doi]
- MILIE: Modular & Iterative Multilingual Open Information ExtractionBhushan Kotnis, Kiril Gashteovski, Daniel Rubio, Ammar Shaker, Vanesa Rodriguez-Tembras, Makoto Takamoto, Mathias Niepert, Carolin Lawrence. 6939-6950 [doi]
- What Makes Reading Comprehension Questions Difficult? Saku Sugawara, Nikita Nangia, Alex Warstadt, Samuel R. Bowman. 6951-6971 [doi]
- From Simultaneous to Streaming Machine Translation by Leveraging Streaming HistoryJavier Iranzo-Sánchez, Jorge Civera, Alfons Juan-Císcar. 6972-6985 [doi]
- A Rationale-Centric Framework for Human-in-the-loop Machine LearningJinghui Lu, Linyi Yang, Brian MacNamee, Yue Zhang. 6986-6996 [doi]
- Challenges and Strategies in Cross-Cultural NLP. Daniel Hershcovich, Stella Frank, Heather C. Lent, Miryam de Lhoneux, Mostafa Abdou, Stephanie Brandl, Emanuele Bugliarello, Laura Cabello Piqueras, Ilias Chalkidis, Ruixiang Cui, Constanza Fierro, Katerina Margatina, Phillip Rust, Anders Søgaard. 6997-7013 [doi]
- Prototypical Verbalizer for Prompt-based Few-shot TuningGanqu Cui, Shengding Hu, Ning Ding, Longtao Huang, Zhiyuan Liu. 7014-7024 [doi]
- Clickbait Spoiling via Question Answering and Passage RetrievalMatthias Hagen, Maik Fröbe, Artur Jurk, Martin Potthast. 7025-7036 [doi]
- BERT Learns to Teach: Knowledge Distillation with Meta LearningWangchunshu Zhou, Canwen Xu, Julian McAuley. 7037-7049 [doi]
- STEMM: Self-learning with Speech-text Manifold Mixup for Speech TranslationQingkai Fang, Rong Ye, Lei Li, Yang Feng, Mingxuan Wang. 7050-7062 [doi]
- Integrating Vectorized Lexical Constraints for Neural Machine TranslationShuo Wang, Zhixing Tan, Yang Liu. 7063-7073 [doi]
- MPII: Multi-Level Mutual Promotion for Inference and InterpretationYan Liu, Sanyuan Chen, Yazheng Yang, Qi Dai. 7074-7084 [doi]
- StableMoE: Stable Routing Strategy for Mixture of ExpertsDamai Dai, Li Dong 0004, Shuming Ma, Bo Zheng, Zhifang Sui, Baobao Chang, Furu Wei. 7085-7095 [doi]
- Boundary Smoothing for Named Entity RecognitionEnwei Zhu, Jinpeng Li 0002. 7096-7108 [doi]
- Incorporating Hierarchy into Text Encoder: a Contrastive Learning Approach for Hierarchical Text ClassificationZihan Wang, Peiyi Wang, Lianzhe Huang, Xin Sun 0013, Houfeng Wang. 7109-7119 [doi]
- Signal in Noise: Exploring Meaning Encoded in Random Character Sequences with Character-Aware Language ModelsMark Chu, Bhargav Srinivasa Desikan, Ethan O. Nadler, Donald Ruggiero Lo Sardo, Elise Darragh-Ford, Douglas Guilbeault. 7120-7134 [doi]
- Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question AnsweringJiawei Zhou, Xiaoguang Li, Lifeng Shang, Lan Luo, Ke Zhan, Enrui Hu, Xinyu Zhang, Hao Jiang, Zhao Cao, Fan Yu, Xin Jiang, Qun Liu, Lei Chen 0002. 7135-7146 [doi]
- AdaLoGN: Adaptive Logic Graph Network for Reasoning-Based Machine Reading ComprehensionXiao Li, Gong Cheng 0001, Ziheng Chen, Yawei Sun, Yuzhong Qu. 7147-7161 [doi]
- CAMERO: Consistency Regularized Ensemble of Perturbed Language Models with Weight SharingChen Liang, Pengcheng He, Yelong Shen, Weizhu Chen, Tuo Zhao. 7162-7175 [doi]
- Interpretability for Language Learners Using Example-Based Grammatical Error CorrectionMasahiro Kaneko, Sho Takase, Ayana Niwa, Naoaki Okazaki. 7176-7187 [doi]
- Rethinking Negative Sampling for Handling Missing Entity AnnotationsYangming Li, Lemao Liu, Shuming Shi 0001. 7188-7197 [doi]
- Distantly Supervised Named Entity Recognition via Confidence-Based Multi-Class Positive and Unlabeled LearningKang Zhou, Yuepei Li, Qi Li. 7198-7211 [doi]
- UniXcoder: Unified Cross-Modal Pre-training for Code RepresentationDaya Guo, Shuai Lu, Nan Duan, Yanlin Wang, Ming Zhou 0001, Jian Yin 0001. 7212-7225 [doi]
- One Country, 700+ Languages: NLP Challenges for Underrepresented Languages and Dialects in IndonesiaAlham Fikri Aji, Genta Indra Winata, Fajri Koto, Samuel Cahyawijaya, Ade Romadhony, Rahmad Mahendra, Kemal Kurniawan, David Moeljadi, Radityo Eko Prasojo, Timothy Baldwin, Jey Han Lau, Sebastian Ruder. 7226-7249 [doi]
- Is GPT-3 Text Indistinguishable from Human Text? Scarecrow: A Framework for Scrutinizing Machine TextYao Dou, Maxwell Forbes, Rik Koncel-Kedziorski, Noah A. Smith, Yejin Choi. 7250-7274 [doi]
- Transkimmer: Transformer Learns to Layer-wise SkimYue Guan, Zhengyi Li, Jingwen Leng, Zhouhan Lin, Minyi Guo. 7275-7286 [doi]
- SkipBERT: Efficient Inference with Shallow Layer SkippingJue Wang 0019, Ke Chen 0005, Gang Chen 0001, Lidan Shou, Julian McAuley. 7287-7301 [doi]
- Pretraining with Artificial Language: Studying Transferable Knowledge in Language ModelsRyokan Ri, Yoshimasa Tsuruoka. 7302-7315 [doi]
- mLUKE: The Power of Entity Representations in Multilingual Pretrained Language ModelsRyokan Ri, Ikuya Yamada, Yoshimasa Tsuruoka. 7316-7330 [doi]
- Evaluating Factuality in Text SimplificationAshwin Devaraj, William Sheffield, Byron C. Wallace, Junyi Jessy Li. 7331-7345 [doi]
- Requirements and Motivations of Low-Resource Speech Synthesis for Language RevitalizationAidan Pine, Dan Wells, Nathan Thanyehténhas Brinklow, Patrick Littell, Korin Richmond. 7346-7359 [doi]
- Sharpness-Aware Minimization Improves Language Model GeneralizationDara Bahri, Hossein Mobahi, Yi Tay. 7360-7371 [doi]
- Adversarial Authorship Attribution for DeobfuscationWanyue Zhai, Jonathan Rusert, Zubair Shafiq, Padmini Srinivasan. 7372-7384 [doi]
- Weakly Supervised Word Segmentation for Computational Language DocumentationShu Okabe, Laurent Besacier, François Yvon. 7385-7398 [doi]
- SciNLI: A Corpus for Natural Language Inference on Scientific TextMobashir Sadat, Cornelia Caragea. 7399-7409 [doi]
- Neural reality of argument structure constructionsBai Li, Zining Zhu, Guillaume Thomas, Frank Rudzicz, Yang Xu 0023. 7410-7423 [doi]
- On the Robustness of Offensive Language ClassifiersJonathan Rusert, Zubair Shafiq, Padmini Srinivasan. 7424-7438 [doi]
- Few-shot Controllable Style Transfer for Low-Resource Multilingual SettingsKalpesh Krishna, Deepak Nathani, Xavier Garcia, Bidisha Samanta, Partha Talukdar. 7439-7468 [doi]
- ABC: Attention with Bounded-memory ControlHao Peng, Jungo Kasai, Nikolaos Pappas 0002, Dani Yogatama, Zhaofeng Wu, Lingpeng Kong, Roy Schwartz 0001, Noah A. Smith. 7469-7483 [doi]
- The Dangers of Underclaiming: Reasons for Caution When Reporting How NLP Systems FailSamuel R. Bowman. 7484-7499 [doi]
- RELiC: Retrieving Evidence for Literary ClaimsKatherine Thai, Yapei Chang, Kalpesh Krishna, Mohit Iyyer. 7500-7518 [doi]
- Analyzing Generalization of Vision and Language Navigation to Unseen Outdoor AreasRaphael Schumann, Stefan Riezler. 7519-7532 [doi]
- Adapting Coreference Resolution Models through Active LearningMichelle Yuan, Patrick Xia, Chandler May, Benjamin Van Durme, Jordan L. Boyd-Graber. 7533-7549 [doi]
- An Imitation Learning Curriculum for Text Editing with Non-Autoregressive ModelsSweta Agrawal, Marine Carpuat. 7550-7563 [doi]
- Memorisation versus Generalisation in Pre-trained Language ModelsMichael Tänzer, Sebastian Ruder, Marek Rei. 7564-7578 [doi]
- ChatMatch: Evaluating Chatbots by Autonomous Chat TournamentsRuolan Yang, Zitong Li, Haifeng Tang, Kenny Q. Zhu. 7579-7590 [doi]
- Do self-supervised speech models develop human-like perception biases? Juliette Millet, Ewan Dunbar. 7591-7605 [doi]
- Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future DirectionsJing Gu, Eliana Stefani, Qi Wu, Jesse Thomason, Xin Wang. 7606-7623 [doi]
- Learning to Generate Programs for Table Fact Verification via Structure-Aware Semantic ParsingSuixin Ou, Yongmei Liu. 7624-7638 [doi]
- Cluster & Tune: Boost Cold Start Performance in Text ClassificationEyal Shnarch, Ariel Gera, Alon Halfon, Lena Dankin, Leshem Choshen, Ranit Aharonov, Noam Slonim. 7639-7653 [doi]
- Overcoming a Theoretical Limitation of Self-AttentionDavid Chiang 0001, Peter Cholak. 7654-7664 [doi]
- Prediction Difference Regularization against Perturbation for Neural Machine TranslationDengji Guo, Zhengrui Ma, Min Zhang, Yang Feng 0004. 7665-7675 [doi]
- Make the Best of Cross-lingual Transfer: Evidence from POS Tagging with over 100 LanguagesWietse de Vries, Martijn Wieling, Malvina Nissim. 7676-7685 [doi]
- Should a Chatbot be Sarcastic? Understanding User Preferences Towards Sarcasm GenerationSilviu Vlad Oprea, Steven R. Wilson 0001, Walid Magdy. 7686-7700 [doi]
- How Do Seq2Seq Models Perform on End-to-End Data-to-Text Generation? Xunjian Yin, Xiaojun Wan 0001. 7701-7710 [doi]
- Probing for Labeled Dependency TreesMax Müller-Eberstein, Rob van der Goot, Barbara Plank. 7711-7726 [doi]
- DoCoGen: Domain Counterfactual Generation for Low Resource Domain AdaptationNitay Calderon, Eyal Ben-David, Amir Feder, Roi Reichart. 7727-7746 [doi]
- LiLT: A Simple yet Effective Language-Independent Layout Transformer for Structured Document UnderstandingJiapeng Wang, Lianwen Jin, Kai Ding. 7747-7757 [doi]
- Dependency-based Mixture Language ModelsZhixian Yang, Xiaojun Wan 0001. 7758-7773 [doi]
- Can Unsupervised Knowledge Transfer from Social Discussions Help Argument Mining? Subhabrata Dutta, Jeevesh Juneja, Dipankar Das 0001, Tanmoy Chakraborty 0002. 7774-7786 [doi]
- Entity-based Neural Local Coherence ModelingSungho Jeon 0002, Michael Strube 0001. 7787-7805 [doi]
- "That Is a Suspicious Reaction!": Interpreting Logits Variation to Detect NLP Adversarial AttacksEdoardo Mosca, Shreyash Agarwal, Javier Rando-Ramirez, Georg Groh. 7806-7816 [doi]
- Local Languages, Third Spaces, and other High-Resource ScenariosSteven Bird. 7817-7829 [doi]
- That Slepen Al the Nyght with Open Ye! Cross-era Sequence Segmentation with Switch-memoryXuemei Tang, Qi Su. 7830-7840 [doi]
- Fair and Argumentative Language Modeling for Computational ArgumentationCarolin Holtermann, Anne Lauscher, Simone Paolo Ponzetto. 7841-7861 [doi]
- Learning Adaptive Segmentation Policy for End-to-End Simultaneous TranslationRuiqing Zhang, Zhongjun He, Hua Wu, Haifeng Wang. 7862-7874 [doi]
- Can Pre-trained Language Models Interpret Similes as Smart as Human? Qianyu He, Sijie Cheng, Zhixu Li, Rui Xie, Yanghua Xiao. 7875-7887 [doi]
- CBLUE: A Chinese Biomedical Language Understanding Evaluation BenchmarkNingyu Zhang, Mosha Chen, Zhen Bi, Xiaozhuan Liang, Lei Li, Xin Shang, Kangping Yin, Chuanqi Tan, Jian Xu, Fei Huang, Luo Si, Yuan Ni, Guotong Xie, Zhifang Sui, Baobao Chang, Hui Zong, Zheng Yuan 0002, Linfeng Li, Jun Yan, Hongying Zan, Kunli Zhang, Buzhou Tang, Qingcai Chen. 7888-7915 [doi]
- Learning Non-Autoregressive Models from Search for Unsupervised Sentence SummarizationPuyuan Liu, Chenyang Huang 0001, Lili Mou. 7916-7929 [doi]
- Learning to Generalize to More: Continuous Semantic Augmentation for Neural Machine TranslationXiangpeng Wei, Heng Yu, Yue Hu 0002, Rongxiang Weng, Weihua Luo, Rong Jin. 7930-7944 [doi]
- Lexical Knowledge Internalization for Neural Dialog GenerationZhiyong Wu 0003, Wei Bi, Xiang Li, Lingpeng Kong, Ben Kao. 7945-7958 [doi]
- Modeling Syntactic-Semantic Dependency Correlations in Semantic Role Labeling Using Mixture ModelsJunjie Chen, Xiangheng He, Yusuke Miyao. 7959-7969 [doi]
- Learning the Beauty in Songs: Neural Singing Voice BeautifierJinglin Liu, Chengxi Li 0002, Yi Ren 0006, Zhiying Zhu, Zhou Zhao. 7970-7983 [doi]
- A Model-agnostic Data Manipulation Method for Persona-based Dialogue GenerationYu Cao, Wei Bi, Meng Fang, Shuming Shi 0001, Dacheng Tao. 7984-8002 [doi]
- LinkBERT: Pretraining Language Models with Document LinksMichihiro Yasunaga, Jure Leskovec, Percy Liang. 8003-8016 [doi]
- Improving Time Sensitivity for Question Answering over Temporal Knowledge GraphsChao Shang, Guangtao Wang, Peng Qi 0003, Jing Huang 0019. 8017-8026 [doi]
- Self-supervised Semantic-driven Phoneme Discovery for Zero-resource Speech RecognitionLiming Wang, Siyuan Feng, Mark Hasegawa-Johnson, Chang Dong Yoo. 8027-8047 [doi]
- Softmax Bottleneck Makes Language Models Unable to Represent Multi-mode Word DistributionsHaw-Shiuan Chang, Andrew McCallum. 8048-8073 [doi]
- Ditch the Gold Standard: Re-evaluating Conversational Question AnsweringHuihan Li, Tianyu Gao, Manan Goenka, Danqi Chen. 8074-8085 [doi]
- Fantastically Ordered Prompts and Where to Find Them: Overcoming Few-Shot Prompt Order SensitivityYao Lu, Max Bartolo, Alastair Moore, Sebastian Riedel 0001, Pontus Stenetorp. 8086-8098 [doi]
- Situated Dialogue Learning through Procedural Environment GenerationPrithviraj Ammanabrolu, Renee Jia, Mark O. Riedl. 8099-8116 [doi]
- UniTE: Unified Translation EvaluationYu Wan 0004, Dayiheng Liu, Baosong Yang, Haibo Zhang, Boxing Chen, Derek F. Wong, Lidia S. Chao. 8117-8127 [doi]
- Program Transfer for Answering Complex Questions over Knowledge BasesShulin Cao, Jiaxin Shi, Zijun Yao 0002, Xin Lv, Jifan Yu, Lei Hou 0001, Juanzi Li, Zhiyuan Liu 0001, JingHui Xiao. 8128-8140 [doi]
- EAG: Extract and Generate Multi-way Aligned Corpus for Complete Multi-lingual Neural Machine TranslationYulin Xu, Zhen Yang, Fandong Meng, Jie Zhou. 8141-8153 [doi]
- Using Context-to-Vector with Graph Retrofitting to Improve Word EmbeddingsJiangbin Zheng, Yile Wang, Ge Wang, Jun Xia, Yufei Huang, Guojiang Zhao, Yue Zhang 0004, Stan Z. Li. 8154-8163 [doi]
- Multimodal Sarcasm Target Identification in TweetsJiquan Wang, Lin Sun, Yi Liu, Meizhi Shao, Zengwei Zheng. 8164-8175 [doi]
- Flexible Generation from Fragmentary Linguistic InputPeng Qian, Roger Levy. 8176-8196 [doi]
- Revisiting Over-Smoothness in Text to SpeechYi Ren 0006, Xu Tan, Tao Qin, Zhou Zhao, Tie-Yan Liu. 8197-8213 [doi]
- Coherence boosting: When your pretrained language model is not paying enough attentionNikolay Malkin, Zhen Wang, Nebojsa Jojic. 8214-8236 [doi]
- Uncertainty Estimation of Transformer Predictions for Misclassification DetectionArtem Vazhentsev, Gleb Kuzmin, Artem Shelmanov, Akim Tsvigun, Evgenii Tsymbalov, Kirill Fedyanin, Maxim Panov, Alexander Panchenko, Gleb Gusev, Mikhail Burtsev, Manvel Avetisian, Leonid Zhukov. 8237-8252 [doi]
- VALSE: A Task-Independent Benchmark for Vision and Language Models Centered on Linguistic PhenomenaLetitia Parcalabescu, Michele Cafagna, Lilitta Muradjan, Anette Frank, Iacer Calixto, Albert Gatt. 8253-8280 [doi]
- The Grammar-Learning Trajectories of Neural Language ModelsLeshem Choshen, Guy Hacohen, Daphna Weinshall, Omri Abend. 8281-8297 [doi]
- Generating Scientific Definitions with Controllable ComplexityTal August, Katharina Reinecke, Noah A. Smith. 8298-8317 [doi]
- Label Semantic Aware Pre-training for Few-shot Text ClassificationAaron Mueller, Jason Krone, Salvatore Romeo, Saab Mansour, Elman Mansimov, Yi Zhang, Dan Roth. 8318-8334 [doi]
- ODE Transformer: An Ordinary Differential Equation-Inspired Model for Sequence GenerationBei Li, Quan Du, Tao Zhou, Yi Jing, Shuhan Zhou, Xin Zeng, Tong Xiao, Jingbo Zhu, Xuebo Liu 0002, Min Zhang. 8335-8351 [doi]
- A Comparison of Strategies for Source-Free Domain AdaptationXin Su, Yiyun Zhao, Steven Bethard. 8352-8367 [doi]
- Ethics Sheets for AI TasksSaif M. Mohammad. 8368-8379 [doi]
- Learning Disentangled Representations of Negation and UncertaintyJake Vasilakes, Chrysoula Zerva, Makoto Miwa, Sophia Ananiadou. 8380-8397 [doi]
- GLAT: Glancing at Latent Variables for Parallel Text GenerationYu Bao, Hao Zhou, Shujian Huang, Dongqi Wang, Lihua Qian, Xinyu Dai, Jiajun Chen, Lei Li. 8398-8409 [doi]
- PPT: Pre-trained Prompt Tuning for Few-shot LearningYuxian Gu, Xu Han, Zhiyuan Liu, Minlie Huang. 8410-8423 [doi]
- Deduplicating Training Data Makes Language Models BetterKatherine Lee, Daphne Ippolito, Andrew Nystrom, Chiyuan Zhang, Douglas Eck, Chris Callison-Burch, Nicholas Carlini. 8424-8445 [doi]
- Improving the Generalizability of Depression Detection by Leveraging Clinical QuestionnairesThong Nguyen, Andrew Yates, Ayah Zirikly, Bart Desmet, Arman Cohan. 8446-8459 [doi]
- Internet-Augmented Dialogue GenerationMojtaba Komeili, Kurt Shuster 0001, Jason Weston. 8460-8478 [doi]
- SUPERB-SG: Enhanced Speech processing Universal PERformance Benchmark for Semantic and Generative CapabilitiesHsiang-Sheng Tsai, Heng-Jui Chang, Wen-Chin Huang, Zili Huang, Kushal Lakhotia, Shu-Wen Yang, Shuyan Dong, Andy T. Liu, Cheng-I Lai, Jiatong Shi, Xuankai Chang, Phil Hall, Hsuan-Jui Chen, Shang-wen Li 0001, Shinji Watanabe 0001, Abdelrahman Mohamed, Hung-yi Lee. 8479-8492 [doi]
- Knowledge Neurons in Pretrained TransformersDamai Dai, Li Dong 0004, Yaru Hao, Zhifang Sui, Baobao Chang, Furu Wei. 8493-8502 [doi]
- Meta-Learning for Fast Cross-Lingual Adaptation in Dependency ParsingAnna Langedijk, Verna Dankers, Phillip Lippe, Sander Bos, Bryan Cardenas Guevara, Helen Yannakoudakis, Ekaterina Shutova. 8503-8520 [doi]
- French CrowS-Pairs: Extending a challenge dataset for measuring social bias in masked language models to a language other than EnglishAurélie Névéol, Yoann Dupont, Julien Bezançon, Karën Fort. 8521-8531 [doi]
- Few-Shot Learning with Siamese Networks and Label TuningThomas Müller 0014, Guillermo Pérez-Torró, Marc Franco-Salvador. 8532-8545 [doi]
- Inferring Rewards from Language in ContextJessy Lin, Daniel Fried, Dan Klein, Anca D. Dragan. 8546-8560 [doi]
- Generating Biographies on Wikipedia: The Impact of Gender Bias on the Retrieval-Based Generation of Women BiographiesAngela Fan, Claire Gardent. 8561-8576 [doi]
- Your Answer is Incorrect... Would you like to know why? Introducing a Bilingual Short Answer Feedback DatasetAnna Filighera, Siddharth Parihar, Tim Steuer, Tobias Meuser, Sebastian Ochs. 8577-8591 [doi]
- Towards Better Characterization of ParaphrasesTimothy Liu, De Wen Soh. 8592-8601 [doi]
- SummScreen: A Dataset for Abstractive Screenplay SummarizationMingda Chen, Zewei Chu, Sam Wiseman, Kevin Gimpel. 8602-8615 [doi]
- Sparsifying Transformer Models with Trainable Representation PoolingMichal Pietruszka, Lukasz Borchmann, Lukasz Garncarek. 8616-8633 [doi]
- Uncertainty Determines the Adequacy of the Mode and the Tractability of Decoding in Sequence-to-Sequence ModelsFelix Stahlberg, Ilia Kulikov, Shankar Kumar. 8634-8645 [doi]
- FlipDA: Effective and Robust Data Augmentation for Few-Shot LearningJing Zhou, Yanan Zheng, Jie Tang, Li Jian, Zhilin Yang. 8646-8665 [doi]
- Text-Free Prosody-Aware Generative Spoken Language ModelingEugene Kharitonov, Ann Lee, Adam Polyak, Yossi Adi, Jade Copet, Kushal Lakhotia, Tu Anh Nguyen, Morgane Rivière, Abdelrahman Mohamed, Emmanuel Dupoux, Wei-Ning Hsu. 8666-8681 [doi]
- Lite Unified Modeling for Discriminative Reading ComprehensionYilin Zhao, Hai Zhao, Libin Shen, Yinggong Zhao. 8682-8695 [doi]
- Bilingual alignment transfers to multilingual alignment for unsupervised parallel text miningChih-chan Tien, Shane Steinert-Threlkeld. 8696-8706 [doi]
- End-to-End Modeling via Information Tree for One-Shot Natural Language Spatial Video GroundingMengze Li, Tianbao Wang, Haoyu Zhang, Shengyu Zhang, Zhou Zhao, Jiaxu Miao, Wenqiao Zhang, Wenming Tan, Jin Wang, Peng Wang, Shiliang Pu, Fei Wu. 8707-8717 [doi]
- RNSum: A Large-Scale Dataset for Automatic Release Note Generation via Commit Logs SummarizationHisashi Kamezawa, Noriki Nishida, Nobuyuki Shimizu, Takashi Miyazaki, Hideki Nakayama. 8718-8735 [doi]
- Improving Machine Reading Comprehension with Contextualized Commonsense KnowledgeKai Sun 0006, Dian Yu 0001, Jianshu Chen, Dong Yu 0001, Claire Cardie. 8736-8747 [doi]
- Modeling Persuasive Discourse to Adaptively Support Students' Argumentative WritingThiemo Wambsganss, Christina Niklaus. 8748-8760 [doi]
- Active Evaluation: Efficient NLG Evaluation with Few Pairwise ComparisonsAkash Kumar Mohankumar, Mitesh M. Khapra. 8761-8781 [doi]
- The Moral Debater: A Study on the Computational Generation of Morally Framed ArgumentsMilad Alshomary, Roxanne El Baff, Timon Gurcke, Henning Wachsmuth. 8782-8797 [doi]
- Pyramid-BERT: Reducing Complexity via Successive Core-set based Token SelectionXin Huang, Ashish Khetan, Rene Bidart, Zohar Karnin. 8798-8817 [doi]
- Probing for the Usage of Grammatical NumberKarim Lasri, Tiago Pimentel, Alessandro Lenci, Thierry Poibeau, Ryan Cotterell. 8818-8831 [doi]