


EMNLP 2023: Singapore
- Mingxuan Wang, Imed Zitouni:

Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing: EMNLP 2023 - Industry Track, Singapore, December 6-10, 2023. Association for Computational Linguistics 2023 - Frontmatter.

- Tingfeng Cao, Chengyu Wang, Bingyan Liu, Ziheng Wu, Jinhui Zhu, Jun Huang:
BeautifulPrompt: Towards Automatic Prompt Engineering for Text-to-Image Synthesis. 1-11 - Chenhui Mao, Xiexiong Lin, Xin Jin, Xin Zhang:

Enhancing Language Model with Unit Test Techniques for Efficient Regular Expression Generation. 12-19 - Takuma Udagawa, Aashka Trivedi, Michele Merler, Bishwaranjan Bhattacharjee:

A Comparative Analysis of Task-Agnostic Distillation Methods for Compressing Transformer Language Models. 20-31 - Tong Zhang, Junhong Liu, Chen Huang, Jia Liu, Hongru Liang, Zujie Wen, Wenqiang Lei:

Towards Effective Automatic Debt Collection with Persona Awareness. 32-45 - Nidhi Tiwari, Sneha Kola, Milos Milunovic, Si-qing Chen, Marjan Slavkovski:

Gatekeeper to save COGS and improve efficiency of Text Prediction. 46-53 - Nathan Brown, Ashton Williamson, Tahj Anderson, Logan Lawrence:

Efficient Transformer Knowledge Distillation: A Performance Review. 54-65 - Changzhen Ji, Yating Zhang, Adam Jatowt, Haipang Wu:

CDD: A Large Scale Dataset for Legal Intelligence Research. 66-73 - Noé Tits:

MUST&P-SRL: Multi-lingual and Unified Syllabification in Text and Phonetic Domains for Speech Representation Learning. 74-82 - Masha Belyi, Charlotte Dzialo, Chaitanya Dwivedi, Prajit Muppidi, Kanna Shimizu:

Personalized Dense Retrieval on Global Index for Voice-enabled Conversational Systems. 83-92 - Fengjun Wang, Moran Beladev, Ofri Kleinfeld, Elina Frayerman, Tal Shachar, Eran Fainman, Karen Lastmann Assaraf, Sarai Mizrachi, Benjamin Wang:

Text2Topic: Multi-Label Text Classification System for Efficient Topic Detection in User Generated Content with Zero-Shot Capabilities. 93-103 - Kee Kiat Koo, Ashutosh Joshi, Nishaanth Reddy, Karim Bouyarmane, Ismail B. Tutar, Vaclav Petricek, Changhe Yuan:

Deep Metric Learning to Hierarchically Rank - An Application in Product Retrieval. 104-112 - Youngja Park, Weiqiu You:

A Pretrained Language Model for Cyber Threat Intelligence. 113-122 - Rong Tian, Zijing Zhao, Weijie Liu, Haoyan Liu, Weiquan Mao, Zhe Zhao, Kan Zhou:
SAMP: A Model Inference Toolkit of Post-Training Quantization for Text Processing via Self-Adaptive Mixed-Precision. 123-130 - Sanjay Agrawal, Vivek Sembium, Ankith M. S:

KD-Boost: Boosting Real-Time Semantic Matching in E-commerce with Knowledge Distillation. 131-141 - Jingfen Zhang, Xuan Guo, Sravan Bodapati, Christopher Potts:

Multi-teacher Distillation for Multilingual Spelling Correction. 142-151 - Wei-Te Chen, Keiji Shinzato, Naoki Yoshinaga, Yandi Xia:

Does Named Entity Recognition Truly Not Scale Up to Real-world Product Attribute Extraction? 152-159 - Yilun Zhao, Haowei Zhang, Shengyun Si, Linyong Nan, Xiangru Tang, Arman Cohan:

Investigating Table-to-Text Generation Capabilities of Large Language Models in Real-World Information Seeking Scenarios. 160-175 - Tongxin Hu, Zhuang Li, Xin Jin, Lizhen Qu, Xin Zhang:

TMID: A Comprehensive Real-world Dataset for Trademark Infringement Detection in E-Commerce. 176-184 - Zhengyuan Liu, Siti Umairah Md. Salleh, Hong Choon Oh, Pavitra Krishnaswamy, Nancy F. Chen:

Joint Dialogue Topic Segmentation and Categorization: A Case Study on Clinical Spoken Conversations. 185-193 - Junjie Wang, Yicheng Chen, Wangshu Zhang, Sen Hu, Teng Xu, Jing Zheng:

AdapterDistillation: Non-Destructive Task Composition with Knowledge Distillation. 194-201 - Yuqing Wang, Prashanth Vijayaraghavan, Ehsan Degan:

PROMINET: Prototype-based Multi-View Network for Interpretable Email Response Prediction. 202-215 - Justin Chiu:

Retrieval-Enhanced Dual Encoder Training for Product Matching. 216-222 - Jun-Yan He, Zhi-Qi Cheng, Chenyang Li, Jingdong Sun, Wangmeng Xiang, Xianhui Lin, Xiaoyang Kang, Zengke Jin, Yusen Hu, Bin Luo, Yifeng Geng, Xuansong Xie:

WordArt Designer: User-Driven Artistic Typography Synthesis using Large Language Models. 223-232 - Nobuhiro Kaji:

Lattice Path Edit Distance: A Romanization-aware Edit Distance for Extracting Misspelling-Correction Pairs from Japanese Search Query Logs. 233-242 - Pengzhi Gao, Liwen Zhang, Zhongjun He, Hua Wu, Haifeng Wang:

Learning Multilingual Sentence Representations with Cross-lingual Consistency Regularization. 243-262 - Josiane Van Dorpe, Zachary Yang, Nicolas Grenon-Godbout, Grégoire Winterstein:

Unveiling Identity Biases in Toxicity Detection : A Game-Focused Dataset and Reactivity Analysis Approach. 263-274 - Yucheng Lin, Tim Chang, Yaning Chang, Jianqiang Ma, Donghui Li, Ting Peng, Zang Li, Zhiyi Zhou, Feng Wang:

ORANGE: Text-video Retrieval via Watch-time-aware Heterogeneous Graph Contrastive Learning. 275-283 - Christopher Hidey, Sarthak Sarthak:

Compute-Efficient Churn Reduction for Conversational Agents. 284-293 - Fangkai Yang, Pu Zhao, Zezhong Wang, Lu Wang, Bo Qiao, Jue Zhang, Mohit Garg, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang:

Empower Large Language Model to Perform Better on Industrial Domain-Specific Question Answering. 294-312 - Dan Li, Zi Long Zhu, Janneke van de Loo, Agnes Masip Gomez, Vikrant Yadav, Georgios Tsatsaronis, Zubair Afzal:
Enhancing Extreme Multi-Label Text Classification: Addressing Challenges in Model, Data, and Evaluation. 313-321 - Chengcan Ye, Ting Peng, Tim Chang, Zhiyi Zhou, Feng Wang:

Query-aware Multi-modal based Ranking Relevance in Video Search. 322-330 - Jack Good, Jimit Majmudar, Christophe Dupuy, Jixuan Wang, Charith Peris, Clement Chung, Richard S. Zemel, Rahul Gupta:

Coordinated Replay Sample Selection for Continual Federated Learning. 331-342 - Md. Tahmid Rahman Laskar, Xue-Yong Fu, Cheng Chen, Shashi Bhushan TN:

Building Real-World Meeting Summarization Systems using Large Language Models: A Practical Perspective. 343-352 - Spurthi Amba Hombaiah, Tao Chen, Mingyang Zhang, Michael Bendersky, Marc Najork, Matt Colen, Sergey Levi, Vladimir Ofitserov, Tanvir Amin:
Creator Context for Tweet Recommendation. 353-363 - Tyler Vuong, Karel Mundnich, Dhanush Bekal, Veera Raghavendra Elluru, Srikanth Ronanki, Sravan Bodapati:

AdaBERT-CTC: Leveraging BERT-CTC for Text-Only Domain Adaptation in ASR. 364-371 - Denis Kochedykov, Fenglin Yin, Sreevidya Khatravath:

Conversing with databases: Practical Natural Language Querying. 372-379 - Bhaktipriya Radharapu, Kevin Robinson, Lora Aroyo, Preethi Lahoti:

AART: AI-Assisted Red-Teaming with Diverse Data Generation for New LLM-powered Applications. 380-395 - Dhruv Kumar, Vipul Raheja, Alice Kaiser-Schatzlein, Robyn Perry, Apurva Joshi, Justin Hugues-Nuger, Samuel Lou, Navid Chowdhury:

Speakerly: A Voice-based Writing Assistant for Text Composition. 396-407 - Xianzhi Li, Samuel Chan, Xiaodan Zhu, Yulong Pei, Zhiqiang Ma, Xiaomo Liu, Sameena Shah:

Are ChatGPT and GPT-4 General-Purpose Solvers for Financial Text Analytics? A Study on Several Typical Tasks. 408-422 - Zhongkai Sun, Zhengyang Zhao, Sixing Lu, Chengyuan Ma, Xiaohu Liu, Xing Fan, Wei Shen, Chenlei Guo:

CL-QR: Cross-Lingual Enhanced Query Reformulation for Multi-lingual Conversational AI Agents. 423-431 - Zhongkai Sun, Yingxue Zhou, Jie Hao, Xing Fan, Yanbin Lu, Chengyuan Ma, Wei Shen, Chenlei Guo:

Improving Contextual Query Rewrite for Conversational AI Agents through User-preference Feedback Learning. 432-439 - Bhavuk Singhal, Sindhuja Gopalan, Amrith Krishna, Malolan Chetlur:

Scaling Neural ITN for Numbers and Temporal Expressions in Tamil: Findings for an Agglutinative Low-resource Language. 440-450 - Gabrielle Cohn, Rishika Agarwal, Deepanshu Gupta, Siddharth Patwardhan:

EELBERT: Tiny Models through Dynamic Embeddings. 451-459 - Hasmot Ali, AKM Shahariar Azad Rabby, Md. Majedul Islam, A. k. m Mahamud, Nazmul Hasan, Fuad Rahman:

Gold Standard Bangla OCR Dataset: An In-Depth Look at Data Preprocessing and Annotation Processes. 460-470 - Zhenting Qi, Xiaoyu Tan, Shaojie Shi, Chao Qu, Yinghui Xu, Yuan Qi:

PILLOW: Enhancing Efficient Instruction Fine-tuning via Prompt Matching. 471-482 - Lilach Eden, Yoav Kantor, Matan Orbach, Yoav Katz, Noam Slonim, Roy Bar-Haim:

Welcome to the Real World: Efficient, Incremental and Scalable Key Point Analysis. 483-491 - Hadeel Saadany, Constantin Orasan:
Automatic Linking of Judgements to UK Supreme Court Hearings. 492-500 - Zhiping Wang, Peng Lin, Hainan Zhang, Hongshen Chen, Tianhao Li, Zhuoye Ding, Sulong Xu, Jinghe Hu:
Automatic Marketing Theme and Commodity Construction System for E-commerce. 501-508 - Shumpei Inoue, Minh-Tien Nguyen, Hiroki Mizokuchi, Tuan-Anh D. Nguyen, Huu-Hiep Nguyen, Dung Le:

Towards Safer Operations: An Expert-involved Dataset of High-Pressure Gas Incidents for Preventing Future Failures. 509-521 - Yuanzhou Yao, Zhao Zhang, Kaijia Yang, Huasheng Liang, Qiang Yan, Yongjun Xu:

An Auxiliary Task Boosted Multi-task Learning Method for Service Account Retrieval with Limited Human Annotation. 522-531 - Siyu An, Ye Liu, Haoyuan Peng, Di Yin:

VKIE: The Application of Key Information Extraction on Video Text. 532-540 - Varun Nathan, Ayush Kumar, Jithendra Vepa:

Investigating the Role and Impact of Disfluency on Summarization. 541-551 - Sandeep Sricharan Mukku, Manan Soni, Chetan Aggarwal, Jitenkumar Rana, Promod Yenigalla, Rashmi Patange, Shyam Mohan:
InsightNet : Structured Insight Mining from Customer Feedback. 552-566 - Karan Singla, Yeon-Jun Kim, Srinivas Bangalore:

E2E Spoken Entity Extraction for Virtual Agents. 567-574 - Ansel Blume, Nasser Zalmout, Heng Ji, Xian Li:

Generative Models for Product Attribute Extraction. 575-585 - Md. Rashad Al Hasan Rony, Christian Suess, Sinchana Ramakanth Bhat, Viju Sudhi, Julia Schneider, Maximilian Vogel, Roman Teucher, Ken E. Friedl, Soumya R. Sahoo:
CarExpert: Leveraging Large Language Models for In-Car Conversational Question Answering. 586-604 - Andrea Zugarini, Andrew Zamai, Marco Ernandes, Leonardo Rigutini:
BUSTER: a "BUSiness Transaction Entity Recognition" dataset. 605-611 - Leonidas Gee, Leonardo Rigutini, Marco Ernandes, Andrea Zugarini:
Multi-word Tokenization for Sequence Compression. 612-621 - Shangching Liu, Shengkun Wang, Tsungyao Chang, Wenqi Lin, Chung-Wei Hsiung, Yi-Chen Hsieh, Yu-Ping Cheng, Sian-Hong Luo, Jianwei Zhang:

JarviX: A LLM No code Platform for Tabular Data Analysis and Optimization. 622-630 - Sai Muralidhar Jayanthi, Devang Kulshreshtha, Saket Dingliwal, Srikanth Ronanki, Sravan Bodapati:

Retrieve and Copy: Scaling ASR Personalization to Large Catalogs. 631-639 - Leon Liyang Zhang, Jiarui Lu, Joel Ruben Antony Moniz, Aditya Kulkarni, Dhivya Piraviperumal, Tien Dung Tran, Nick Tzou, Hong Yu:

STEER: Semantic Turn Extension-Expansion Recognition for Voice Assistants. 640-649 - Xiaoyu Tan, Shaojie Shi, Xihe Qiu, Chao Qu, Zhenting Qi, Yinghui Xu, Yuan Qi:

Self-Criticism: Aligning Large Language Models with their Understanding of Helpfulness, Honesty, and Harmlessness. 650-662 - Besnik Fetahu, Zhiyu Chen, Oleg Rokhlenko, Shervin Malmasi:

InstructPTS: Instruction-Tuning LLMs for Product Title Summarization. 663-674 - Lei Wang, Songheng Zhang, Yun Wang, Ee-Peng Lim, Yong Wang:
LLM4Vis: Explainable Visualization Recommendation using ChatGPT. 675-692 - Kriti Aggarwal, Aditi Khandelwal, Kumar Tanmay, Owais Khan Mohammed, Qiang Liu, Monojit Choudhury, Hardik Hansrajbhai Chauhan, Subhojit Som, Vishrav Chaudhary, Saurabh Tiwary:

DUBLIN: Visual Document Understanding By Language-Image Network. 693-706 - Lijun Yu, Jin Miao, Xiaoyu Sun, Jiayi Chen, Alexander G. Hauptmann, Hanjun Dai, Wei Wei:
DocumentNet: Bridging the Data Gap in Document Pre-training. 707-722 - Jihyuk Kim, Minsoo Kim, Joonsuk Park, Seung-won Hwang:

Relevance-assisted Generation for Robust Zero-shot Retrieval. 723-731 - Aryan Jain, Jitenkumar Rana, Chetan Aggarwal:

Too much of product information : Don't worry, let's look for evidence! 732-738 - Xinli Yu, Zheng Chen, Yanbin Lu:

Harnessing LLMs for Temporal Data - A Study on Explainable Financial Time Series Forecasting. 739-753 - Minh Thuan Nguyen, Khanh-Tung Tran, Nhu-Van Nguyen, Xuan-Son Vu:
ViGPTQA - State-of-the-Art LLMs for Vietnamese Question Answering: System Overview, Core Models Training, and Evaluations. 754-764 - Jinkyung Jo, Dayeon Ki, Soyoung Yoon, Minjoon Seo:

An Integrated Search System for Korea Weather Data. 765-774 - Mingming Li, Chunyuan Yuan, Huimu Wang, Peng Wang, Jingwei Zhuo, Binbin Wang, Lin Liu, Sulong Xu:

Adaptive Hyper-parameter Learning for Deep Semantic Retrieval. 775-782 - Hojae Han, Yu Jin Kim, Byoungjip Kim, Youngwon Lee, Kyungjae Lee, Kyungmin Lee, Moontae Lee, Kyunghoon Bae, Seung-won Hwang:
On Sample-Efficient Code Generation. 783-791 - Zhoujun Cheng, Jungo Kasai, Tao Yu:

Batch Prompting: Efficient Inference with Large Language Model APIs. 792-810 - Zheng Chen, Ziyan Jiang, Fan Yang, Eunah Cho, Xing Fan, Xiaojiang Huang, Yanbin Lu, Aram Galstyan:

Graph Meets LLM: A Novel Approach to Collaborative Filtering for Robust Conversational Understanding. 811-819 - David Q. Sun, Artem Abzaliev, Hadas Kotek, Christopher Klein, Zidi Xiu, Jason D. Williams:

DELPHI: Data for Evaluating LLMs' Performance in Handling Controversial Issues. 820-827 - Saiful Haq, Ashutosh Sharma, Pushpak Bhattacharyya:

Angel: Enterprise Search System for the Non-Profit Industry. 828-835
