Adapting General-Purpose Embedding Models to Private Datasets Using Keyword-based Retrieval.
Yubai Wei, Jiale Han, Yi Yang. Findings of Annual Meeting of the Association for Computational Linguistics (ACL Findings). Long Paper. 2025.
Know the Unknown: An Uncertainty-Sensitive Method for LLM Instruction Tuning.
Jiaqi Li, Yixuan Tang, Yi Yang. Findings of Annual Meeting of the Association for Computational Linguistics (ACL Findings). Long Paper. 2025.
Bias A-head? Analyzing Bias in Transformer-Based Language Model Attention Heads.
Hanyu Duan, Yi Yang, Ahmed Abbasi, John P. Lalor, Kar Yan Tam. Fifth Workshop on Trustworthy Natural Language Processing (TrustNLP). 2025.
Predicting Practically? Domain Generalization for Predictive Analytics in Real-world Environments.
Hanyu Duan, Yi Yang, Ahmed Abbasi, Kar Yan Tam. Major Revision.
Forget Me If You Can: Auditing User Data Revocation in Recommendation Systems.
Zhihao Zhu, Yi Yang, Jin Chen, Yangyang Fan, Defu Lian. Major Revision.
PatentsNET: A Graph Representation Learning Approach for Predicting Patent Economic Value and Litigation Risk.
Zhitao Yin, Yi Yang, Zhuoyi Peng, Zhenghan Zhang. Major Revision.
Conversation Modeling: Analyzing Earnings Conference Calls for Financial Risk Prediction.
Yanlong Huang, Yi Yang, Yangyang Fan, Kunpeng Zhang. Major Revision.
Reading Between the Lines: A Text-based Deep Learning Approach for Understanding Company Dynamics.
Hanyu Duan, Yi Yang, Kar Yan Tam. Major Revision.
Hypergraph Modeling of Supply Chains: Unveiling the Impact of High-Order and Temporal Dynamics on Credit Risk Prediction.
Jialei Han, Yi Yang, Yangyang Fan, Zhongju Zhang. Under Review
TDDBench: A Benchmark for Training Data Detection.
Zhihao Zhu, Yi Yang, Defu Lian. International Conference on Learning Representations (ICLR). 2025.
Adversarial Mixup Unlearning.
Zhuoyi Peng, Yixuan Tang, Yi Yang. International Conference on Learning Representations (ICLR). 2025.
Divide-and-Contrast: A Text-based Method for Firm Market Risk Prediction.
Yi He, Yi Yang, Defu Lian, Kunpeng Zhang. INFORMS Journal on Computing. 2025.
Efficient Multi-Expert Tabular Language Model for Banking.
Yue Guo, Wentao Zhang, Xiaojun Zhang, Vincent W Zheng, Yi Yang. SIGKDD Conference on Knowledge Discovery and Data Mining - Applied Data Science Track (KDD). 2025.
Hierarchical Deep Document Model.
Yi Yang, John Lalor, Ahmed Abbasi, Daniel Zeng. Transactions on Knowledge and Data Engineering (TKDE). 2024.
Exploring the Relationship between In-Context Learning and Instruction Tuning.
Hanyu Duan, Yixuan Tang, Yi Yang, Ahmed Abbasi, Kar Yan Tam. Findings of Conference on Empirical Methods in Natural Language Processing (EMNLP Findings). Long Paper. 2024.
MultiHop-RAG: Benchmarking Retrieval-Augmented Generation for Multi-Hop Queries.
Yixuan Tang, Yi Yang. Conference on Language Modeling (COLM). 2024.
EconNLI: Evaluating Large Language Models on Economics Reasoning.
Yue Guo, Yi Yang. Findings of Annual Meeting of the Association for Computational Linguistics (ACL Findings). Long Paper. 2024.
Connecting the Dots: Inferring Patent Phrase Similarity with Retrieved Phrase Graphs.
Zhuoyi Peng, Yi Yang. Findings of Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL Findings). Long Paper. 2024.
TM-OKC: An Unsupervised Topic Model for Text in Online Knowledge Communities.
Dongcheng Zhang, Kunpeng Zhang, Yi Yang, and David Schweidel. MIS Quarterly, 48, no. 3 (2024): 931-978.
Benchmarking Intersectional Biases in NLP.
John Lalor, Yi Yang, Kendall Smith, Nicole Forsgren, Ahmed Abbasi. Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL). Long Paper. 2022.
Interpreting Twitter User Geolocation.
Ting Zhong, Tianliang Wang, Fan Zhou, Goce Trajcevski, Kunpeng Zhang and Yi Yang. Annual Meeting of the Association for Computational Linguistics. (ACL). Short Paper. 2020.