Publications
Publications by category in reverse chronological order. Generated by jekyll-scholar.
2025
- Medical foundation large language models for comprehensive text analysis and beyond. npj Digital Medicine, 2025
- ELAINE-medLLM: Lightweight English Japanese Chinese Trilingual Large Language Model for Bio-medical Domain. In Proceedings of the 31st International Conference on Computational Linguistics, 2025
- Proceedings of the Joint Workshop of the 9th Financial Technology and Natural Language Processing (FinNLP), the 6th Financial Narrative Processing (FNP), and the 1st Workshop on Large Language Models for Finance and Legal (LLMFinLegal). In Proceedings of the Joint Workshop of the 9th Financial Technology and Natural Language Processing (FinNLP), the 6th Financial Narrative Processing (FNP), and the 1st Workshop on Large Language Models for Finance and Legal (LLMFinLegal), 2025
- FinNLP-FNP-LLMFinLegal-2025 Shared Task: Financial Misinformation Detection Challenge Task. In Proceedings of the Joint Workshop of the 9th Financial Technology and Natural Language Processing (FinNLP), the 6th Financial Narrative Processing (FNP), and the 1st Workshop on Large Language Models for Finance and Legal (LLMFinLegal), 2025
- Retrieval-augmented Large Language Models for Financial Time Series Forecasting. arXiv preprint arXiv:2502.05878, 2025
- PH-LLM: Public Health Large Language Models for Infoveillance. medRxiv, 2025
- Fino1: On the Transferability of Reasoning Enhanced LLMs to Finance. arXiv preprint arXiv:2502.08127, 2025
- FLAG-Trader: Fusion LLM-Agent with Gradient-based Reinforcement Learning for Financial Trading. 2025
- Enhancing Financial Time-Series Forecasting with Retrieval-Augmented Large Language Models. arXiv e-prints, 2025
- Plutus: Benchmarking Large Language Models in Low-Resource Greek Finance. arXiv preprint arXiv:2502.18772, 2025
- Open FinLLM leaderboard: Towards financial AI readiness. arXiv preprint arXiv:2501.10963, 2025
2024
- The Lay Person’s Guide to Biomedicine: Orchestrating Large Language Models. arXiv preprint arXiv:2402.13498, 2024
- SuicidEmoji: Derived Emoji Dataset and Tasks for Suicide-Related Social Content. In Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval, 2024
- Supporting the working life exposome: Annotating occupational exposure for enhanced literature search. PLOS ONE, 2024
- Are Large Language Models True Healthcare Jacks-of-All-Trades? Benchmarking Across Health Professions Beyond Physician Exams. arXiv preprint arXiv:2406.11328, 2024
- Selective Preference Optimization via Token-Level Reward Function Estimation. arXiv preprint arXiv:2408.13518, 2024
- LitFM: A Retrieval Augmented Structure-aware Foundation Model For Citation Graphs. arXiv preprint arXiv:2409.12177, 2024
- Language Enhanced Model for Eye (LEME): An Open-Source Ophthalmology-Specific Large Language Model. arXiv preprint arXiv:2410.03740, 2024
- CDEMapper: Enhancing NIH Common Data Element Normalization using Large Language Models. arXiv preprint arXiv:2412.00491, 2024
- Temporal relation extraction with contrastive prototypical sampling. Knowledge-Based Systems, 2024
- Graph contrastive topic model. Expert Systems with Applications, 2024
- RAEmoLLM: Retrieval Augmented LLMs for Cross-Domain Misinformation Detection Using In-Context Learning based on Emotional Information. arXiv preprint arXiv:2406.11093, 2024
- Edge contrastive learning for link prediction. Information Processing & Management, 2024
- MetaAligner: Conditional Weak-to-Strong Correction for Generalizable Multi-Objective Alignment of Language Models. NeurIPS 2024, 2024
- HealMe: Harnessing Cognitive Reframing in Large Language Models for Psychotherapy. ACL 2024, 2024
- HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy Protection. NeurIPS Dataset and Benchmark Track 2024, 2024
- Dólares or dollars? Unraveling the bilingual prowess of financial LLMs between Spanish and English. In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
- No Language is an Island: Unifying Chinese and English in Financial Large Language Models, Instruction Data, and Benchmarks. 2024
- Finnlp-agentscen-2024 shared task: Financial challenges in large language models-finllms. In Proceedings of the Eighth Financial Technology and Natural Language Processing and the 1st Agent AI for Scenario Planning, 2024
- Fmdllama: Financial misinformation detection based on large language models. arXiv preprint arXiv:2409.16452, 2024
- AuditWen: An Open-Source Large Language Model for Audit. In China National Conference on Chinese Computational Linguistics, 2024
- Ucfe: A user-centric financial expertise benchmark for large language models. arXiv preprint arXiv:2410.14059, 2024
- INVESTORBENCH: A Benchmark for Financial Decision-Making Tasks with LLM-based Agent. arXiv preprint arXiv:2412.18174, 2024
- Overview of the biolaysumm 2024 shared task on the lay summarization of biomedical research articles. arXiv preprint arXiv:2408.08566, 2024
- MentaLLaMA: interpretable mental health analysis on social media with large language models. In WWW, 2024
- Back to the future: Towards explainable temporal reasoning with large language models. In WWW, 2024
- Factual consistency evaluation of summarization in the era of large language models. Expert Systems with Applications, 2024
- Advancing entity recognition in biomedicine via instruction tuning of large language models. Bioinformatics, 2024
- Emollms: A series of emotional large language models and annotation tools for comprehensive affective analysis. In Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2024
- FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making. In NeurIPS, 2024
- Finben: A holistic financial benchmark for large language models. NeurIPS, 2024
- Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications. 2024
2023
- Word grounded graph convolutional network. arXiv preprint arXiv:2305.06434, 2023
- O-114 Natural language processing as a tool for developing and updating job exposure matrices for chemical exposures in the general population. 2023
- A survey on biomedical text summarization with pre-trained language model. arXiv preprint arXiv:2304.08763, 2023
- Temporal Relation Extraction with Contrastive Prototypical Sampling. Available at SSRN 4482481, 2023
- Can Language Models Make Fun? A Case Study in Chinese Comical Crosstalk. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2023
- LongDocFACTScore: Evaluating the Factuality of Long Document Abstractive Summarisation. COLING 2024, 2023
- Mastering pair trading with risk-aware recurrent reinforcement learning. arXiv preprint arXiv:2304.00364, 2023
- Jingcheng Du, Yan Hu, Vipina Kuttichi Keloth, Xueqing Peng, Kalpana Raja, Rui Zhang, Zhiyong Lu, and Hua Xu. Large language models in biomedical natural language processing: benchmarks, baselines, and recommendations. arXiv preprint arXiv:2305.16326, 2023
- A systematic evaluation of large language models for biomedical natural language processing: benchmarks, baselines, and recommendations. arXiv preprint arXiv:2305.16326, 2023
- CitationSum: Citation-aware Graph Contrastive Learning for Scientific Paper Summarization. In WWW ’23: Proceedings of the ACM Web Conference 2023, 2023
- On the evaluations of ChatGPT and emotion-enhanced prompting for mental health analysis. EMNLP 2023, 2023
- Zero-shot temporal relation extraction with ChatGPT. In BioNLP@ACL 2023, 2023
- A Survey for Biomedical Text Summarization: From Pre-trained to Large Language Models. arXiv preprint arXiv:2304.08763, 2023
- Faithful AI in Medicine: A Systematic Review with Large Language Models and Beyond. In medRxiv, 2023
- Factreranker: Fact-guided reranker for faithful radiology report summarization. arXiv preprint arXiv:2303.08335, 2023
- A scoping review on multimodal deep learning in biomedical images and texts. Journal of Biomedical Informatics, 2023
- Knowledge-enhanced graph topic transformer for explainable biomedical text summarization. IEEE Journal of Biomedical and Health Informatics, 2023
- The wall street neophyte: A zero-shot analysis of ChatGPT over multimodal stock movement prediction challenges. arXiv preprint arXiv:2304.05351, 2023
- Pixiu: A comprehensive benchmark, instruction dataset and large language model for finance. NeurIPS, 2023
- Empowering many, biasing a few: Generalist credit scoring through large language models. arXiv preprint arXiv:2310.00566, 2023
- LAiW: a Chinese legal large language models benchmark. arXiv preprint arXiv:2310.05620, 2023
- Select and trade: Towards unified pair trading with hierarchical reinforcement learning. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2023
- Large language models in biomedical natural language processing: benchmarks, baselines, and recommendations. arXiv e-prints, 2023
2022
- DGR: Decomposition Graph Reconstruction for Question Understanding. In 2022 International Joint Conference on Neural Networks (IJCNN), 2022
- Pre-trained language models with domain knowledge for biomedical extractive summarization. Knowledge-Based Systems, 2022
- GenCompareSum: a hybrid unsupervised summarization method using salience. In Proceedings of the 21st Workshop on Biomedical Language Processing, 2022
- Gretel: Graph contrastive topic enhanced language model for long document extractive summarization. In COLING 2022, 2022
- Readability Controllable Biomedical Document Summarization. In Findings of the Association for Computational Linguistics: EMNLP 2022, 2022
- SMiLE: Schema-augmented multi-level contrastive learning for knowledge graph link prediction. arXiv preprint arXiv:2210.04870, 2022
2021
- Graph Relational Topic Model with Higher-order Graph Attention Auto-encoders. In Findings of ACL 2021, 2021
- Neural variational sparse topic model for sparse explainable text representation. Information Processing & Management, 2021
- Inductive topic variational graph auto-encoder for text classification. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, 2021
- Graph topic neural network for document representation. In WWW, 2021
- Graph neural collaborative topic model for citation recommendation. TOIS, 2021
- Pre-trained language models in biomedical domain: A systematic survey. ACM Computing Surveys, 2021
2020
- MLR: A Two-stage Conversational Query Rewriting Model with Multi-task Learning. arXiv preprint arXiv:2004.05812, 2020
- Msrenet: Multi-step reformulation for open-domain question answering. In Natural Language Processing and Chinese Computing: 9th CCF International Conference, NLPCC 2020, Zhengzhou, China, October 14–18, 2020, Proceedings, Part II 9, 2020
- DTC: transfer learning for commonsense machine comprehension. Neurocomputing, 2020
- A two-stage conversational query rewriting model with multi-task learning. In Companion Proceedings of the Web Conference 2020, 2020
- Research on topic models based on deep learning (in Chinese). Chinese Journal of Computers, 2020
- Unsupervised software repositories mining and its application to code search. Software: Practice and Experience, 2020
- Generation of topic evolution graphs from short text streams. Neurocomputing, 2020
- Neural joint attention code search over structure embeddings for software Q&A sites. Journal of Systems and Software, 2020
2019
- Joint knowledge representation model based on CNN with attention mechanism (in Chinese). Journal of Chinese Information Processing, 2019
- Fine-grained tomato disease recognition based on attention residual mechanism. 2019
- Discriminative regularization with conditional generative adversarial nets for semi-supervised learning. In 2019 International Joint Conference on Neural Networks (IJCNN), 2019
- Discriminative regularized deep generative models for semi-supervised learning. In 2019 IEEE International Conference on Data Mining (ICDM), 2019
- Fine-grained tomato disease recognition based on attention residual mechanism (in Chinese). Journal of South China Agricultural University, 2019
- Incorporating word embeddings into topic modeling of short text. Knowledge and Information Systems, 2019
- Fine-grained tomato disease recognition based on attention residual mechanism. Journal of South China Agricultural University, 2019
2018
- Joint knowledge representation model based on CNN with attention mechanism (in Chinese). In The 17th Chinese National Conference on Computational Linguistics (CCL 2018) and the Sixth International Symposium on Natural Language Processing based on Naturally Annotated Big Data (NLP-NABD 2018), 2018
- Extraction of pig contour based on fully convolutional networks. 2018
- Block Bayesian sparse topical coding. In 2018 IEEE 22nd International Conference on Computer Supported Cooperative Work in Design (CSCWD), 2018
- Topic-Net Conversation Model. In International Conference on Web Information Systems Engineering, 2018
- Extraction of pig contour based on fully convolutional networks. Journal of South China Agricultural University, 2018
- Extraction of pig contour based on fully convolutional networks (in Chinese). Journal of South China Agricultural University, 2018
- Neural sparse topical coding. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), 2018
- Bayesian sparse topical coding. IEEE Transactions on Knowledge and Data Engineering, 2018
2017
- Parallelization of massive textstream compression based on compressed sensing. ACM Transactions on Information Systems (TOIS), 2017
2016
- Sparse topical coding with sparse groups. 2016
- KPCA-WT: an efficient framework for high quality microblog extraction in time-frequency domain. In International Conference on Web-Age Information Management, 2016
- Sparse topical coding with sparse groups. In International Conference on Web-Age Information Management, 2016
- A survey of topic detection and tracking techniques for social media text (in Chinese). Journal of Wuhan University (Natural Science Edition), 2016
- Improving distant supervision of relation extraction with unsupervised methods. In International Conference on Web Information Systems Engineering, 2016
2015
- Coherent topic hierarchy: A strategy for topic evolutionary analysis on microblog feeds. In International Conference on Web-Age Information Management, 2015