Biography
I am a postdoctoral research fellow at the National University of Singapore (NUS), working with Prof. See-Kiong Ng. Before that, I received the doctoral degree in Computer Science from National University of Defense Technology (NUDT) in 2022, and received the bachelor degree in Software Engineering from Shandong University (SDU) in 2014.
Research Interests
I have a broad interest in Natural Language Processing, including Large Language Models, Text Generation, Information Retrieval, (Multi-modal) Information Extraction, Dialogue, and other NLP research.
Work Experience
-
2023.3 - present, Research Fellow, National University of Singapore
working with Prof. See-Kiong Ng.
-
2022.12 - 2023.2, Research Scientist, Haihe Laboratory
Education
-
Ph.D, 2018 - 2022, supervisor: Shen Changxiang (Academician of Chinese Academy of Engineering), College of Computer, National University of Defense Technology (NUDT)
-
M.S., 2015 - 2018, Institute of Computer Application, China Academy of Engineering Physics (CAEP)
-
B.Eng., 2010 - 2014, School of Software, Shandong University (SDU)
Scholarships, Awards, and Honors
NUDT Outstanding Graduates (Ph.D) —— 2022 (1/34)
NUDT Outstanding Postgraduates (Ph.D) —— 2019, 2020, 2022
China National Scholarship (Ph.D) —— 2020 (1/246)
CASC First-Class Scholarship (Ph.D) —— 2020 (1/246)
CAEP Special Scholarship (Master) —— 2018 (8/546)
Beijing Outstanding Graduates (Master) —— 2018
CAEP Outstanding Graduates (Master) —— 2018
Publications (Sep. 2018 - Current)
-
Chain-of-Thought Improves Text Generation with Citations in Large Language Models.
Bin Ji, Huijun Liu, Mingzhe Du, See-Kiong Ng.
The 38th Annual AAAI Conference on Artificial Intelligence, 2023 (AAAI'24; CCF-A).
-
A Context-Aware Approach for Textual Adversarial Attack through Probability Difference Guided Beam Search.
Huijun Liu#, Bin Ji#, Jie Yu, Shasha Li, Jun Ma.
IEEE Transactions on Knowledge and Data Engineering, 2023 (TKDE; CCF-A, JCR Q1, CAS Ranking Q2), Co-first author.
-
Span-based Joint Entity and Relation Extraction Augmented with Sequence Tagging Mechanism.
Bin Ji, Shasha Li, Hao Xu, Jie Yu, Jun Ma, Huijun Liu.
Science China-Information Sciences, 2022 (SCIS; CCF-A, JCR Q1, CAS Ranking Q2).
-
Few-shot Named Entity Recognition with Entity-level Prototypical Network Enhanced by Dispersedly Distributed Prototypes.
Bin Ji, Shasha Li, Shaoduo Gan, Jie Yu, Jun Ma, Huijun Liu, Jing Yang.
Proceedings of the 29th International Conference on Computational Linguistics, Oral, 2022 (COLING'22; CCF-B).
-
Span-based Joint Entity and Relation Extraction with Attention-based Span-specific and Contextual Semantic Representations.
Bin Ji, Jie Yu, Shasha Li, Jun Ma, Qingbo Wu, Yusong Tan, Huijun Liu.
The 28th International Conference on Computational Linguistics, 2020 (COLING'20; CCF-B).
-
A Novel Bundling Learning Paradigm for Named Entity Recognition.
Bin Ji, Yalong Xie, Jie Yu, Shasha Li, Jun Ma, Yun Ji, Huijun Liu.
Knowledge-based Systems, 2022 (KBS; CAS Ranking Q1 Top, JCR Q1, CCF-C).
-
Research on Chinese Medical Named Entity Recognition based on Collaborative Cooperation of Multiple Neural Network Models.
Bin Ji, Shasha Li, Jie Yu, Jun Ma, Jintao Tang, Qingbo Wu, Yusong Tan, Huijun Liu, Yun Ji.
Journal of Biomedical Informatics, 2020 (JBI; JCR Q1, CCF-C).
-
A Two-Phase Paradigm for Joint Entity-Relation Extraction.
Bin Ji, Hao Xu, Jie Yu, Shasha Li, Jun Ma, Yuke Ji, Huijun Liu.
Computers, Materials & Continua, 2022 (CMC; JCR Q1).
-
A Hybrid Approach for Named Entity Recognition in Chinese Electronic Medical Record.
Bin Ji, Rui Liu, Shasha Li, Jie Yu, Qingbo Wu, Yusong Tan, Jiaju Wu.
BMC Medical Informatics and Decision Making, 2019 (BMC MIDM; JCR Q1).
-
A BiLSTM-CRF Method to Chinese Electronic Medical Record Named Entity Recognition.
Bin Ji, Rui Liu, Shasha Li, Jintao Tang, Jie Yu, Qian Li, WeiSang Xu.
2018 International Conference on Machine Learning and Natural Language Processing, 2018 (MLNLP'18; EI).
-
Textual Adversarial Attacks by Exchanging Text-self Words.
Huijun Liu, Jie Yu, Jun Ma, Shasha Li, Bin Ji∗, Zibo Yi, Miaomiao Li, Long Peng, Xiaodong Liu.
International Journal of Intelligent Systems, 2022 (IJIS; JCR Q1, CAS Ranking Q2, CCF-C).
-
From Static to Dynamic: A Continual Learning Framework for Large Language Models.
Mingzhe Du, Anh Tuan Luu, Bin Ji, See-Kiong Ng.
The 38th Annual AAAI Conference on Artificial Intelligence, 2023 (AAAI’24), Demo Track.
-
Non-Autoregressive Sentence Ordering.
Yi Bin, Wenhao Shi, Bin Ji, Jipeng Zhang, Yujuan Ding, Yang Yang.
The 2023 Conference on Empirical Methods in Natural Language Processing, 2023 (EMNLP'23), Findings.
-
Research Team Mining Algorithm based on Teacher-Student Relationship.
Shasha Li, Dong Liang, Jie Yu, Bin Ji∗, Jun Ma.
Journal of Computer Applications (计算机应用), 2020
-
Joint Extraction Method for Chinese Medical Events.
Jie Yu, Bin Ji, Lei Liu, Shasha Li, Jun Ma.
Computer Science (计算机科学), 2021 (Supervisor is the first author)
-
Dynamic Multi-View Fusion Mechanism For Chinese Relation Extraction.
Jing Yang, Bin Ji, Shasha Li, Jun Ma, Long Peng, Jie Yu.
Proceedings of the 27th Pacific-Asia Conference on Knowkedge Discovery and Data Mining, 2023 (PAKDD'23, CCF-C).
-
SciCN: A Scientific Dataset for Chinese Named Entity Recognition.
Jing Yang, Bin Ji, Shasha Li, Jun Ma, Jie Yu.
Computers, Materials & Continua, 2022 (CMC, SCI).
-
QAE: A Hard-Label Textual Attack Considering the Comprehensive Quality of Adversarial Examples.
Miaomiao Li, Jie Yu, Jun Ma, Shasha Li, Huijun Liu, Mengxue Du, Bin Ji.
The 12th CCF International Conference on Natural Language Processing and Chinese Computing, 2023 (NLPCC'23, CCF-C).
-
Learning Well-Separated and Representative Prototypes for Few-Shot Event Detection.
Xintong Zhang, Shasha Li, Bin Ji, Ting Wang.
The 12th CCF International Conference on Natural Language Processing and Chinese Computing, 2023 (NLPCC'23, CCF-C).
-
An Advanced ICD-9 Terminology Standardization Method based on BERT and Text Similarity.
Yijia Liu, Bin Ji, Jie Yu, Yusong Tan, Jun Ma, Qingbo Wu.
The 16th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery, 2020 (FSKD'20).
-
Topic-Grained Text Representation-based Model for Document Retrieval.
Mengxue Du, Shasha Li, Jie Yu, Jun Ma, Bin Ji, Huijun Liu, Wuhang Lin, Zibo Yi.
The 31st International Conference on Artificial Neural Networks, 2022 (ICANN'22, CCF-C).
-
SummScore: A Comprehensive Evaluation Metric for Summary Quality based on Cross-Encoder.
Wuhang Lin, Shasha Li, Chen Zhang, Bin Ji, Jie Yu, Jun Ma, Zibo Yi.
The 6th APWeb-WAIM International Joint Conference on Web and Big Data, 2022 (APWeb-WAIM'22; CCF-C).
-
A Unified Summarization Model with Semantic Guide and Keyword Coverage Mechanism.
Wuhang Lin, Jianling Li, Zibo Yi, Bin Ji, Shasha Li, Jie Yu, Jun Ma.
The 31st International Conference on Artificial Neural Networks, Oral, 2021 (ICANN'21; CCF-C).
-
A Self-Attention based Neural Architecture for Chinese Medical Named Entity Recognition.
Qian Wan, Jie Liu, Luona Wei, Bin Ji.
Mathematical Biosciences and Engineering, 2020 (MBE; SCI).
-
Span Classification Based Model for Clinical Concept Extraction.
Yongtao Tang, Jie Yu, Shasha Li, Bin Ji, Yusong Tan, Qingbo Wu.
The 16th International Conference on Natural Computation, Fuzzy Systems and Knowledge Discovery, 2020 (FSKD'20).
* denotes the corresponding author.
Academic Services
-
Conference PC & Reviewer:
2023: ACL, AISTATS, EMNLP
2022: EMNLP, AAAI
2021: AAAI
2020: ICKG
-
Journal Reviewer:
2023: CAAI Transactions on Intelligence Technology, ACM Transactions on Asian and Low-Resource Language Information Processing, IEEE Transactions on Neural Networks and Learning Systems, Knowledge-based Systems
2022: Knowledge-based Systems (KBS), BMC Bioinformatics, Information Sciences (IS), IEEE Systems Journal, Computer Sciences, Materials & Continua (CMC), Security and Communication Networks
2021: Expert Systems with Applications, Information Processing & Management (IPM), Computer Sciences, Computer, Materials & Continua (CMC)
2020: Intelligent Automation & Soft Computing (IASC)
Teaching Assistantships
Talks
-
Few-shot Named Entity Recognition with Entity-level Prototypical Network Enhanced by Dispersedly Distributed Prototypes.
The 29th International Conference on Computational Linguistics (COLING), 2022.
Oral presentation
-
Span-based Joint Entity and Relation Extraction with Attention-based Span-Specific and Contextual Semantic Representations.
The 28th International Conference on Computational Linguistics (COLING), 2022.
Oral presentation
-
A Joint Extraction Method for Chinese Medical Events.
The 14th China Conference on Knowledge Graph and Semantic Computing (CCKS), 2020.
Oral presentation
-
A Multi Neural Networks based Approach to Complex Chinese Medical Named Entity Recognition.
The 13th China Conference on Knowledge Graph and Semantic Computing (CCKS), 2019.
Oral presentation
Research Grants
Competitions
-
The Academic Competition of Chinese Medical Entity and Event Extraction (面向中文电子病历的医疗实体及事件抽取 - 医疗事件抽取), in CCKS, 2020.
Team Name: LHJB
Bin Ji, Huijun Liu, Haiwen Chen, Wuhang Lin, Junzhan Zhang, Qian Wan.
Rank 3rd out of 28 teams
-
The Academic Competition of Chinese Medical Named Entity Recognition (面向中文电子病历的命名实体识别 - 医疗实体及属性抽取(跨院迁移)) in CCKS, 2019.
Team Name: NUDT-YH
Bin Ji, Shan Zhao, Qian Wan, Huijun Liu.
Rank 1st out of 16 teams
-
The Academic Competition of Chinese Medical Event Extraction (面向中文电子病历的医疗事件抽取) in CHIP, 2018.
Bin Ji
Rank 2nd out of 33 teams
-
The Academic Competition of Chinese Medical Named Entity Recognition (面向中文电子病历的命名实体识别) in CCKS, 2018.
Team Name: NUDT-IBDL
Bin Ji, Weisang Xu.
Rank 6th out of 50 teams
Projects
-
Research on Few-shot Learning Models for Information Extraction (面向信息抽取的小样本学习模型研究), 2022 - Current.
Supported by Hunan Provincial Natural Science Foundation, Grant No. 2022JJ30668
co-Principal Investigator
Introduction: Existing prototypical networks for few-shot NER suffer from estimated label dependency and closely distributed
prototypes. To address them, I first conduct research on entity-level prototypes and their dispersedly distributed
representations, and then I propose an entity-level prototypical network -- EP-Net.
-
Research on Key Techniques of Dissertation Quality Assessment (学位论文质量评估关键技术研究), Summer, Fall 2022.
Supported by National Key Research and Development Program
co-Principal Investigator
Introduction: I first conduct research on an image classification and OCR based approach to extract critical metadata from
dissertations that are in PDF format, and then I investigate using them to assess
the dissertation quality with XGBoost.
-
Research on Key Techniques of Text-Oriented Neural Network Attack and Defense (面向文本的神经网络攻击与防御关键技术研究), Spring, 2022.
Supported by Hunan Provincial Natural Science Foundation, Grant No. 2022JJ30046
Participant
Introduction: I participate in the research of a more context-aware approach for textual adversarial attacks, and I make a
contribution to the design of using probability difference and beam search to improve the attack efficiency.
-
Research on Distantly Supervision based Annotation for Heterogeneous Few-shot Data (面向异构含错小样本的远监督数据标注研究), Fall 2020.
Participant
Introduction: I participate in the research of two distantly supervision-based data annotation approaches. The first is the
entity pair match-based approach, and the second is the entity replacement-based approach.
©Bin Ji.  Last update: March, 2023.