Understanding Survey Paper Taxonomy about Large Language Models via Graph Representation Learning

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

As new research on Large Language Models (LLMs) continues, it is difficult to keep up with new research and models. To help researchers synthesize the new research many have written survey papers, but even those have become numerous. In this paper, we develop a method to automatically assign survey papers to a taxonomy. We collect the metadata of 144 LLM survey papers and explore three paradigms to classify papers within the taxonomy. Our work indicates that leveraging graph structure information on co-category graphs can significantly outperform the language models in two paradigms; pre-trained language models’ fine-tuning and zero-shot/few-shot classifications using LLMs. We find that our model surpasses an average human recognition level and that fine-tuning LLMs using weak labels generated by a smaller model, such as the GCN in this study, can be more effective than using ground-truth labels, revealing the potential of weak-to-strong generalization in the taxonomy classification task.

Original languageEnglish
Title of host publicationSDP 2024 - 4th Workshop on Scholarly Document Processing, Proceedings of the Workshop
EditorsTirthankar Ghosal, Amanpreet Singh, Anita de Waard, Philipp Mayr, Aakanksha Naik, Orion Weller, Yoonjoo Lee, Shannon Shen, Yanxia Qin
Pages58-69
Number of pages12
ISBN (Electronic)9798891761513
StatePublished - 2024
Event4th Workshop on Scholarly Document Processing, SDP 2024 at ACL 2024 - Bangkok, Thailand
Duration: 16 Aug 2024 → …

Publication series

NameSDP 2024 - 4th Workshop on Scholarly Document Processing, Proceedings of the Workshop

Conference

Conference4th Workshop on Scholarly Document Processing, SDP 2024 at ACL 2024
Country/TerritoryThailand
CityBangkok
Period16/08/24 → …

Fingerprint

Dive into the research topics of 'Understanding Survey Paper Taxonomy about Large Language Models via Graph Representation Learning'. Together they form a unique fingerprint.

Cite this