MetaTPTrans: A Meta Learning Approach for Multilingual Code Representation Learning

  • Weiguo Pian
  • , Hanyu Peng
  • , Xunzhu Tang
  • , Tiezhu Sun
  • , Haoye Tian*
  • , Andrew Habib
  • , Jacques Klein
  • , Tegawendé F. Bissyandé
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference article in proceedingsScientificpeer-review

8 Citations (Scopus)

Abstract

Representation learning of source code is essential for applying machine learning to software engineering tasks. Learning code representation from a multilingual source code dataset has been shown to be more effective than learning from single-language datasets separately, since more training data from multilingual dataset improves the model’s ability to extract language-agnostic information from source code. However, existing multilingual training overlooks the language-specific information which is crucial for modeling source code across different programming languages, while only focusing on learning a unified model with shared parameters among different languages for language-agnostic information modeling. To address this problem, we propose MetaTPTrans, a meta learning approach for multilingual code representation learning. MetaTPTrans generates different parameters for the feature extractor according to the specific programming language type of the input code snippet, enabling the model to learn both language-agnostic and language-specific information with dynamic parameters in the feature extractor. We conduct experiments on the code summarization and code completion tasks to verify the effectiveness of our approach. The results demonstrate the superiority of our approach with significant improvements on state-of-the-art baselines.

Original languageEnglish
Title of host publicationProceedings of the AAAI Conference on Artificial Intelligence
EditorsBrian Williams, Yiling Chen, Jennifer Neville
PublisherAAAI Press
Pages5239-5247
Number of pages9
ISBN (Electronic)9781577358800
DOIs
Publication statusPublished - 27 Jun 2023
MoE publication typeA4 Conference publication
EventAAAI Conference on Artificial Intelligence - Walter E. Washington Convention Center, Washington, United States
Duration: 7 Feb 202314 Feb 2023
Conference number: 37
https://aaai-23.aaai.org/

Publication series

NameProceedings of the 37th AAAI Conference on Artificial Intelligence, AAAI 2023
Number4
Volume37

Conference

ConferenceAAAI Conference on Artificial Intelligence
Abbreviated titleAAAI
Country/TerritoryUnited States
CityWashington
Period07/02/202314/02/2023
Internet address

Fingerprint

Dive into the research topics of 'MetaTPTrans: A Meta Learning Approach for Multilingual Code Representation Learning'. Together they form a unique fingerprint.

Cite this