Leveraging Small Software Engineering Data Sets with Pre-trained Neural Networks

Romain Robbes; Andrea Alexander Janes

doi:10.1109/ICSE-NIER.2019.00016

Back

Leveraging Small Software Engineering Data Sets with Pre-trained Neural Networks

Conference proceeding

Peer reviewed

Leveraging Small Software Engineering Data Sets with Pre-trained Neural Networks

Romain Robbes and Andrea Alexander Janes

2019 IEEE/ACM 41st International Conference on Software Engineering: New ideas and emerging results: ICSE-NIER 2019; 25-31 May 2019, Montréal, Canada; Proceedings, pp.29-32

41st IEEE/ACM International Conference on Software Engineering: New Ideas and Emerging Results, ICSE-NIER 2019 (Montreal, 25/05/2019 - 31/05/2019)

2019

DOI: https://doi.org/10.1109/ICSE-NIER.2019.00016

Handle:

https://hdl.handle.net/10863/24886

Abstract

Data sets

Deep learning

Transfer learning

Many software engineering data sets, particularly those that demand manual labelling for classification, are necessarily small. As a consequence, several recent software engineering papers have cast doubt on the effectiveness of deep neural networks for classification tasks, when applied to these data sets. We provide initial evidence that recent advances in Natural Language Processing, that allow neural networks to leverage large amount of unlabelled data in a pre-training phase, can significantly improve performance.

Files and links (1)

url

https://ieeexplore.ieee.org/abstract/document/8805726View

Details

Title: Leveraging Small Software Engineering Data Sets with Pre-trained Neural Networks
Creators: Romain Robbes
Andrea Alexander Janes - Free University of Bozen-Bolzano
Publication Details: 2019 IEEE/ACM 41st International Conference on Software Engineering: New ideas and emerging results: ICSE-NIER 2019; 25-31 May 2019, Montréal, Canada; Proceedings, pp.29-32
ISBN: 9781728117591
EISBN: 9781728117584
Conference: 41st IEEE/ACM International Conference on Software Engineering: New Ideas and Emerging Results, ICSE-NIER 2019 (Montreal, 25/05/2019 - 31/05/2019)
Publisher: IEEE Press
Piscataway, NJ
Number of pages: 4
Identifiers: 978-1-72811-759-1
(UNIBZ)30595553
991006414295901241
Web of Science ID: 000557879900008
Scopus ID: 2-s2.0-85072066773
Academic Unit: Faculty of Computer Science
Language: English
Resource Type: Conference proceeding
Author Names String: Robbes R, Janes A

Metrics

4 Record Views