The NLP4NLP Corpus
The article "Rediscovering 15 Years of Discoveries in Language Resources and Evaluation: The LREC Anthology Analysis", first published at LREC 2014 on the occasion of LREC's 15th anniversary, has been extended into the production and analysis of the NLP4NLP corpus. The corpus contains close to 65,000 articles published in major speech and language processing conferences and journals over 50 years (1965-2015), and the analysis covers various aspects (publication, collaboration, citation, innovation, plagiarism, …).
The results of those analyses have recently been assembled in a series of two papers published in the special issue "Mining Scientific Papers: NLP-enhanced Bibliometrics" of the journal Frontiers in Research Metrics and Analytics:
- The NLP4NLP Corpus (I): https://www.frontiersin.org/articles/10.3389/frma.2018.00036/full
- The NLP4NLP Corpus (II): https://www.frontiersin.org/articles/10.3389/frma.2018.00037/full