Archive


Blog - Readings - posts for September 2021

Sep 27 2021

Facebook GSLM textless NLP

Facebook recently introduced the Generative Spoken Language Model (GSLM), which it presents under the banner of textless NLP.
The research team believes that GSLM can be an effective way to pre-train models for downstream tasks where little labelled or annotated data is available, such as spoken summarization, information retrieval, and sentiment analysis.
GSLM builds on recent breakthroughs in representation learning, which allow it to work directly from raw audio signals without any text or labels. According to Facebook, this opens the door to a new era of textless NLP applications for potentially every language spoken on Earth, even those with little or no text data. In addition, it enables the development of NLP models that capture the full range of expressivity of oral language.

Sep 16 2021

NLP-Based Source Code Analysis Tools

Tree Technology, a partner in the DECODER Project, posted an article about its recent R&D efforts.

Abstract: We have used Natural Language Processing (NLP) techniques in tools that aim to support and improve software development and software quality processes for the Java and C/C++ languages.

The use of complex models has increased performance in many common NLP tasks, such as named entity recognition, text classification, summarisation and translation, among others. In addition, transfer learning has become an attractive option when little labelled data is available and knowledge learnt from one problem can be applied to a new but related task. In this context, our two NLP-based source code analysis tools, Variable Misuse and Code Summarisation, have been conceived by and for software developers.
