Readings

Machine Learning for Cybersecurity

IVD_method.jpg

Automated Vulnerability Detection in Source Code Using Minimum Intermediate Representation

Vulnerabilities are one of the root causes of network intrusion. An effective way to mitigate security threats is to discover and patch vulnerabilities before an attack. Traditional vulnerability detection methods rely on manual participation and incur a high false positive rate. Intelligent vulnerability detection methods suffer from long-term dependencies, out-of-vocabulary words, coarse detection granularity and a lack of vulnerable samples.
This paper proposes an automated and intelligent method for detecting vulnerabilities in source code, based on minimum intermediate representation learning.

More

MLOps Demystified

MLOps_Level1.png

As Machine Learning at an organization matures from research to applied enterprise solutions, there comes the need for automated Machine Learning operations that can efficiently handle the end-to-end ML Lifecycle.

The goal of level 1 MLOps (see figure) is to perform continuous training of the model by automating the entire machine learning pipeline, which in turn leads to continuous delivery of the prediction service. The underlying concept that enables continuous model training is data version control combined with efficient tracking of training and evaluation events.

DevOps Market to reach $15 billion by 2026

containeurization.jpg

The global DevOps market size is projected to reach $14,969.6 million by 2026, a compound annual growth rate of 19.1%, according to a Fortune Business Insights report. The report highlighted the scale of this increase: the market was worth only $3,708.1 million in 2018, so the 2026 projection is roughly four times (about 404% of) the 2018 figure over eight years. Containerization, PaaS (Platform as a Service) and hybrid cloud are three major enablers of DevOps growth.
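
The quoted growth rate can be checked against the two market sizes. The short Python sketch below is only an illustrative calculation using the figures quoted above; the function and variable names are ours and do not come from the report.

```python
# Figures quoted in the paragraph above, in USD millions.
start_value = 3708.1   # market size in 2018
end_value = 14969.6    # projected market size in 2026
years = 2026 - 2018    # eight annual compounding periods


def cagr(start, end, periods):
    """Compound annual growth rate between two values over `periods` years."""
    return (end / start) ** (1 / periods) - 1


growth_rate = cagr(start_value, end_value, years)
ratio = end_value / start_value

print(f"CAGR: {growth_rate:.1%}")
print(f"2026 value as a multiple of 2018: {ratio:.2f}x")
print(f"Total increase over the period: {ratio - 1:.0%}")
```

The computed rate agrees with the 19.1% quoted in the report, and the ratio shows that the 2026 projection is about four times the 2018 value.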

Agile Testing + DevOps = DevTestOps

DevTestOps.jpg

"DevOps is now really DevTestOps and for teams to be truly agile, test management is the vital link in the success of DevOps. You require TestOps to match the pace of DevOps and testing early and often — breaking the silos."

In fact, the World Quality Report 2019-2020, led by Capgemini, shows that 90% of US and 69% of Canadian survey participants reported increased investment in the QA and Test function over the past four years.

Read more

The Global Embedded Systems Market Expected to Grow +5% CAGR by 2024

STM_SoC.jpg

An embedded system is a combination of software and hardware which together facilitate the accurate functioning of a target device. The embedded systems market is expected to grow significantly from 2019 to 2024, owing to increasing consumer spending on smartphones, application-specific integrated circuits, high-speed operating system applications and technological advancement.

A recent Advance Market Analytics study segments the market by Type (Normal Phase HPLC and Reverse Phase HPLC), by Application (Automotive, Telecommunication, Healthcare, Industrial, Consumer Electronics and Military & Aerospace) and by major geographies with a country-level breakdown. According to this study, the global embedded systems market is expected to grow at a rate of 5.28% and may reach a market size of USD 536.2 million by 2024.

Read more

Distinct AI Techniques Bring Different Business Values

model_network.jpg

Machine learning and deep learning are often conflated by business decision makers. Machine Learning can involve a wide variety of techniques for building analytics models or decision engines that don't involve neural networks, the mechanism for deep learning. And there is a whole range of AI techniques outside of machine learning as well that can be applied to solve business problems.

Do you leverage these techniques or do you prefer computer vision and natural language processing applications to solve your business problems?

Read George Lawton's article in TechTarget

TESTAR test results extracted while executing MyThaiStar as web system under test

testar_thumb.png

Authors: Fernando Pastor Ricos and Tanja E. Vos from Universitat Politècnica de València

These TESTAR test result datasets were extracted with the TESTAR tool, using the MyThaiStar web application as the System Under Test (SUT). They serve as an example of results that can be generated automatically and loaded locally into the DECODER PKM, from the H2020 DECODER project.

TESTAR is an open source tool for automated testing through the graphical user interface (GUI), currently being developed by the Universitat Politècnica de València and the Open University of the Netherlands.

MyThaiStar is the reference application that Capgemini uses internally to promote best programming practices and the correct use of the latest technologies. It is developed with Devon Framework, the standard tool for development at the company. More...

An MLOps approach to bring models to production

MLOps.png

Machine Learning Open Studio and Model as a Service (MaaS) from Activeeon help data scientists and IT operations work together in an MLOps approach to bring ML models to production. Machine Learning Open Studio includes automatic data drift detection mechanisms and provides traceability and audit of model performance, so that a model can be retrained when necessary.

Only a small percentage of ML projects make it to production because of deployment complexity, lack of governance tools and many other reasons. Once in production, ML models often fail to adapt to changes in the environment and its dynamic data, which results in performance degradation.

To maintain the prediction accuracy of ML models in production, active monitoring of model performance is mandatory. Monitoring shows when a model should be retrained using the most recent data and the newest implementation techniques, then redeployed in production. More...
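
As an illustration of what such monitoring can look like, the Python sketch below tracks accuracy over a sliding window of recent predictions and flags the model for retraining when accuracy falls too far below a baseline. It is a minimal sketch under our own assumptions: the class name, the thresholds and the window size are hypothetical, and it does not represent how Machine Learning Open Studio implements drift detection.

```python
from collections import deque


class AccuracyMonitor:
    """Hypothetical monitor: accuracy over a sliding window of predictions."""

    def __init__(self, baseline_accuracy, window_size=100, tolerance=0.05):
        self.baseline_accuracy = baseline_accuracy  # accuracy measured at deployment
        self.tolerance = tolerance                  # accepted drop before retraining
        self.outcomes = deque(maxlen=window_size)   # True where a prediction was correct

    def record(self, prediction, actual):
        """Store whether one production prediction matched the observed label."""
        self.outcomes.append(prediction == actual)

    def current_accuracy(self):
        """Accuracy over the window, or None if nothing has been recorded yet."""
        if not self.outcomes:
            return None
        return sum(self.outcomes) / len(self.outcomes)

    def needs_retraining(self):
        """True once the window is full and accuracy dropped below the tolerance."""
        if len(self.outcomes) < self.outcomes.maxlen:
            return False
        return self.current_accuracy() < self.baseline_accuracy - self.tolerance


# Example: a model deployed at 90% accuracy now gets 7 of the last 10 right.
monitor = AccuracyMonitor(baseline_accuracy=0.90, window_size=10, tolerance=0.05)
for prediction, actual in [(1, 1)] * 7 + [(1, 0)] * 3:
    monitor.record(prediction, actual)

print(monitor.current_accuracy())
print(monitor.needs_retraining())
```

In practice the observed labels often arrive with a delay, which is one reason production systems also watch the input data distribution rather than accuracy alone.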

Algorithm and Data Structure Visualization

visualgo.jpg

Visualizations can help us understand how data structures and algorithms work. 

The visualgo.net website provides visualizations and animations of advanced algorithms. Most of them are discussed in 'Competitive Programming', co-authored by two brothers, Dr Steven Halim and Dr Felix Halim. Today, some of these advanced algorithm visualizations and animations can only be found in VisuAlgo.

An online quiz system has been added that allows students to test their knowledge of basic data structures and algorithms. It generates questions and checks the students' answers automatically.

Covid-19 infection in Italy: when AI provides vital insights

Covid-19_graph1.jpg

Using mathematical models and predictions, Gianluca Malato - a Data Scientist, fiction author and software developer - compared logistic and exponential models applied to the Covid-19 infection in Italy. Both models help to better understand the evolution of the infection. The data preparation and Python code are detailed in an article posted in Towards Data Science on 8 March 2020. At that time, the main projections - now checked regularly by this Covid-19 Italian infection collaborative research - were:

Clear Linux OS automates the creation of RPM packaging

clearlinux.png

Designed by Intel and open source contributors, the Clear Linux OS delivers a secure, hardware-optimized OS. Its updates ensure that software dependencies remain mutually compatible.

The autospec tool is used to assist with the automated creation and maintenance of RPM packaging in Clear Linux OS. Where a standard RPM build process using rpmbuild requires a tarball and .spec file to start, autospec requires only a tarball and package name to start.

Recent reviews confirm the performance and stability improvements of Clear Linux OS. However, software packaged in other formats for other Linux distributions is not guaranteed to work on Clear Linux OS and may be impacted by Clear Linux OS updates.

The Twelve-Factor App, a Methodology for Building Web Apps

12Factor.jpg

Suggested by the designers of the Heroku PaaS platform, the twelve-factor methodology can be applied to apps written in any programming language, and which use any combination of backing services (database, queue, memory cache, etc). It is aimed at building Software-as-a-Service apps that:

  1. Use declarative formats for setup automation, to minimize time and cost for new developers joining the project;
  2. Have a clean contract with the underlying operating system, offering maximum portability between execution environments;
  3. Are suitable for deployment on modern cloud platforms, obviating the need for servers and systems administration;
  4. Minimize divergence between development and production, enabling continuous deployment for maximum agility;
  5. And can scale up without significant changes to tooling, architecture, or development practices.
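
One practice the twelve-factor methodology recommends for keeping development and production close is to store configuration in environment variables (its "Config" factor). The Python sketch below illustrates the idea; the variable names (`DATABASE_URL`, `CACHE_URL`, `APP_DEBUG`), the default values and the `Settings` class are hypothetical and chosen only for this example.

```python
import os
from dataclasses import dataclass


@dataclass(frozen=True)
class Settings:
    """Hypothetical application settings resolved once at startup."""
    database_url: str
    cache_url: str
    debug: bool


def load_settings(env=os.environ):
    """Build Settings from a mapping of environment variables.

    The same code runs unchanged in development and production;
    only the environment differs between deployments.
    """
    return Settings(
        database_url=env.get("DATABASE_URL", "sqlite:///local.db"),
        cache_url=env.get("CACHE_URL", "memory://"),
        debug=env.get("APP_DEBUG", "false").lower() == "true",
    )


# Example with an explicit mapping standing in for a production environment.
settings = load_settings({
    "DATABASE_URL": "postgres://db.example/app",
    "APP_DEBUG": "true",
})
print(settings)
```

Passing the environment as a parameter keeps the function easy to test, while the default of `os.environ` matches how a deployed process would normally read its configuration.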

More about the Twelve-Factor App

A New Model-Based Approach for API Testing

load-testing-rest-api.png

Keeping Pace with Agile Development, Visualizing Complex Dependencies, and Orchestrating for Completeness of Testing are three good reasons to select a Model-Based approach for API testing, according to Collin Chau, a DevOps test expert. 

"With the proliferation and complexity in microservices development that the Internet of Things brings, development teams are struggling to embrace API testing for more effective QA testing in-sprint. Learn how a model-based testing approach makes the difference in your API tests."

Read Collin Chau's full article in Continuous Testing

NLP Search Paves the Way for Augmented Data Discovery

busby.jpg

Combining natural language understanding and natural language generation will result in dynamic, bi-directional human-machine communication that will take several forms: text, voice and images. In text and voice scenarios, the BI or analytics solution can converse with the user to render the desired result - regardless of data-related and query-related search complexity.

Data visualizations also will become more interactive, if not immersive, along the lines of Busby from Oblong Industries. This product focuses on immersive interfaces, not specifically BI or analytics. However, its concepts could have a ripple effect on how people interact with data and thus, augmented data discovery.

"I think the future of BI is no BI. Don't ask me to search and look for things anymore. Give me that piece of information when I need it and if I need it. Come to me when there's something I need to know", foresees Erick Brethenoux, senior director analyst at Gartner.

For more information, read Lisa Morgan's TechTarget article entitled "NLP makes augmented data discovery a reality in analytics"

Is BERT a Game Changer in NLP?

Google_office.jpg

BERT (Bidirectional Encoder Representations from Transformers) is an open-sourced NLP pre-training model developed by researchers at Google in 2018. It has inspired multiple NLP architectures, training approaches and language models, including Google’s Transformer-XL, OpenAI’s GPT-2, ERNIE 2.0, XLNet, and RoBERTa.

For instance, BERT is now used by Google Search to provide more relevant results. It can also be used to build smarter chatbots in conversational AI applications, according to Bharat S Raj.

More...

Site maintained by OW2