Airin: Machine Learning Models

A Marketing Story by Agency Fusion about Airin
2 Views | 0 High Fives
A
Airin
Improve User Experience

Challenge

Airin is a well-funded startup creating an AI expert system platform for enterprise IT customers. It augments Subject Matter Experts in developing and sharing problem statements, identifying the right questions for a given problem, and understanding the context around decision making. Although Airin has their own development team, they engaged Objective to help them develop machine learning models. The Problem GIVEN A CUSTOMER'S ARBITRARY PROBLEM STATEMENT, PREDICT THE MOST RELEVANT SET OF QUESTIONS When a user has a problem statement such as, “I’d like to move my on-premise server infrastructure to the cloud”, we wanted to see the platform recommend question sets related to that individual’s objectives. Our goal was to come up with a proof of concept demonstrating that this is actually feasible and that a natural language processing approach to the problem can produce results that are better than a simple full text search approach.

Solution

01 EXPLORATORY DATA ANALYSISWE ANALYZED 15,826 CONTENT ITEMS AND DEVELOPED A BASELINE FOR COMPARING RESULTS Understanding the relationships between question sets Before data can be used in machine learning, the data has to be explored to fully understand the nuances of its structure, its quality, and the relationships that might be represented within. Airin provided us with 126 question sets containing 15,826 content items. We analyzed these to understand the distribution of questions, statements, follow-up questions, and section headers. Developing a baseline Before training a machine learning model on the data, we needed to know what the baseline results were. There are two common approaches to setting a baseline in a classification problem: Choose the most common class and use its ratio in the data Use the statistical probability of guessing correctly by chance Both of these approaches yielded a very low baseline (3.7% and 0.8% accuracy). There was a lot of room to improve beyond either baseline! 02 ITERATIVE EXPERIMENTATIONWE DEVELOPED AND TRAINED A CUSTOM MACHINE LEARNING MODEL Transforming all text to numerical representation Building a classifier involving language data required us to convert the words into numbers. This process can be explained in four steps. Convert each word to a token Choose a token representation—for our Proof of Concept we used GloVe Convert tokens to numbers Combine tokens in a given text by averaging each element across the vectors Training a model, evaluating performance, recleaning, and iterating In order for data to be useful to a model that can be trusted, the data often needs to be manipulated and cleaned. We used T-SNE, Altair, and KMeans clustering to group, visualize, and find problems with the data. We then ran the data through various machine learning algorithms such as logistic regression, decision trees, and the random forest classifier to determine if the approach would improve the accuracy of our model. Several cycles of this step improved the ability of the models to produce reliable predictions in the desired 90% range.

Results

WE PROTOTYPED A SYSTEM THAT ALLOWED THE CLIENT TO EXPERIMENT WITH OUR MACHINE LEARNING MODEL In order to showcase our results to the client in a way that is usable, we created a prototype system using a four step process. Create a new Python class to wrap the transformation and training Train and save the ML model Create a basic web service using Flask (a minimalist Python framework) Make it portable with Docker This allowed the client to experiment with our machine learning models to truly learn what they are capable of doing and to see how they can apply our models in their product.

A
~14 Employees
$10,000 Min Budget
1 Stories
Become A Partner

Agency Fusion

Salt Lake City, Utah

Want fresh marketing stories in your inbox?

Get the weekly email that makes you a marketing genius.
Stay informed and inspired, for free.

Related Stories

Guide London
116% Increase in Site Visits
GUIDE LONDON
3 years ago
Lifting The Curtain on SEO Success.
76% Increase in Pageviews
For the Love of Your Local
3 years ago
5 years ago