5 leading AI startups in MLops



In addition to the huge and growing demand for AI applications, there is a complementary thirst for the infrastructure and supporting software that enables AI applications. From data preparation and training to deployment, and beyond, many startups are coming into the field, MLops. Let’s take a look at some of the more interesting things that make AI initiatives more successful.

Weight and bias

Weight and Bias It has a significant presence in the field of machine learning, especially among data scientists who need a comprehensive and well-designed experimental follow-up service. First, W&B can be quickly integrated with almost any popular machine learning library (and it’s easy to add custom metrics).

Second, you can use as many W & B’s as you want – as a turbocharged version Tensor board.It can also be used as a way to control and report hyperparameter tuning, or as a collaboration center where all members of the l The data science team can see the results and replicate the experiments performed by other team members. For businesses, W&B can also be used as a governance and history platform, providing an audit trail that uses input data, transformation, and experimentation to build a model as it goes. as it moves from development to production.

Your data scientists probably already know W&B, and if they aren’t using it internally, they certainly want to be. If OpenAI, GitHub, Salesforce, Nvidia use W&B, why?


Seldon is another company that offers open-core products with additional business functionality. The open source component is Seldon Core. This is a cloud native way to deploy models with advanced features like any model chain for inference, Canary deployment, A / B testing, multi-arm bandit and support frameworks such as: is. TensorFlow, Scikit-learn and XGBoost You can use it immediately. Seldon also provides an open source Alibi library for testing and explaining machine learning models. This library contains different ways to get an overview of how the model predictions are formed.

An interesting feature of Seldon Core is its extremely flexible compatibility with the tech stack. You can use Seldon Core alone or insert it into a slot. Deployment of Kubeflow. You can deploy the model created through MLFlow, or you can use the Nvidia Triton inference server. Therefore, there are different ways to take advantage of Seldon to get the maximum payout.

For enterprises, Seldon Deploy offers a comprehensive suite of tools for model governance, including dashboards, audited workflows, and performance monitoring. This offer is intended for data scientists, SREs, managers and auditors. With Seldon’s focus on auditing and explanation, it’s no surprise that this UK-based startup, where Barclays and Capital One use the service, has hit the bank.

While there are many competitors in the template deployment space, Seldon offers a comprehensive set of features, with a very strong focus on deploying Kubernetes in its core offering, and businesses that need a more complete solution. Add a business to help you.

Pine cone / Zilliz

Vector search is now bright red. Vector research can revolutionize research, thanks to recent advances in machine learning in areas such as text, images and voice. For example, a search for “Kleenex” might return the organization selected by the retailer without the need for custom rules for synonym substitution. Vector integration Place the search query in the same area of ​​vector space. You can also use the exact same process to locate sounds and perform facial recognition.

[ Also on InfoWorld: 3 AI startups revolutionizing NLP ]

Today’s search engine software is often not optimized for vector searches, but the work continues. Elastic And Apache Lucene, And open source alternative hosts provide fast, large-scale vector search capabilities (for example). NMSLib, FAISS, Boring). In addition, many start-ups have sprung up, removing some of the burden of setting up and maintaining search engine vectors of poor operating services. Pinecone and Zilliz are two startups that offer vector research to businesses.

Pinecone is a pure SaaS offering that uploads the integration generated by the machine learning model to the server and submits a request through the API. All aspects of hosting including security, scaling, speed, and other operational issues are handled by the Pinecone team. This means that you can launch your similarity search engine in a matter of hours.

But Ground squirrel managed cloud solution is coming soon, in the form of Zillow Cloud. The company takes an open-based approach using an open source library called. Tobi .. Milvus is a vector search engine with an expressive and easy to use API that encompasses commonly used libraries such as NMSLib and FAISS and can be used by developers to create and maintain their own vector indexes. Provides simple deployment.


Grid.ai This is the idea of ​​the people behind PyTorch Lightning, a popular high-level framework built on PyTorch that sums up many PyTorch standards and can be easily trained on one or 1000 GPUs using two parameter switches. . Grid.ai integrates and executes PyTorch Lightning simplifications and trains models using temporary GPU resources, as seamlessly as data scientists execute code locally. I could do it.

Do you want to run hyperparameter scans on 200 GPUs at once? Grid.ai manages all provisioning (and decommissioning) of infrastructure resources in the background, ensuring datasets are optimized for large-scale use, and providing metric reports. I go. All of these are easily grouped together. -Use WebUI. You can also use Grid.ai to launch an instance in the console or by connecting to a Jupyter Notebook for interactive development.

Grid.ai’s efforts to simplify large-scale model training are useful for companies that need to run training sessions on a regular basis that occupy more than 100 GPUs at a time, but their customer base is still high. I do not know. Still, if you need a streamlined training pipeline for data scientists that minimizes cloud costs, you need to look at Grid.ai.

Data robot

DataRobot We want to own the AI ​​lifecycle of a business, from data readiness to production deployment. Make a good pitch. DataRobot’s data preparation pipeline includes all the features related to the web user interface that should simplify data enrichment. In addition, it includes features to help users (beginners or professionals) by profiling, grouping and automatically cleaning up data previously. He entered the model.

DataRobot Machine Learning Ability to train a splint model against a target. This allows you to choose one of the best performing generative models or your own model uploaded to the platform. When it comes to deployment, the platform’s built-in MLops module tracks everything from availability to drifting data over time, so you can see your model’s performance at a glance. moment. There is also a feature called Humble AI which allows you to place additional guardrails on your model in case a low probability event occurs during prediction. Of course, these can also be tracked via the MLops module.

Slightly different from most of the other startups on this list, DataRobot installs on its own datacenter and bare metal in Hadoop clusters and deploys on private and managed cloud services. Indeed, the enterprise AI platform fights in a positive way and serves customers, from fast-growing startups to established Fortune 500 companies.

MLops is one of the hottest areas in AI right now. The need for accelerators, platforms, management and monitoring will increase as more companies enter the AI ​​space. If you’re a part of the AI ​​gold rush, you can count on these five startups to provide pickaxes and axes.

Copyright © 2021 IDG Communications, Inc.



About Author

Comments are closed.