Hive AI
About Hive AutoML
About Hive AutoML
AutoML lets you easily manage your datasets, fine-tune custom models on that data, and deploy your custom models for inference.
Our no-code solution supports Hive’s proprietary models as well as popular open-source models like Llama 3.1 and DeBERTa. AutoML offers models across a range of use cases including classification, sentiment analysis, moderation, and chat.
AutoML lets you easily manage your datasets, fine-tune custom models on that data, and deploy your custom models for inference.
Our no-code solution supports Hive’s proprietary models as well as popular open-source models like Llama 3.1 and DeBERTa. AutoML offers models across a range of use cases including classification, sentiment analysis, moderation, and chat.
Manage Complex Data
Manage Complex Data
Data is the foundational building block for machine learning models. AutoML makes dataset management simple.
AutoML datasets make it simple to prepare your data for model fine-tuning. Our flexible platform accepts structured and unstructured data in several popular formats. You can upload data with existing labels or use our dataset management tools to add labels to uncategorized data.
AutoML also offers dataset functions and other popular tools to automate key data workflows. Functions can help you automate data labeling, set up an embedding pipeline for retrieval-augmented generation (RAG), and so much more.
Fine-Tune and Evaluate Custom Models
Fine-Tune and Evaluate Custom Models
AutoML offers several text classification, image classification, and large language models for custom fine-tuning. We offer default training options that work well for most objectives. Users are also free to customize dozens of hyperparameters to better suit their needs or just to experiment with different training configurations.
Several key metrics like balanced accuracy, precision, and recall are available during and after training to help you measure and evaluate your model’s performance.
Once you’ve trained and evaluated your model, deploy it to Hive Models with the click of a button. Your custom model will be available for inference and Moderation Dashboard integration through a Hive Models project, just like our pre-trained models.
AutoML offers several text classification, image classification, and large language models for custom fine-tuning. We offer default training options that work well for most objectives. Users are also free to customize dozens of hyperparameters to better suit their needs or just to experiment with different training configurations.
Several key metrics like balanced accuracy, precision, and recall are available during and after training to help you measure and evaluate your model’s performance.
Once you’ve trained and evaluated your model, deploy it to Hive Models with the click of a button. Your custom model will be available for inference and Moderation Dashboard integration through a Hive Models project, just like our pre-trained models.
Explore All Customizable Models
Explore All Customizable Models
Fine-tune a model to meet your specific needs with just a few clicks.
AutoML
Hive Text
Classification v3
Hive’s text classification model is able to interpret full sentences with linguistic subtleties across 30 different languages. It is a proprietary general-purpose text classification model trained by Hive’s Machine Learning team. Text Classification v3 is well-suited for most text classification tasks.
Text Classification
Hive Text
Moderation v3
Hive’s text moderation model is trained on a proprietary large corpus of labeled data across multiple domains, and is able to interpret full sentences with linguistic subtleties. The model detects undesirable content like sexual, bullying, spam, and more in 30 different languages. Fine-tuned versions of Hive Text Moderation v3 maintain pre-trained Hive moderation labels, so this model is best-suited for content moderation tasks.
Text Classification
DeBERTa v3
DeBERTa is a large text classification model. The DeBERTa v3 base model was pre-trained on English data. Though this model can be fine-tuned on any language, it is best suited for English-only datasets. DeBERTa performs well for sentiment analysis or very complex/nuanced classification.
Text Classification
Longformer
Longformer is a long-sequence transformer model. Its attention mechanism varies from similar transformer-based models, allowing it to process longer sequence lengths. Longformer is well-suited for classifying lengthy
text examples.Longformer is a long-sequence transformer model. Its attention mechanism varies from similar transformer-based models, allowing it to process longer sequence lengths. Longformer is well-suited for classifying lengthy text examples.
Text Classification
Simple usage based pricing so you only pay for what you use
Simple usage based pricing so you only pay for what you use
AutoML
Product
Pricing
Unit
Text
Text Moderation (Last Layer) | Contact Sales | |
Text Classification (Last Layer) | $4.00 | Per 1M Rows Seen |
Text Classification (LoRA) | $4.00 | Per 1M Rows Seen |
Text Classification (Full Fine Tuning) | $4.00 | Per 1M Rows Seen |
Text Classification (DeBERTa Last Layer) | $4.00 | Per 1M Rows Seen |
Text Classification (DeBERTa LoRA) | $4.00 | Per 1M Rows Seen |
Text Classification (DeBERTa Full Fine Tuning) | $4.00 | Per 1M Rows Seen |
Text Classification (Longformer Last Layer) | $4.00 | Per 1M Rows Seen |
Text Classification (Longformer Full Fine Tuning) | $4.00 | Per 1M Rows Seen |
Image
Image Moderation (Last Layer) | Contact Sales | |
Image Classification (Last Layer) | $40.00 | Per 1M Rows Seen |
Image Classification (Full Fine Tuning) | $40.00 | Per 1M Rows Seen |
LLM Chat
LLM Instruct 8B | $0.50 | Per 1M Tokens Seen |
LLM Instruct 70B | $3.00 | Per 1M Tokens Seen |
- Rows Seen refers to the number of Epochs ran multiplied by the number of rows of training and validation data sets unless otherwise specified. LLM training uses number of Tokens instead of rows.
- Text Last Layer trainings are billed only for training rows + validation dataset rows, without multiplying by the number of Epochs trained.
Text
Image
LLM Chat
- Rows Seen refers to the number of Epochs ran multiplied by the number of rows of training and validation data sets unless otherwise specified. LLM training uses number of Tokens instead of rows.
- Text Last Layer trainings are billed only for training rows + validation dataset rows, without multiplying by the number of Epochs trained.
Simple usage based pricing so you only pay for what you use
Simple usage based pricing so you only pay for what you use
AutoML
How customers use AutoML
How customers use AutoML
Custom moderation
Digital platforms can train models to meet their unique text, image, and video moderation needs.
Content categorization
Digital platforms use AutoML models to classify content into buckets that align with their specific customer and
content segments.Digital platforms use AutoML models to classify content into buckets that align with their specific customer and content segments.
Spam detection
Customers on the bleeding edge of their industries train models to identify evolving spam that is not caught by traditional detection solutions.
Proof-of-completion
Customers in logistics, ride-share and industrial operations train custom image classification models to validate workflow steps are done completely, such as ensuring packages are delivered as expected.
Metadata tagging
Marketplaces with vast digital libraries of image content build custom image classification models to extract key metadata and validate user-inputted attributes
Sentiment analysis
Customers build custom text classification models to determine the sentiment of customer reviews, efficiently escalating urgent tickets to manual rewiew.