{"id":47,"date":"2023-05-03T13:43:00","date_gmt":"2023-05-03T13:43:00","guid":{"rendered":"http:\/\/54.151.72.21\/?p=47"},"modified":"2025-03-05T06:20:58","modified_gmt":"2025-03-05T06:20:58","slug":"build-your-own-custom-ml-models-with-hive-automl","status":"publish","type":"post","link":"https:\/\/thehive.ai\/blog\/build-your-own-custom-ml-models-with-hive-automl","title":{"rendered":"Build Your Own Custom ML Models with Hive AutoML"},"content":{"rendered":"\n<p>We\u2019re excited to announce Hive\u2019s new&nbsp;<strong><a href=\"https:\/\/thehive.ai\/solutions\/auto-ml\" target=\"_blank\" rel=\"noreferrer noopener\">AutoML<\/a><\/strong>&nbsp;tool that provides customers with everything they need to train, evaluate, and deploy customized machine learning models.&nbsp;<\/p>\n\n\n\n<p>Our pre-trained models solve a wide range of use cases, but we will always be bounded by the number of models we can build. Now customers who find that their unique needs and moderation guidelines don\u2019t quite match with any of our existing solutions can create their own, custom-built for their platform and easily accessible via API.<\/p>\n\n\n\n<p>AutoML can be used to augment our current offerings or to create new models entirely. Want to flag a particular subject that doesn\u2019t exist as a head in our Text Moderation API, or a certain symbol or action that isn\u2019t part of our Visual Moderation? With AutoML, you can quickly build solutions for these problems that are already integrated with your Hive workflow.<\/p>\n\n\n\n<p>Let\u2019s walk through our AutoML process to illustrate how it works. In this example, we\u2019ll build a text classification model that can determine whether or not a given news headline is satirical.&nbsp;<\/p>\n\n\n\n<p>First, we need to get our data in the proper format. For text classification models, all dataset files must be in CSV format. One column should contain the text data (titled&nbsp;<strong>text_data<\/strong>) and all other columns represent model heads (classification categories). The values within each row of any given column represent the classes (possible classifications) within that head. An example of this formatting for our satire model is shown below:<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"1024\" height=\"597\" src=\"https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/1-1-1024x597.png\" alt=\"\" class=\"wp-image-176\" srcset=\"https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/1-1-1024x597.png 1024w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/1-1-300x175.png 300w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/1-1-768x448.png 768w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/1-1-1536x896.png 1536w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/1-1.png 2048w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>The first page you\u2019ll see on Hive\u2019s AutoML platform is a dashboard with all of your organization\u2019s training projects. In the image below, you\u2019ll see how the training and deployment status of old projects are displayed. To create our satire classifier, we\u2019re going to make a new project by hitting the \u201cCreate New Project\u201d button in the top right corner.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"1024\" height=\"597\" src=\"https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/2-1-1024x597.png\" alt=\"\" class=\"wp-image-177\" srcset=\"https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/2-1-1024x597.png 1024w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/2-1-300x175.png 300w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/2-1-768x448.png 768w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/2-1-1536x896.png 1536w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/2-1.png 2048w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>We\u2019ll then be prompted to provide a name and description for the project as well as training data in the form of a CSV file. For test data, you can either upload a separate CSV file or choose to randomly split your training data into two files, one to be used for training and the other for testing. If you decide to split your data, you will be able to choose the percentage that you would like to split off.<\/p>\n\n\n\n<p>After all of that is entered, we are ready to train! Beginning model training is as easy as hitting a single button. While your model trains, you can easily view its training status on the Training Projects page.<\/p>\n\n\n\n<p>Once training is completed, your project page will show an analysis of the model\u2019s performance. The boxes at the top allow you to decide if you want to look at this analysis for a particular class or overall. If you\u2019re building a multi-headed model, you can choose which head you\u2019d like to evaluate as well. We provide precision, recall, and balanced accuracy for all confidence thresholds as well as a PR curve. We also display a confusion matrix to show how many predictions were correct and incorrect per class.<\/p>\n\n\n\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" width=\"1024\" height=\"597\" src=\"https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/3-1024x597.png\" alt=\"\" class=\"wp-image-178\" srcset=\"https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/3-1024x597.png 1024w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/3-300x175.png 300w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/3-768x448.png 768w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/3-1536x896.png 1536w, https:\/\/staticblog.thehive.ai\/uploads\/2024\/07\/3.png 2048w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>Once you\u2019re satisfied with your model\u2019s performance, select the \u201cCreate Deployment\u201d to launch the model. Similarly to model training, deployment will take a few moments. After model deployment is complete, you can view the deployment in your Hive customer dashboard, where you can access your API key, view current tasks, as well as access other information as you would with our pre-trained models.<\/p>\n\n\n\n<p>We\u2019re very excited to be adding AutoML to our offerings. The platform currently supports both text and image classification, and we\u2019re working to add support for large language models next. If you\u2019d like to learn more about our AutoML platform and other solutions we\u2019re building, please feel free to reach out to&nbsp;<a href=\"mailto:sales@thehive.ai\" target=\"_blank\" rel=\"noreferrer noopener\">sales@thehive.ai<\/a>&nbsp;or contact us&nbsp;<a href=\"https:\/\/thehive.ai\/contact-us?source=blog\" target=\"_blank\" rel=\"noreferrer noopener\">here<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Hive announces a new AutoML tool to train, evaluate, and deploy customized machine learning models.<\/p>\n","protected":false},"author":1,"featured_media":170,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"kia_subtitle":""},"categories":[8,4,2],"tags":[],"_links":{"self":[{"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/posts\/47"}],"collection":[{"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/comments?post=47"}],"version-history":[{"count":5,"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/posts\/47\/revisions"}],"predecessor-version":[{"id":412,"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/posts\/47\/revisions\/412"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/media\/170"}],"wp:attachment":[{"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/media?parent=47"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/categories?post=47"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/thehive.ai\/blog\/wp-json\/wp\/v2\/tags?post=47"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}