BACK TO ALL BLOGS Streamline CSAM Reports with Moderation Dashboard’s NCMEC Integration HiveFebruary 26, 2025February 27, 2025 Contents Ensuring Child Safety OnlineIntegration WorkflowNCMEC Report Contents Hive is excited to announce that we have integrated the National Center for Missing & Exploited Children’s (NCMEC) CyberTipline into Moderation Dashboard, streamlining the process of submitting child sexual abuse material (CSAM) reports. This feature is now available to all Moderation Dashboard customers with valid NCMEC credentials. Ensuring Child Safety Online The National Center for Missing & Exploited Children is a non-profit organization dedicated to protecting children from all forms of exploitation and abuse. All electronic communication service providers are required under U.S. federal law to report any known CSAM on their platforms to NCMEC’s CyberTipline—a centralized system for receiving and processing CSAM reports. These reports are later shared with law enforcement and relevant service providers so they can take further action. Throughout our endeavors and partnerships, Hive’s commitment to online safety has been unwavering. We built this integration to help automate the reporting process, simplify our customers’ workflows, and ensure that their platforms can comply with applicable law. Integration Workflow A step-by-step sample integration workflow is outlined, starting from when a user uploads an image to the platform and ending in the subsequent actions a moderator can take. For a more detailed guide on how the reporting process works, refer to the following documentation. A user uploads an image to the platform.The image is processed by Hive’s proprietary CSAM Detection API, powered by Thorn—a leading nonprofit that builds technology to defend children from sexual abuse. To learn more about our Thorn partnership, read our blog posts linked below:Matching Against CSAM: Hive’s Innovative Integration with Thorn’s Safer MatchExpanding Our CSAM Detection APIIf there is a likelihood of CSAM detected in the image, this image will surface as a link in the CSAM Review Feed. Once the link is clicked, the media will open in a new browser tab for the moderator to review. Moderation Dashboard will never display CSAM content directly within the Review Feed.From the review feed, the moderator can take two actions:Perform an enforcement action (e.g. banning the user or deleting the post). A webhook is sent to the customer’s server afterward, containing the moderator’s chosen enforcement action as well as the post and user metadata, all of which are used to take the content down.The system will automatically create a report, which the moderator can send to NCMEC by clicking the “Submit” button within the Review Feed. After the report is submitted, the system creates an internal log to track the report (e.g. submission date and time, as well as storing the response from NCMEC). “Report to NCMEC” button within Review Feed NCMEC Report Contents Customers can pre-fill information fields that are constant across reports. These fields will be automatically populated for each report, reducing effort on the customer’s end. To provide our customers with full transparency, the report sent to NCMEC includes: the moderator’s information, the company’s information, the potential CSAM content, and the incident date and time. Moderator information fields If you’re interested in learning more about what we do, please reach out to our sales team (sales@thehive.ai) or contact us here for further questions.
BACK TO ALL BLOGS Hive to be Lead Sponsor of Trust & Safety Summit 2025 HiveFebruary 5, 2025March 17, 2025 We are thrilled to announce that Hive is the lead sponsor of the Trust & Safety Summit 2025. As Europe’s premier Trust & Safety conference, this summit is designed to empower T&S leaders to tackle operational and regulation challenges, providing them with both actionable insights and future-focused strategies. The summit will be held Tuesday, March 25th and Wednesday, March 26th at the Hilton London Syon Park, UK. The 2-day event will explore themes such as regulatory preparedness, scaling trust and safety solutions, and best practices for effective content moderation. An incredible selection of programming will include expert-led panels, interactive workshops and networking events. Hive’s CEO Kevin Guo will deliver the keynote presentation on “The Next Frontier of Content Moderation”, covering topics like multi-modal LLMs and detecting AI generated content. Additionally, Hive will host two panels during the event: Hyperscaling Trust & Safety: Navigating Growth While Maintaining Integrity. Hive will be discussing best practices for scaling trust & safety systems for online platforms experiencing hypergrowth.Harnessing AI to Detect Unknown CSAM: Innovations, Challenges, and the Path Forward. Hive will be joined by partners Thorn and IWF to discuss recent advancements in CSAM detection solutions. As the lead sponsor of the T&S Summit 2025, we are furthering our commitment to making the internet a safer place. Today, Hive’s comprehensive moderation stack empowers Trust & Safety teams of all sizes to scale their moderation workflows with both pre-trained and customizable AI models, flexible LLM-based moderation, and a moderation dashboard for streamlined enforcement of policies. We look forward to welcoming you to the Trust & Safety Summit 2025. If you’re interested in attending the conference, please reach out to your Hive account manager or sales@thehive.ai. Prospective conference attendees can also find more details and ticket information here. For a detailed breakdown of summit programming, download the agenda here. To learn more about what we do at Hive, please reach out to our sales team or contact us here for further questions.
BACK TO ALL BLOGS Protecting Children’s Online Safety with Internet Watch Foundation HiveJanuary 23, 2025February 25, 2025 Contents Our Joint Commitment to Child SafetyIntegration with Thorn Safer Match Hive is proud to announce that we are partnering with Internet Watch Foundation (IWF), a non-profit organization working to stop child sexual abuse online. We will be integrating their proprietary keyword and URL lists into our default Text Moderation model for all customers at no additional cost. Our Joint Commitment to Child Safety Making the internet a safer place is one of Hive’s core values. Our partnership with IWF allows us to use their specialized knowledge to bolster our leading content moderation tools, helping our customers better detect and flag online records of child sexual abuse. As part of our partnership, Hive will now include the following two IWF wordlists as part of our default Text Moderation model for all customers at no additional cost: Keyword List: This wordlist contains known terms and code words that offenders use to exchange child sexual abuse material (CSAM) in a discreet manner. More information can be found here.URL List: This wordlist contains a comprehensive list of webpages that are confirmed to host CSAM in image or video form. More information can be found here. With these lists, customers can now use Text Moderation to catch various keywords and URLs associated with CSAM. These lists are dynamic and will be updated on a daily basis. A sample Text Moderation response can be found below. We recommend that all customers perform an initial evaluation to first determine if the list’s keywords are helpful for their specific use case. For more information, refer to the following documentation. Integration with Thorn Safer Match Our partnership also grants us access to IWF’s hash lists. Previously, we partnered with Thorn, allowing customers to integrate their Safer Match hash matching technology for CSAM detection using Hive APIs. We can now match against IWF’s hash lists with Thorn Safer Match. If you want this feature supported, please reach out to our sales team (sales@thehive.ai). By combining our leading moderation tools with IWF’s specialized expertise, we hope that we can create a safer internet for children worldwide. For more details, you can find our recent press release here, as well as our CEO Kevin Guo’s interview with Rashi Shrivastava of Forbes here. If you’re interested in learning more about what we do, please reach out to our sales team or contact us here for further questions.
BACK TO ALL BLOGS State of the Deepfake: Trends & Threat Forecast for 2025 HiveJanuary 16, 2025January 23, 2025
BACK TO ALL BLOGS Expanding our Moderation APIs with Hive’s New Vision Language Model HiveDecember 23, 2024February 21, 2025 Contents An Introduction to VLMs and Moderation 11BPotential Use CasesExpanding Moderation Hive is thrilled to announce that we’re releasing Moderation 11B Vision Language Model. Fine-tuned on top of Llama 3.2 11B Vision Instruct, Moderation 11B is a new vision language model (VLM) that expands our established suite of text and visual moderation models. Building on our existing capabilities, this new model offers a powerful way to handle flexible and context-dependent moderation scenarios. An Introduction to VLMs and Moderation 11B Vision language models (VLMs) are models that can learn from image and text inputs. This ability to simultaneously process inputs across multiple modalities (e.g. images and text) is known as multimodality. While VLMs share similar functions with large language models (LLMs), traditional LLMs cannot process image inputs. With Moderation 11B VLM, we leverage unique multimodal capabilities to extend our existing moderation tool suite. Beyond its multimodality, Moderation 11B VLM can incorporate additional contextual information, which is not possible with our traditional classifiers. The model’s baked-in knowledge, combined with insights trained from our classifier dataset, enables a more comprehensive approach to moderation. Moderation 11B VLM is trained on all 53 public heads of our Visual Moderation system, recognizing content across distinct categories such as sexual content, violence, drugs, hate, and more. Because of these enhancements, it becomes a valuable addition to our existing Enterprise moderation classifiers, helping to capture a wide range of flexible and alternative cases that can arise in dynamic workflows. Potential Use Cases Moderation 11B VLM applies to a broad range of use cases, notably surpassing Llama 3.2 11B Vision Instruct in identifying contextual violations and handling unseen data in our internal tests. Below are some potential use cases where our model performs well: Contextual violations: Cases where individual inputs alone may not be flagged as violations, but all inputs contextualized together makes it one. For example, a text message could appear harmless on its own, yet the preceding conversation context reveals it to be a violation.Multi-modal violations: Situations where both text and image inputs are important. For instance, analyzing a product image alongside its description can uncover violations that single-modality models would miss.Unseen data: Inputs that the model has not previously encountered. For example, customers may use Moderation 11B VLM to ensure that user content aligns with newly introduced company policies. Below are graphical representations of how our fine-tuned Moderation 11B model performed in our internal testing compared to the Llama 3.2 11B Vision Instruct model. We assessed their respective F1 scores, a metric that combines both precision and recall. The F1 score was computed using the standard formula: F1 = 2 * (precision * recall) / (precision + recall). In our internal evaluation, we tasked both our Moderation 11B VLM and Llama 3.2 11B Vision Instruct with learning the classification guidelines outlined in our public Visual Moderation documentation. These guidelines were then used to evaluate a randomly selected sizable sample dataset of images from our proprietary Visual Moderation dataset, which has highly accurate hand-labeled ground truth classifications. This dataset also included diverse and challenging content types from each of our visual moderation heads, such as sexual intent, hate symbols and self harm. While Moderation 11B VLM’s performance demonstrates its ability to generalize well within the scope of these content classes, it is important to note that results may vary depending on the composition of external datasets Expanding Moderation With Moderation 11B VLM’s release, we hope to meaningfully and flexibly broaden the range of use cases our moderation tools can handle. We’re excited to see how this model assists with your moderation workflows, especially when navigating complex scenarios. Anyone with a Hive account can access our API playground here to try Moderation 11B VLM directly from the user interface. Below are two examples of Moderation 11B VLM requests and responses. For more details, please refer to the documentation here. If you’re interested in learning more about what we do, please reach out to our sales team (sales@thehive.ai) or contact us here for further questions.
BACK TO ALL BLOGS Announcing Hive’s Partnership with the Defense Innovation Unit HiveDecember 5, 2024February 21, 2025 Contents Combating Synthetic Media and DisinformationOur Cutting-Edge ToolsForging a Path Forward Hive is excited to announce that we have been awarded a Department of Defense (DoD) contract for deepfake detection of video, image, and audio content. This groundbreaking partnership marks a significant milestone in protecting our national security from the risks of synthetic media and AI-generated disinformation. Combating Synthetic Media and Disinformation Rapid strides in technology have made AI manipulation the weapon of choice for numerous adversarial entities. For the Department of Defense, a digital safeguard is necessary in order to protect the integrity of vital information systems and stay vigilant against the future spread of misinformation, threats, and conflicts at a national scale. Hive’s reputation as frontline defenders against AI-generated deception makes us uniquely equipped to handle such threats. Not only do we understand the stakes at hand, we have been and continue to be committed to delivering unmatched detection tools that can mitigate these risks with accuracy and speed. Under our initial two-year contract, Hive will partner with the Defense Innovation Unit (DIU) to support the intelligence community with our state-of-the-art deepfake detection models, deployed in an offline, on-premise environment and capable of detecting AI-generated video, image, and audio content. We are honored to join forces with the Department of Defense in this critical mission. Our Cutting-Edge Tools To best empower the U.S. defense forces against potential threats, we have provided five proprietary models that can detect whether an input is AI-generated or a deepfake. If an input is flagged as AI-generated, it was likely created using a generative AI engine. Whereas, a deepfake is a real image or video where one or more of the faces in the original image has been swapped with another person’s face. The models we’ve provided are, as follows: AI-Generated Detection (Image and Video), which detects if an image or video is AI-generated.AI-Generated Detection (Audio), which detects if an audio clip is AI-generated.Deepfake Detection (Image), which detects if an image contains one or more faces that are deepfaked.Deepfake Detection (Video), which detects if a video contains one or more faces that are deepfaked.Liveness (Image and Video), which detects whether a face in an image or video is primary (exists in the primary image) or secondary (exists in an image, screen, or painting inside of the primary image). Forging a Path Forward Even as new threats continue to emerge and escalate, Hive continues to be steadfast in our commitment to provide the world’s most capable AI models for validating the safety and authenticity of digital content. For more details, you can find our recent press release here and the DIU’s press release here. If you’re interested in learning more about what we do, please reach out to our sales team (sales@thehive.ai) or contact us here for further questions.