Skip to content

Pegasus 1.5 by TwelveLabs Review

Pegasus 1.5 by TwelveLabs is a video intelligence platform that enables machines to understand, search, and analyze video content across vision, audio, and language.

shipped Apr 21, 2026aifreemium
Pegasus 1.5 by TwelveLabs - AI tool
1Processes videos up to two hours in length in a single API call.
2Outperformed Gemini 3 Pro by 13.1% on aggregate segmentation quality in internal benchmarks.
3Indexes an hour of video in approximately one minute, achieving ~60x real-time speed.
4Offers an API with synchronous analysis support as of May 6, 2026.

Stork Quadrant

Becomes the API· 27/100

Replaceable as a UI, but kept alive as the API the agents call.

TwelveLabs built a capable multimodal video understanding API before the frontier labs caught up. That window is closing. GPT-4o, Gemini 1.5 Pro, and Claude already handle video natively, and they're getting faster and cheaper. There's no proprietary data, no network, no regulatory gate — just a specialized model that bigger players will commoditize.

Claude Sonnet 4.6, scored 2026-05-30

Defensibility · 0/100

  • Physical-world coupling
  • Regulatory moat
  • Network liquidity
  • Proprietary refreshing data
  • High-trust catastrophic workflows
  • Multi-party coordination
  • Brand / community / taste

An LLM alone could replace

  • Summarize what happens in a video by describing its content
  • Transcribe audio and extract key topics or themes from spoken content
  • Answer questions about a video's subject matter given a transcript or description
  • Generate metadata tags or chapter markers for video content

Agent-Readiness · 60/100

  • Verified MCPStork MCP listing: io-twelvelabs-twelvelabs-mcp-server (untested)
  • Listed on agent surfacesStork:io-twelvelabs-twelvelabs-mcp-server
  • Usage-based pricingpricing page heuristic match: https://www.twelvelabs.io/pricing
  • Headless agent auth
  • Public OpenAPIhttps://docs.twelvelabs.io/v1.3/docs/resources/platform-overview
  • Active changeloghttps://www.twelvelabs.io/blog/introducing-pegasus-1-5 (2026-04-19)
  • llms.txthttps://www.twelvelabs.io/llms.txt

Score history · +5 pts over 3 re-scores

How to defend

Go vertical and own the liability: pick one industry where wrong video analysis has real consequences — insurance claims, legal evidence, broadcast compliance — and become the vendor that signs the contract and bears the risk. That's the only move that creates a moat here.

  • Ship an MCP server and list it on Stork — biggest single point gain (+25).
  • Expose API-key auth with a self-serve sandbox tier; remove sales-call gates (+15).

Pegasus 1.5 by TwelveLabs at a Glance

Best For
video, code, research
Pricing
freemium
Key Features
Processes videos up to two hours in length in a single API call. · Outperformed Gemini 3 Pro by 13.1% on aggregate segmentation quality in internal benchmarks. · Indexes an hour of video in approximately one minute, achieving ~60x real-time speed.
Alternatives
Mixpeek, Azure AI Video Indexer, Moments Lab, Memories.ai

About Pegasus 1.5 by TwelveLabs

Headquarters
San Francisco, USA
Founded
2020
Team Size
51-100
Funding
Series A

Similar Tools

Compare Alternatives

Other tools you might consider

Connect

</>Embed "Featured on Stork" Badge
Badge previewBadge preview light
<a href="https://www.stork.ai/en/pegasus-1-5-by-twelvelabs" target="_blank" rel="noopener noreferrer"><img src="https://www.stork.ai/api/badge/pegasus-1-5-by-twelvelabs?style=dark" alt="Pegasus 1.5 by TwelveLabs - Featured on Stork.ai" height="36" /></a>
[![Pegasus 1.5 by TwelveLabs - Featured on Stork.ai](https://www.stork.ai/api/badge/pegasus-1-5-by-twelvelabs?style=dark)](https://www.stork.ai/en/pegasus-1-5-by-twelvelabs)

overview

What is Pegasus 1.5 by TwelveLabs?

Pegasus 1.5 by TwelveLabs is a video-first language model developed by Twelve Labs that enables developers and enterprises to transform raw video content into structured, queryable data. It leverages multimodal AI to analyze visual, audio, and speech information within videos. Generally released on April 20, 2026, Pegasus 1.5 specializes in Time Based Metadata Extraction (TBM), allowing users to define custom JSON schemas to receive timestamped, structured metadata from videos up to two hours long in a single API call, without requiring prior indexing or preprocessing. The model integrates visual, audio, and speech information to generate contextually relevant text and structured metadata. As of May 6, 2026, it supports synchronous analysis for direct video processing from a URL, asset, or base64 string. Recent updates on May 28, 2026, increased its context window to 261,120 tokens and maximum response length to 98,304 tokens.

quick facts

Quick Facts

AttributeValue
DeveloperTwelve Labs
Business ModelFreemium / Hybrid (Freemium with usage-based tiers)
PricingFreemium starting at $0.001/1k input tokens, $0.007/1k output tokens
PlatformsAPI
API AvailableYes
Founded2020
HQSan Francisco, USA
FundingSeries A

features

Key Features of Pegasus 1.5 by TwelveLabs

Pegasus 1.5 by TwelveLabs provides a comprehensive suite of features designed for deep video understanding and structured data extraction, leveraging multimodal AI across vision, audio, and language. Its capabilities are accessible via an API and SDK, enabling integration into various enterprise workflows.

  • 1Search video content across vision, audio, and language modalities.
  • 2Analyze and understand video content with human-like comprehension.
  • 3Transform raw video into Time Based Metadata (TBM) using user-defined JSON schemas.
  • 4Ingest multimodal data through a single, streamlined pipeline.
  • 5Index an hour of video in approximately one minute, achieving ~60x real-time processing speed.
  • 6API and SDK available for seamless integration into existing systems.
  • 7Segment video content into meaningful, timestamped segments (e.g., editorial narratives, sports plays, speaker changes).
  • 8Generate insights from video, including concise content summaries and comprehensive textual descriptions.
  • 9Support for synchronous analysis of videos up to two hours in length, receiving results directly in the response.
  • 10Multimodal prompting, allowing users to include reference images for visual context or specific segment identification.

use cases

Who Should Use Pegasus 1.5 by TwelveLabs?

Pegasus 1.5 by TwelveLabs is designed for a diverse range of users requiring advanced video intelligence to automate analysis, enhance content management, and extract actionable insights from video data at scale. Its multimodal capabilities cater to both technical and content-focused professionals.

  • 1**Developers**: For integrating advanced video understanding capabilities into applications, platforms, and services via a robust API and SDK.
  • 2**Enterprises & Media Companies**: For automating labor-intensive manual video-tagging workflows, content summarization, detailed descriptions, and compliance scanning, saving thousands of hours of review per year.
  • 3**Sports Organizations**: For breaking down games into analyzable moments, identifying scoring events, fouls, or pivotal plays with timestamps for highlight creation, coaching reviews, and scouting.
  • 4**Marketing & Advertising Agencies**: For evaluating videos for persuasive content, identifying weak spots, and improving clarity and engagement in campaigns.
  • 5**Security Operators & Government Agencies**: For content analysis, extracting key information for archiving, and enhancing recommendation systems or monetization strategies.

pricing

Pegasus 1.5 by TwelveLabs Pricing & Plans

TwelveLabs offers a freemium pricing model for Pegasus 1.5, structured across Free, Developer, and Enterprise tiers, with usage-based rates for API consumption. Rate limits are implemented across all plans, varying by usage type (duration-based for video/audio, token-based for text, and request-based for endpoints).

  • 1**Free**: Provides basic limits at no cost, allowing users to explore the platform's capabilities.
  • 2**Developer**: Offers three tiers with increasing limits, with pricing variable based on monthly spending.
  • 3**Enterprise**: Provides custom limits and tailored pricing, requiring direct consultation with Twelve Labs.
  • 4**Input text (Pegasus model)**: Priced at $0.001 per 1,000 tokens.
  • 5**Output text (Pegasus Analyze API)**: Priced at $0.007 per 1,000 tokens.

competitors

Pegasus 1.5 by TwelveLabs vs Competitors

Pegasus 1.5 by TwelveLabs is positioned as a specialized video reasoning model, designed to outperform general-purpose models in key areas critical for production video workflows. It competes with various platforms offering video intelligence and multimodal analysis.

1
Mixpeek

Mixpeek is a multimodal data warehouse that decomposes video, images, and audio into searchable features and reassembles them through multi-stage retrieval pipelines.

Similar to TwelveLabs, Mixpeek offers a full-stack video intelligence platform with composable pipelines for various extractors (vision, audio, OCR, face), providing retrieval-ready output. It also offers a freemium model with 1,000 free credits, aligning with TwelveLabs' freemium offering.

2
Azure AI Video Indexer

Azure AI Video Indexer is a cloud and edge service that automatically extracts deep insights from video and audio content, integrated within the Microsoft Azure ecosystem.

Azure AI Video Indexer provides similar multimodal analysis capabilities (object detection, OCR, transcription, sentiment analysis) but is a service within the broader Azure ecosystem, potentially appealing to organizations already using Azure, whereas TwelveLabs is a specialized platform. It offers a free trial with up to 2,400 minutes of free indexing.

3
Moments Lab

Moments Lab is an AI-powered video discovery platform that indexes visuals, audio, and metadata to help organizations find, repurpose, share, and monetize video content.

Moments Lab directly competes in the enterprise video discovery and content monetization space, offering similar multimodal indexing and search capabilities to TwelveLabs, but with a stronger emphasis on repurposing and monetizing content. Pricing is likely enterprise-focused, requiring a demo.

4
Memories.ai

Memories.ai offers advanced AI video understanding technology that effortlessly analyzes every frame to detect objects, interpret context, recognize emotions, and extract meaningful insights.

Memories.ai provides comprehensive AI video understanding for insights, content tagging, and scene analysis, directly aligning with TwelveLabs' core offering of understanding video across vision, audio, and language. It implies a trial or freemium model.

Frequently Asked Questions

+What is Pegasus 1.5 by TwelveLabs?

Pegasus 1.5 by TwelveLabs is a video-first language model developed by Twelve Labs that enables developers and enterprises to transform raw video content into structured, queryable data. It leverages multimodal AI to analyze visual, audio, and speech information within videos.

+Is Pegasus 1.5 by TwelveLabs free?

Yes, Pegasus 1.5 by TwelveLabs offers a freemium model with a 'Free' tier that includes basic limits at no cost. Paid 'Developer' and 'Enterprise' plans are available, with usage-based pricing for API consumption, such as $0.001 per 1,000 input text tokens and $0.007 per 1,000 output text tokens.

+What are the main features of Pegasus 1.5 by TwelveLabs?

Key features include searching, analyzing, and understanding video across vision, audio, and language; transforming video into Time Based Metadata (TBM) using custom JSON schemas; ingesting multimodal data through a single pipeline; indexing an hour of video in approximately one minute; and offering an API and SDK for integration. It also supports synchronous analysis and multimodal prompting with reference images.

+Who should use Pegasus 1.5 by TwelveLabs?

Pegasus 1.5 by TwelveLabs is intended for developers, enterprises, media companies, sports organizations, marketing and advertising agencies, security operators, and government agencies. It is particularly useful for those needing to automate video analysis, extract structured insights, and enhance content management workflows.

+How does Pegasus 1.5 by TwelveLabs compare to alternatives?

Pegasus 1.5 by TwelveLabs is a specialized video reasoning model that outperforms general-purpose models like Gemini 3 Pro/3.1 Pro in segmentation quality and structured output reliability. Compared to competitors like Mixpeek, Azure AI Video Indexer, Moments Lab, and Memories.ai, Pegasus 1.5 focuses on its video-first language model for deep understanding and structured metadata extraction, offering a specialized solution for transforming raw video into queryable data.

For builders

This page is doing a job for someone else’s tool.

AI agents read it. Buyers find it. Backlinks accrue. Your tool can have one too — live in 24 hours, indexed by Claude, ChatGPT, and Perplexity, queryable via MCP.