Ex-Googlers Build Smart Infrastructure to Decode Business Video Data

Harness the power of your video archives with emerging video data intelligence from ex-Google innovators.
video data - video data-
Share
Add Console Classics as a preferred source on Google

Harness the power of your video archives with emerging video data intelligence from ex-Google innovators.

office workspace with tech team collaborating
office workspace with tech team collaborating

Unlocking the Hidden Potential of Video Data

Video content is exploding like never before. Businesses now generate vast oceans of footage—from broadcast archives collected over decades to countless hours of surveillance and production recordings. Yet, much of this treasure trove lies dormant, unwatched and unanalyzed, locked away as what’s called dark data: the untapped resource quietly filling servers but rarely delivering value.

Two former Googlers, Aza Kai and Hiraku Yanagita, who collaborated at Google Japan for nearly ten years, saw the enormous opportunity in this space. They co-founded InfiniMind, a Tokyo-born startup creating infrastructure that transforms petabytes of dormant video and audio into structured, searchable, and actionable business intelligence.

Kai recalls, “My co-founder, who spent a decade leading brand and data solutions at Google Japan, and I foresaw this technological turning point while still at Google.” By 2024, advances had matured enough, and market needs were crystal clear, compelling them to build their solution from the ground up.

Why Existing Video AI Solutions Fall Short

Kai’s career at Google spanned cloud computing, machine learning, ad systems, and video recommendation algorithms. With deep insight into data science, he explains that prior video intelligence tools forced a frustrating compromise.

“Older technologies could only tag objects frame-by-frame,” Kai says. “They failed to grasp the bigger picture—the storyline, cause-effect relationships, or answer complex queries about the content itself.” For customers owning decades of broadcast footage or massive video libraries, even simple questions about what their videos contained were often impossible to answer.

InfiniMind team photo showing diverse engineers collaborating
InfiniMind team photo showing diverse engineers collaborating

The AI Breakthrough That Changed Everything

Between 2021 and 2023, the landscape shifted dramatically with leaps forward in vision-language models. Video AI began evolving beyond mere object recognition to start understanding narrative flow and context.

Kai highlights two key contributors to this revolution: the steady drop in GPU costs and annual model accuracy improvements averaging 15–20% over the past decade. “But the real game-changer was capability—until recently, models simply couldn’t perform the complex tasks we needed,” he explains.

This breakthrough meant video data could finally be sliced and diced like traditional structured data—opening doors for businesses to dive deep into video insights.

From Tokyo Roots to Global Ambitions

InfiniMind recently raised $5.8 million in seed funding, spearheaded by UTEC and supported by CX2, Headline Asia, Chiba Dojo, plus an AI researcher affiliated with a16z Scout. Although headquartered in the U.S. now, the startup maintains its roots in Japan—a perfect laboratory thanks to advanced hardware, top-tier engineers, and a nurturing startup ecosystem.

Japan’s demanding pilot customers helped InfiniMind refine its tech to meet real-world challenges before scaling internationally.

Product Launches: Helping Brands See Through the Lens

Their first offering, TV Pulse, debuted in Japan in April 2025. This AI-powered platform analyzes television broadcasts live, allowing media and retail companies to track product placements, brand visibility, customer sentiment, and public relations impact in near real-time.

Pilot programs with major broadcasters and agencies quickly turned into paying contracts, including wholesalers and media firms eager to quantify their video-driven influence.

Now, InfiniMind is gearing up for the global stage with DeepFrame, a flagship long-form video intelligence platform. Slated for beta release in March 2026 and full launch in April, DeepFrame can process up to 200 hours of footage, pinpointing specific scenes, speakers, or events with remarkable precision.

Standing Out in a Crowded Video Analytics Market

The video intelligence field is crowded and diverse. General-purpose APIs, like those from companies such as TwelveLabs, cater to broad user bases—from casual consumers to prosumers and enterprises.

InfiniMind differentiates itself by focusing exclusively on enterprise needs, especially in areas like monitoring, safety, security, and deep video content analysis.

Kai emphasizes, “Our solution requires no coding. Clients simply bring their video data, and our system takes care of the rest—delivering actionable insights seamlessly.” Beyond visuals, InfiniMind integrates audio and speech recognition, enabling a more holistic understanding of video content.

One of their standout advantages is cost efficiency. “Many existing tools chase either accuracy or niche use cases but overlook the challenge of affordability at scale,” Kai notes. InfiniMind aims to break that mold.

Fueling Growth and Innovation

The seed funding isn’t just a financial boost—it’s a springboard for further developing the DeepFrame AI model, bolstering engineering infrastructure, expanding the team, and broadening customer reach across Japan and the U.S.

Kai sees their work as part of a grander vision: “This field is one of the pathways toward artificial general intelligence (AGI). Understanding video intelligence is, in essence, understanding reality itself. While industrial applications are vital, our ultimate goal is to push technological boundaries to help humans make smarter decisions.”


Conclusion

The vast reservoirs of video data companies have amassed are no longer destined to remain unused. Thanks to pioneers like InfiniMind’s founders, the curtain is being lifted on these dark data warehouses, transforming endless footage into meaningful insights that drive smarter business decisions. As video content continues to grow exponentially, solutions that can truly understand and analyze it will become indispensable. Will your company be ready to harness the video intelligence revolution?


FAQ

  • How does InfiniMind’s AI handle long videos?
    Their DeepFrame platform can process hours of footage, precisely identifying specific scenes, speakers, or events without length limitations.
  • What industries benefit most from this technology?
    Media, retail, security, and monitoring sectors see immediate advantages, especially those with large archives or real-time broadcast content.
  • Is coding knowledge required to use InfiniMind’s products?
    No, their user-friendly interface requires no coding—clients just upload videos, and the system generates insights automatically.
  • Does the platform analyze audio as well as video?
    Yes, InfiniMind integrates speech and sound analysis, providing a richer understanding beyond visuals alone.
  • What sets InfiniMind apart from other video AI companies?
    Their focus on enterprise-grade solutions, cost efficiency, no-code access, and multi-modal (audio + video) analysis distinguishes them from more general tools.

Don’t want to miss new posts?

author avatar
Valeriy Bagrintsev Founder & Chief Content Creator
Valeriy is the founder of Just Plugged and a tech reviewer focused on consumer electronics, software, and buying guides. He brings years of hands-on experience researching and evaluating tech products to help readers choose better technology with confidence.
Comments
Add a comment

Leave a Reply

Updates, No Noise
Updates, No Noise
Updates, No Noise
We filter tech. You get the signal.
Updates, No Noise
Occasional emails with honest reviews, guides, and real deals.