Computer vision for people & planetary health | Reducing the trial-and-error of machine learning for startups | Consulting | Heather D. Couture, PhD
  • Articles
  • Newsletter
  • Podcast
  • Webinars
  • Services
  • Resources
    • Pathology
    • Earth Observation
    • Office Hours
  • About
    • About Me
    • Testimonials
    • Portfolio
    • Publications
    • Speaking
    • Media Coverage
  • Book a Call

What's New in Computer Vision? CVPR 2025 Edition


Are you ready to catch up on the latest breakthroughs in computer vision? Join me for a 30-minute, fast-paced webinar inspired by the most exciting trends and demos from CVPR 2025.

What You’ll Learn:

  • Foundation Models in New Domains: While mainstream CV has embraced foundation models, some domains like cytology, spatial proteomics, and agriculture are just catching up.
  • Multimodal Integration: Combine multiple modalities of data in new open weight and open data models, plus better grounding capabilities for VLMs.
  • The Rise of Agentic AI: Explore the next generation of autonomous AI systems getting ready to make real-world impact.
Watch the Replay
Stay at the cutting edge of vision AI. Subscribe to my weekly Computer Vision Insights newsletter?

Download slides

In this session, I explored the rapid evolution of computer vision from task-specific models to sophisticated AI agents, highlighting groundbreaking developments from CVPR 2025. Did you know we now have foundation models specifically trained for cytology analysis, spatial proteomics with 175 protein markers, and hyperspectral imagery spanning 400-2500nm? These advances show how computer vision is moving far beyond general-purpose applications into highly specialized domains.

Key takeaways:

  • Foundation models are expanding into specialized domains like medical imaging, agriculture, and remote sensing with massive, curated datasets
  • Multimodal systems are embracing open weights/data, grounding capabilities, and using language as the “glue” to connect different modalities
  • AI agents are evolving into multi-agent systems with specialized roles that mimic human expert workflows through orchestration
  • New benchmarks and evaluation frameworks are emerging to handle complex multimodal tasks across diverse applications
  • The field is shifting from isolated, general-purpose models toward integrated, domain-specific systems capable of real-world problem-solving

If something in this talk resonated with you and you’d like a deeper discussion, I’m also offering a free pixel clarity call in which we’ll dive into your unique challenges and goals. I can help you connect data, modeling, and domain expertise to build more trustworthy computer vision.

You can book this call here.



Share

Tweet

Latest

What's New in Computer Vision? CVPR 2025 Edition

June 27, 2025

Bias & Batch Effects in Medical Imaging

May 15, 2025

Three Critical Mistakes Derailing Your Computer Vision Projects

April 2, 2025

Disentangling Distribution Shift

February 26, 2025

Pixel Scientia Labs

Computer Vision for People & Planetary Health

  • [email protected]
  • http://pixelscientia.com
  • Articles
  • Newsletter
  • Podcast
  • Webinars
  • Services
  • Contact

© 2024 Pixel Scientia Labs, LLC