VisionLab Events
HIGHLIGHT

OTHER NEWS
ReViT: Enhancing vision transformers with residual attention
Vision Transformer (ViT) self-attention mechanism is characterized by feature collapse in deeper layers, resulting in the vanishing of low-level visual features. However, such features can
2 Settembre 2024 Nessun commento
S-GEAR: Semantically Guided Representation Learning for Action Anticipation (ECCV2024)
Action anticipation is forecasting future activity from a partially observed sequence of events. However, this task is exposed to intrinsic future uncertainty and the difficulty
2 Settembre 2024 Nessun commento
23 Agosto 2024 Nessun commento
23 Agosto 2024 Nessun commento
23 Agosto 2024 Nessun commento
23 Agosto 2024 Nessun commento
Computer Vision Conferences
- NTIRE, AI4RWD CVPR WorkshopsSource: Computer Vision Conferences Published on: 2026-01-23
- ICPRAISource: Computer Vision Conferences Published on: 2026-01-23
- ImageMatch, FedVision CVPR WorkshopsSource: Computer Vision Conferences Published on: 2026-01-23
- IWBF 2026 DeadlineSource: Computer Vision Conferences Published on: 2026-01-23
- Source: Computer Vision Conferences Published on: 2026-01-23
- ICPRAI 2026 DeadlineSource: Computer Vision Conferences Published on: 2026-01-23
- CRV 2026 DeadlineSource: Computer Vision Conferences Published on: 2026-01-23
- AIxVR 2026Source: Computer Vision Conferences Published on: 2026-01-23
- IEEE MIPR 2025Source: Computer Vision Conferences Published on: 2025-07-26
- ICPR Preliminary CfPSource: Computer Vision Conferences Published on: 2025-07-26
- SRBS Correction, BMVC WorkshopSource: Computer Vision Conferences Published on: 2025-07-26
- HiCV Abstracts, ICCV WorkshopSource: Computer Vision Conferences Published on: 2025-07-26
- SRBS BMVC WorkshopSource: Computer Vision Conferences Published on: 2025-07-26
- AVSS 2025Source: Computer Vision Conferences Published on: 2025-07-26
- ACIVS 2025Source: Computer Vision Conferences Published on: 2025-07-26
Nvidia News
- Running Large-Scale GPU Workloads on Kubernetes with Slurm
- Cut Checkpoint Costs with About 30 Lines of Python and NVIDIA nvCOMP
- How to Accelerate Protein Structure Prediction at Proteome-Scale
- Integrate Physical AI Capabilities into Existing Apps with NVIDIA Omniverse Libraries
- Running AI Workloads on Rack-Scale Supercomputers: From Hardware to Topology-Aware Scheduling
- Accelerating Vision AI Pipelines with Batch Mode VC-6 and NVIDIA Nsight
- Achieving Single-Digit Microsecond Latency Inference for Capital Markets
- Bringing AI Closer to the Edge and On-Device with Gemma 4
- CUDA Tile Programming Now Available for BASIC!
- NVIDIA Extreme Co-Design Delivers New MLPerf Inference Records
- Accelerate Token Production in AI Factories Using Unified Services and Real-Time AI
- Stream High-Fidelity Spatial Computing Content to Any Device with NVIDIA CloudXR 6.0
- Build and Stream Browser-Based XR Experiences with NVIDIA CloudXR.js
- Maximize AI Infrastructure Throughput by Consolidating Underutilized GPU Workloads
- How Centralized Radar Processing on NVIDIA DRIVE Enables Safer, Smarter Level 4 Autonomy
Microsoft News
- Ideas: Steering AI toward the work future we wantSource: Microsoft Research Date: 2026-04-09 By Jaime Teevan, Jenna Butler, Jake Hofman, Rebecca Janssen
- ADeLe: Predicting and explaining AI performance across tasks
- AsgardBench: A benchmark for visually grounded interactive planning
- Will machines ever be intelligent?
- Systematic debugging for AI agents: Introducing the AgentRx frameworkSource: Microsoft Research Date: 2026-03-12 By Shraddha Barke, Arnav Goyal, Alind Khare, Chetan Bansal
- Phi-4-reasoning-vision and the lessons of training a multimodal reasoning modelSource: Microsoft Research Date: 2026-03-04 By Jyoti Aneja, Michael Harrison, Neel Joshi, Tyler LaBonte, John Langford, Eduardo Salinas
- Trailer: The Shape of Things to Come
- CORPGEN advances AI agents for real workSource: Microsoft Research Date: 2026-02-26 By Abubakarr Jaye, Nigel Boachie Kumankumah, Chidera Biringa, Sulaiman Vesal, Anjel Patel, Dayquan Julienne
- Media Authenticity Methods in Practice: Capabilities, Limitations, and Directions
- Project Silica’s advances in glass storage technology
- Rethinking imitation learning with Predictive Inverse Dynamics ModelsSource: Microsoft Research Date: 2026-02-05 By Pallavi Choudhury, Lukas Schäfer, Chris Lovett, Katja Hofmann, Sergio Valcarcel Macua
- Paza: Introducing automatic speech recognition benchmarks and models for low resource languagesSource: Microsoft Research Date: 2026-02-05 By Mercy Muchai, Kevin Chege, Nick Mumero, Stephanie Nyairo
- UniRG: Scaling medical imaging report generation with multimodal reinforcement learningSource: Microsoft Research Date: 2026-01-27 By Sheng Zhang, Flora Liu, Guanghui Qin, Mu Wei, Hoifung Poon
- Multimodal reinforcement learning with agentic verifier for AI agentsSource: Microsoft Research Date: 2026-01-20 By Reuben Tan, Baolin Peng, Zhengyuan Yang, Oier Mees, Jianfeng Gao
- OptiMind: A small language model with optimization expertiseSource: Microsoft Research Date: 2026-01-15 By Xinzhi Zhang, Zeyi Chen, Humishka Hope, Hugo Barbalho, Konstantina Mellou, Marco Molinaro, Janardhan (Jana) Kulkarni, Ishai Menache, Sirui Li
Google AI News
- Generative AI to quantify uncertainty in weather forecasting
- AutoBNN: Probabilistic time series forecasting with compositional bayesian neural networks
- Computer-aided diagnosis for lung cancer screening
- Using AI to expand global access to reliable flood forecasts
- ScreenAI: A visual language model for UI and visually-situated language understanding
- SCIN: A new resource for representative dermatology images
- MELON: Reconstructing 3D objects from images with unknown poses
- HEAL: A framework for health equity assessment of machine learning performance
- Cappy: Outperforming and boosting large multi-task language models with a small scorer
- Talk like a graph: Encoding graphs for large language models
- Chain-of-table: Evolving tables in the reasoning chain for table understanding
- Health-specific embedding tools for dermatology and pathology
- Social learning: Collaborative learning with large language models
- Croissant: a metadata format for ML-ready datasets
- Google at APS 2024
