VisionLab Events
HIGHLIGHT

OTHER NEWS
ReViT: Enhancing vision transformers with residual attention
The self-attention mechanism of the Vision Transformer (ViT) suffers from feature collapse in deeper layers, which causes low-level visual features to vanish. However, such features can ...
2 September 2024, no comments
S-GEAR: Semantically Guided Representation Learning for Action Anticipation (ECCV2024)
Action anticipation is the task of forecasting future activity from a partially observed sequence of events. However, this task is exposed to intrinsic future uncertainty and the difficulty ...
2 September 2024, no comments
Computer Vision Conferences
- NTIRE, AI4RWD CVPR Workshops. Source: Computer Vision Conferences. Published on: 2026-01-23
- ICPRAI. Source: Computer Vision Conferences. Published on: 2026-01-23
- ImageMatch, FedVision CVPR Workshops. Source: Computer Vision Conferences. Published on: 2026-01-23
- IWBF 2026 Deadline. Source: Computer Vision Conferences. Published on: 2026-01-23
- ICPRAI 2026 Deadline. Source: Computer Vision Conferences. Published on: 2026-01-23
- CRV 2026 Deadline. Source: Computer Vision Conferences. Published on: 2026-01-23
- AIxVR 2026. Source: Computer Vision Conferences. Published on: 2026-01-23
- IEEE MIPR 2025. Source: Computer Vision Conferences. Published on: 2025-07-26
- ICPR Preliminary CfP. Source: Computer Vision Conferences. Published on: 2025-07-26
- SRBS Correction, BMVC Workshop. Source: Computer Vision Conferences. Published on: 2025-07-26
- HiCV Abstracts, ICCV Workshop. Source: Computer Vision Conferences. Published on: 2025-07-26
- SRBS BMVC Workshop. Source: Computer Vision Conferences. Published on: 2025-07-26
- AVSS 2025. Source: Computer Vision Conferences. Published on: 2025-07-26
- ACIVS 2025. Source: Computer Vision Conferences. Published on: 2025-07-26
Nvidia News
- Tuning Flash Attention for Peak Performance in NVIDIA CUDA Tile
- How to Minimize Game Runtime Inference Costs with Coding Agents
- cuTile.jl Brings NVIDIA CUDA Tile-Based Programming to Julia
- Building Telco Reasoning Models for Autonomous Networks with NVIDIA NeMo
- 5 New Digital Twin Products Developers Can Use to Build 6G Networks
- Develop Native Multimodal Agents with Qwen3.5 VLM Using NVIDIA GPU-Accelerated Endpoints
- Maximizing GPU Utilization with NVIDIA Run:ai and NVIDIA NIM
- Making Softmax More Efficient with NVIDIA Blackwell Ultra
- Using NVFP4 Low-Precision Model Training for Higher Throughput Without Losing Accuracy
- Accelerating Data Processing with NVIDIA Multi-Instance GPU and NUMA Node Localization
- Unlock Massive Token Throughput with GPU Fractioning in NVIDIA Run:ai
- Topping the GPU MODE Kernel Leaderboard with NVIDIA cuda.compute
- How NVIDIA Extreme Hardware-Software Co-Design Delivered a Large Inference Boost for Sarvam AI’s Sovereign Models
- Build AI-Ready Knowledge Systems Using 5 Essential Multimodal RAG Capabilities
- R²D²: Scaling Multimodal Robot Learning with NVIDIA Isaac Lab
Microsoft News
- Phi-4-reasoning-vision and the lessons of training a multimodal reasoning model. Source: Microsoft Research. Date: 2026-03-04. By Jyoti Aneja, Michael Harrison, Neel Joshi, Tyler LaBonte, John Langford, Eduardo Salinas
- Trailer: The Shape of Things to Come
- CORPGEN advances AI agents for real work. Source: Microsoft Research. Date: 2026-02-26. By Abubakarr Jaye, Nigel Boachie Kumankumah, Chidera Biringa, Sulaiman Vesal, Anjel Patel, Dayquan Julienne
- Media Authenticity Methods in Practice: Capabilities, Limitations, and Directions
- Project Silica’s advances in glass storage technology
- Rethinking imitation learning with Predictive Inverse Dynamics Models. Source: Microsoft Research. Date: 2026-02-05. By Pallavi Choudhury, Lukas Schäfer, Chris Lovett, Katja Hofmann, Sergio Valcarcel Macua
- Paza: Introducing automatic speech recognition benchmarks and models for low resource languages. Source: Microsoft Research. Date: 2026-02-05. By Mercy Muchai, Kevin Chege, Nick Mumero, Stephanie Nyairo
- UniRG: Scaling medical imaging report generation with multimodal reinforcement learning. Source: Microsoft Research. Date: 2026-01-27. By Sheng Zhang, Flora Liu, Guanghui Qin, Mu Wei, Hoifung Poon
- Multimodal reinforcement learning with agentic verifier for AI agents. Source: Microsoft Research. Date: 2026-01-20. By Reuben Tan, Baolin Peng, Zhengyuan Yang, Oier Mees, Jianfeng Gao
- OptiMind: A small language model with optimization expertise. Source: Microsoft Research. Date: 2026-01-15. By Xinzhi Zhang, Zeyi Chen, Humishka Hope, Hugo Barbalho, Konstantina Mellou, Marco Molinaro, Janardhan (Jana) Kulkarni, Ishai Menache, Sirui Li
- Agent Lightning: Adding reinforcement learning to AI agents without code rewrites. Source: Microsoft Research. Date: 2025-12-11. By Xufang Luo, Yuge Zhang, Zhiyuan He, Zilong Wang, Dongsheng Li, Luna K. Qiu, Yuqing Yang
- Promptions helps make AI prompting more precise with dynamic UI controls. Source: Microsoft Research. Date: 2025-12-10. By Sean Rintel, Advait Sarkar, Jack Williams, Nicholas Wilson, Richard Banks, Neeltje Berger, Philipp Steinacher, Payod Panda, Ian Drosos
- GigaTIME: Scaling tumor microenvironment modeling using virtual population generated by multimodal AI. Source: Microsoft Research. Date: 2025-12-09. By Hoifung Poon, Jeya Maria Jose Valanarasu, Naoto Usuyama, Sheng Wang
- Reducing Privacy leaks in AI: Two approaches to contextual integrity. Source: Microsoft Research. Date: 2025-11-25. By Gbola Afonja, Huseyin Atahan Inan, Qingwei Lin 林庆维, Saravan Rajmohan, Robert Sim, Xiaoting Qin, Jue Zhang, Lukas Wutschitz
- Fara-7B: An Efficient Agentic Model for Computer Use. Source: Microsoft Research. Date: 2025-11-24. By Ahmed Awadallah, Akshay Nambi, Alexey Taymanov, Aravind Rajeswaran, Corby Rosset, Hussein Mozannar, Spencer Whitehead, Vibhav Vineet, Yash Lara, Yash Pandya, Andrew Zhao
Google AI News
- Generative AI to quantify uncertainty in weather forecasting
- AutoBNN: Probabilistic time series forecasting with compositional bayesian neural networks
- Computer-aided diagnosis for lung cancer screening
- Using AI to expand global access to reliable flood forecasts
- ScreenAI: A visual language model for UI and visually-situated language understanding
- SCIN: A new resource for representative dermatology images
- MELON: Reconstructing 3D objects from images with unknown poses
- HEAL: A framework for health equity assessment of machine learning performance
- Cappy: Outperforming and boosting large multi-task language models with a small scorer
- Talk like a graph: Encoding graphs for large language models
- Chain-of-table: Evolving tables in the reasoning chain for table understanding
- Health-specific embedding tools for dermatology and pathology
- Social learning: Collaborative learning with large language models
- Croissant: a metadata format for ML-ready datasets
- Google at APS 2024
