Paper Digest: Recent Papers on Semantic Segmentation
Paper Digest Team extracted all recent Semantic Segmentation related papers on our radar, and generated highlight sentences for them. The results are then sorted by relevance & date. In addition to this ‘static’ page, we also provide a real-time version of this article, which has more coverage and is updated in real time to include the most recent updates on this topic.
This curated list is created by the Paper Digest Team. Experience the cutting-edge capabilities of Paper Digest, an innovative AI-powered research platform that gets you the personalized and comprehensive daily paper digests on the latest research in your field. It also empowers you to read articles, write articles, get answers, conduct literature reviews and generate research reports.
Experience the full potential of our services today!
TABLE 1: Paper Digest: Recent Papers on Semantic Segmentation
| Paper | Author(s) | Source | Date | |
|---|---|---|---|---|
| 1 | GaussianTrimmer: Online Trimming Boundaries for 3DGS Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we propose an online boundary trimming method, GaussianTrimmer, which is an efficient and plug-and-play post-processing method capable of trimming coarse boundaries for existing 3D Gaussian segmentation methods. |
Liwei Liao; Ronggang Wang; | arxiv-cs.CV | 2026-01-18 |
| 2 | Toward Real-World High-Precision Image Matting and Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we propose a Foreground Consistent Learning model, dubbed as FCLM, to address the aforementioned issues. |
HAIPENG ZHOU et. al. | arxiv-cs.CV | 2026-01-17 |
| 3 | Medical SAM3: A Foundation Model for Universal Prompt-Driven Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Here we present Medical SAM3, a foundation model for universal prompt-driven medical image segmentation, obtained by fully fine-tuning SAM3 on large-scale, heterogeneous 2D and 3D medical imaging datasets with paired segmentation masks and text prompts. |
CHONGCONG JIANG et. al. | arxiv-cs.CV | 2026-01-15 |
| 4 | Jordan-Segmentable Masks: A Topology-Aware Definition for Characterizing Binary Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we introduce a topology-aware notion of segmentation based on the Jordan Curve Theorem, and adapted for use in digital planes. |
Serena Grazia De Benedictis; Amedeo Altavilla; Nicoletta Del Buono; | arxiv-cs.CV | 2026-01-15 |
| 5 | SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose a simple but effective framework, termed SAM2-UNet, for versatile image segmentation. |
XINYU XIONG et. al. | Visual Intelligence | 2026-01-13 |
| 6 | How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation? Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce a novel collaborative framework, \textit{S}tepping \textit{S}tone \textit{P}lus (SSP), which integrates optical flow and textual prompts to assist the segmentation process. |
Peng Gao; Yujian Lee; Yongqi Xu; Wentao Fan; | arxiv-cs.CV | 2026-01-12 |
| 7 | HG-RSOVSSeg: Hierarchical Guidance Open-Vocabulary Semantic Segmentation Framework of High-Resolution Remote Sensing Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Specifically, we propose a multimodal feature aggregation module for pixel-level alignment and a hierarchical visual feature decoder guided by text feature alignment, which progressively refines visual features using language priors, preserving semantic coherence during high-resolution decoding. |
Wubiao Huang; Fei Deng; Huchen Li; Jing Yang; | Remote Sensing | 2026-01-09 |
| 8 | G2P: Gaussian-to-Point Attribute Alignment for Boundary-Aware 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose Gaussian-to-Point (G2P), which transfers appearance-aware attributes from 3D Gaussian Splatting to point clouds for more discriminative and appearance-consistent segmentation. |
HOJUN SONG et. al. | arxiv-cs.CV | 2026-01-06 |
| 9 | EarthVL: A Progressive Earth Vision-Language Understanding and Generation Framework Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Three benchmarks, including semantic segmentation, multiple-choice, and open-ended VQA demonstrated the superiorities of EarthVLNet, yielding three future directions: 1) segmentation features consistently enhance VQA performance even in cross-dataset scenarios; 2) multiple-choice tasks show greater sensitivity to the vision encoder than to the language decoder; and 3) open-ended tasks necessitate advanced vision encoders and language decoders for an optimal performance. We believe this dataset and method will provide a beneficial benchmark that connects ”image-mask-text”, advancing geographical applications for Earth vision. |
JUNJUE WANG et. al. | arxiv-cs.CV | 2026-01-06 |
| 10 | Leveraging 2D-VLM for Label-Free 3D Segmentation in Large-Scale Outdoor Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents a novel 3D semantic segmentation method for large-scale point cloud data that does not require annotated 3D training data or paired RGB images. |
Toshihiko Nishimura; Hirofumi Abe; Kazuhiko Murasaki; Taiga Yoshida; Ryuichi Tanida; | arxiv-cs.CV | 2026-01-05 |
| 11 | Joint Semantic and Rendering Enhancements in 3D Gaussian Modeling with Anisotropic Local Encoding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose a joint enhancement framework for 3D semantic Gaussian modeling that synergizes both semantic and rendering branches. |
Jingming He; Chongyi Li; Shiqi Wang; Sam Kwong; | arxiv-cs.CV | 2026-01-05 |
| 12 | A Cascaded Information Interaction Network for Precise Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, robust segmentation remains a challenge in complex scenarios. To address this, this paper proposes a cascaded convolutional neural network integrated with a novel Global Information Guidance Module. |
Hewen Xiao; Jie Mei; Guangfu Ma; Weiren Wu; | arxiv-cs.CV | 2026-01-01 |
| 13 | UniC-Lift: Unified 3D Instance Segmentation Via Contrastive Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing methods use a two-stage approach in which some rely on contrastive learning with hyperparameter-sensitive clustering, while others preprocess labels for consistency. We propose a unified framework that merges these steps, reducing training time and improving performance by introducing a learnable feature embedding for segmentation in Gaussian primitives. |
Ankit Dhiman; Srinath R; Jaswanth Reddy; Lokesh R Boregowda; Venkatesh Babu Radhakrishnan; | arxiv-cs.CV | 2025-12-31 |
| 14 | 3D Semantic Segmentation for Post-Disaster Assessment Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: While 3D semantic segmentation is crucial for post-disaster assessment, existing deep learning models lack datasets specifically designed for post-disaster environments. To address this gap, we constructed a specialized 3D dataset using unmanned aerial vehicles (UAVs)-captured aerial footage of Hurricane Ian (2022) over affected areas, employing Structure-from-Motion (SfM) and Multi-View Stereo (MVS) techniques to reconstruct 3D point clouds. |
Nhut Le; Maryam Rahnemoonfar; | arxiv-cs.CV | 2025-12-30 |
| 15 | BATISNet: Instance Segmentation of Tooth Point Clouds with Boundary Awareness Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, due to the tightly packed structure of teeth, unclear boundaries, and the diversity of complex cases such as missing teeth, malposed teeth, semantic segmentation often struggles to achieve satisfactory results when dealing with complex dental cases. To address these issues, this paper propose BATISNet, a boundary-aware instance network for tooth point cloud segmentation. |
Yating Cai; Yanghui Xu; Zehua Hu; Jiazhou Chen; Jing Huang; | arxiv-cs.GR | 2025-12-30 |
| 16 | PASENet: Snowy Scene 3D Object Detection With Pillar‐Wise Attention and Semantic Enhancement Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: ABSTRACT LiDAR based 3D object detection plays an essential role in autonomous driving, yet snowy conditions degrade point cloud quality by introducing false returns and causing occlusion of objects, degrading the detection accuracy of existing 3D object detection algorithms. To overcome these challenges, we propose pillar‐wise attention and semantic enhancement network (PASENet), an end‐to‐end network specifically designed for snowy scene. |
Yutian Wu; Wenwei Sun; Zuodong Zhong; Qing Li; | IET Image Processing | 2025-12-30 |
| 17 | SwinTF3D: A Lightweight Multimodal Fusion Approach for Text-Guided 3D Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The lack of semantic understanding in these models makes them ineffective in addressing flexible, user-defined segmentation objectives. To overcome these limitations, we propose SwinTF3D, a lightweight multimodal fusion approach that unifies visual and linguistic representations for text-guided 3D medical image segmentation. |
Hasan Faraz Khan; Noor Fatima; Muzammil Behzad; | arxiv-cs.CV | 2025-12-28 |
| 18 | Split4D: Decomposed 4D Scene Reconstruction Without Video Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Therefore, they heavily rely on the quality of video segmentation maps, which are often unstable, leading to unreliable reconstruction results. To overcome this challenge, our key idea is to represent the decomposed 4D scene with the Freetime FeatureGS and design a streaming feature learning strategy to accurately recover it from per-image segmentation maps, eliminating the need for video segmentation. |
YONGZHEN HU et. al. | arxiv-cs.CV | 2025-12-27 |
| 19 | Scene-VLM: Multimodal Video Scene Segmentation Via Vision-Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we present Scene-VLM, the first fine-tuned vision-language model (VLM) framework for video scene segmentation. |
NIMROD BERMAN et. al. | arxiv-cs.CV | 2025-12-25 |
| 20 | A Dual-Path Fusion Network with Edge Feature Enhancement for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes a Dual-path Feature-enhanced Fusion Network (DPF-Net) for medical image segmentation to address limitations in existing methods, including insufficient edge feature extraction, semantic gaps among multi-scale encoder features, and significant semantic disparities between the encoder and decoder in U-Net architectures. |
Liangxu Shi; Weiyuan He; Guodong Wang; | Mathematics | 2025-12-24 |
| 21 | Surgical Scene Segmentation Using A Spike-Driven Video Transformer with Real-Time Potential Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we propose \textit{SpikeSurgSeg}, the first spike-driven video Transformer framework tailored for surgical scene segmentation with real-time potential on non-GPU platforms. |
SHIHAO ZOU et. al. | arxiv-cs.CV | 2025-12-24 |
| 22 | PGMNet: A Polyp Segmentation Network Based on Bit-Plane Slicing and Multi-Scale Adaptive Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Accurate detection and segmentation of polyps during colonoscopy are of great significance for the early prevention and treatment of colorectal cancer. However, due to the … |
Dong Wang; ShanLin Liu; Shuai Li; HaiSha Liu; YuLingHeng Wang; | Biomedical Physics & Engineering Express | 2025-12-22 |
| 23 | VOIC: Visible-Occluded Decoupling for Monocular 3D Semantic Scene Completion Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This strategy purifies the supervisory space for two complementary sub-tasks: visible-region perception and occluded-region reasoning. Building on this idea, we propose the Visible-Occluded Interactive Completion Network (VOIC), a novel dual-decoder framework that explicitly decouples SSC into visible-region semantic perception and occluded-region scene completion. |
Zaidao Han; Risa Higashita; Jiang Liu; | arxiv-cs.CV | 2025-12-21 |
| 24 | IndiVNet A Region Adaptive Semantic Image Segmentation for Autonomous Driving in Unstructured Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Pritam Chakraborty; Anjan Bandyopadhyay; Siddhartha Bhattacharyya; Jan Platos; | Scientific Reports | 2025-12-20 |
| 25 | Task-Oriented Data Synthesis and Control-Rectify Sampling for Remote Sensing Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, the complexity of semantic mask control and the uncertainty of sampling quality often limit the utility of synthetic data in downstream semantic segmentation tasks. To address these challenges, we propose a task-oriented data synthesis framework (TODSynth), including a Multimodal Diffusion Transformer (MM-DiT) with unified triple attention and a plug-and-play sampling strategy guided by task feedback. |
YUNKAI YANG et. al. | arxiv-cs.CV | 2025-12-18 |
| 26 | SynthSeg-Agents: Multi-Agent Synthetic Data Generation for Zero-Shot Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce a novel direction, Zero Shot Weakly Supervised Semantic Segmentation (ZSWSSS), and propose SynthSeg Agents, a multi agent framework driven by Large Language Models (LLMs) to generate synthetic training data entirely without real images. |
Wangyu Wu; Zhenhong Chen; Xiaowei Huang; Fei Ma; Jimin Xiao; | arxiv-cs.CV | 2025-12-17 |
| 27 | Unified Semantic Transformer for 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce UNITE, a Unified Semantic Transformer for 3D scene understanding, a novel feed-forward neural network that unifies a diverse set of 3D semantic tasks within a single model. |
SEBASTIAN KOCH et. al. | arxiv-cs.CV | 2025-12-16 |
| 28 | Dual-Branch Superpixel and Class-Center Attention Network for Efficient Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Specifically, we introduce a superpixel sampling weighting module that models pixel dependencies based on different regional affiliations, thereby enhancing the network’s sensitivity to object boundaries while preserving local features. |
YUNTING ZHANG et. al. | Sensors | 2025-12-16 |
| 29 | JoDiffusion: Jointly Diffusing Image with Pixel-Level Annotations for Semantic Segmentation Promotion Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To migrate both problems with one stone, we present a novel dataset generative diffusion framework for semantic segmentation, termed JoDiffusion. |
HAOYU WANG et. al. | arxiv-cs.CV | 2025-12-15 |
| 30 | Pancakes: Consistent Multi-Protocol Image Segmentation Across Biomedical Domains Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce Pancakes, a framework that, given a new image from a previously unseen domain, automatically generates multi-label segmentation maps for multiple plausible protocols, while maintaining semantic consistency across related images. |
Marianne Rakic; Siyu Gai; Etienne Chollet; John V. Guttag; Adrian V. Dalca; | arxiv-cs.CV | 2025-12-15 |
| 31 | Semantic Segmentation of Remote Sensing Images Via Visible-Infrared Domain Adaptation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, the research of unsupervised domain adaptation method for semantic segmentation of remote sensing images is carried out based on deep learning. |
Yuan Chang; Bin Hui; Qifu Zhang; | International Journal of Pattern Recognition and Artificial … | 2025-12-10 |
| 32 | Modulatory Feedback Determines Attentional Object Segmentation in A Model of The Ventral Stream Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The present study presents a biologically plausible neural network that performs scene segmentation and can shift attention using modulatory feedback connections from higher to lower cortical brain areas. |
Paolo Papale; Jonathan R. Williford; Stijn Balk; Pieter R. Roelfsema; | PLOS One | 2025-12-10 |
| 33 | Human Detection in UAV Thermal Imagery: Dataset Extension and Comparative Evaluation on Embedded Platforms Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing datasets are mostly limited to urban or open-field scenarios, and our experiments show that models trained on such heterogeneous data achieve poor results. To address this gap, we collected and annotated thermal images in mountainous environments using a DJI M3T drone under clear daytime conditions. |
Andrei-Alexandru Ulmămei; Taddeo D’Adamo; Costin-Emanuel Vasile; Radu Hobincu; | Journal of Imaging | 2025-12-09 |
| 34 | SegEarth-OV3: Exploring SAM 3 for Open-Vocabulary Semantic Segmentation in Remote Sensing Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we present a preliminary exploration of applying SAM 3 to the remote sensing OVSS task without any training. |
KAIYU LI et. al. | arxiv-cs.CV | 2025-12-09 |
| 35 | Generalized Referring Expression Segmentation on Aerial Photos Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This work presents Aerial-D, a new large-scale referring expression segmentation dataset for aerial imagery, comprising 37,288 images with 1,522,523 referring expressions that cover 259,709 annotated targets, spanning across individual object instances, groups of instances, and semantic regions covering 21 distinct classes that range from vehicles and infrastructure to land coverage types. |
Luís Marnoto; Alexandre Bernardino; Bruno Martins; | arxiv-cs.CV | 2025-12-08 |
| 36 | Dynamic Mutual Adversarial Learning for Semi-Supervised Semantic Segmentation of Underwater Images with Limited and Noisy Annotations Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we delineate the formulation of a novel semi-supervised paradigm with dynamic mutual adversarial training for the semantic segmentation of underwater images. |
HAN CHEN et. al. | Journal of Marine Science and Engineering | 2025-12-08 |
| 37 | Power of Boundary and Reflection: Semantic Transparent Object Segmentation Using Pyramid Vision Transformer with Transparent Cues Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: While it is known that human perception relies on boundary and reflective-object features to distinguish glass objects, the existing literature has not yet sufficiently captured both properties when handling transparent objects. Hence, we propose incorporating both of these powerful visual cues via the Boundary Feature Enhancement and Reflection Feature Enhancement modules in a mutually beneficial way. |
TUAN-ANH VU et. al. | arxiv-cs.CV | 2025-12-07 |
| 38 | Selective Masking Based Self-Supervised Learning for Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes a novel self-supervised learning method for semantic segmentation using selective masking image reconstruction as the pretraining task. |
Yuemin Wang; Ian Stavness; | arxiv-cs.CV | 2025-12-07 |
| 39 | See in Depth: Training-Free Surgical Scene Segmentation with Monocular Depth Priors Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose depth-guided surgical scene segmentation (DepSeg), a training-free framework that utilizes monocular depth as a geometric prior together with pretrained vision foundation models. |
Kunyi Yang; Qingyu Wang; Cheng Yuan; Yutong Ban; | arxiv-cs.CV | 2025-12-05 |
| 40 | The SAM2-to-SAM3 Gap in The Segment Anything Model Family: Why Prompt-Based Expertise Fails in Concept-Driven Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper investigates the fundamental discontinuity between the latest two Segment Anything Models: SAM2 and SAM3. |
Ranjan Sapkota; Konstantinos I. Roumeliotis; Manoj Karkee; | arxiv-cs.CV | 2025-12-04 |
| 41 | Deep Learning Approach for Crop-weed Segmentation in Peanut Cultivation Using PSPEdgeWeedNet Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this context, PSPEdgeWeedNet is proposed, a novel edge-aware deep learning architecture tailored for precise semantic segmentation of crops and weeds within peanut cultivation fields. Distinct from the conventional Pyramid Scene Parsing Network (PSPNet) and its boundary-aware variant developed as a baseline in this research, PSPEdgeWeedNet introduces a dedicated edge detection branch. |
Deepthi G Pai; Mamatha Balachandra; Radhika Kamath; | Scientific Reports | 2025-12-03 |
| 42 | Evaluating SAM2 for Video Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The Segmentation Anything Model 2 (SAM2) has proven to be a powerful foundation model for promptable visual object segmentation in both images and videos, capable of storing object-aware memories and transferring them temporally through memory blocks. |
SYED HESHAM SYED ARIFF et. al. | arxiv-cs.CV | 2025-12-01 |
| 43 | Reducing Semantic Ambiguity in Open-vocabulary Remote Sensing Image Segmentation Via Knowledge Graph-enhanced Class Representations Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Wubiao Huang; Huchen Li; Shuai Zhang; Fei Deng; | ISPRS Journal of Photogrammetry and Remote Sensing | 2025-11-30 |
| 44 | The Role of U-Net Variants in Semantic Segmentation of Remote Sensing Images: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This survey systematically reviews major U-Net extensions (including U-Net++, ResUNet-a, HCANet, CCT-Net, DIResUNet, CM-UNet, TransUNet, AER-UNet and U-KAN) and additional optimization techniques such as incremental learning. |
Yiyang Liu; | Applied and Computational Engineering | 2025-11-26 |
| 45 | A Fast and Efficient Modern BERT Based Text-Conditioned Diffusion Model for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose FastTextDiff, a label-efficient diffusion-based segmentation model that integrates medical text annotations to enhance semantic representations. |
Venkata Siddharth Dhara; Pawan Kumar; | arxiv-cs.CV | 2025-11-26 |
| 46 | Image Diffusion Models Exhibit Emergent Temporal Propagation in Videos Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we investigate their self-attention maps can be reinterpreted as semantic label propagation kernels, providing robust pixel-level correspondences between relevant image regions. |
Youngseo Kim; Dohyun Kim; Geohee Han; Paul Hongsuck Seo; | arxiv-cs.CV | 2025-11-25 |
| 47 | Machine Learning Segmentation for Microscopy with Domain-Informed Targets Via Custom Loss Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Here, we develop new regularization loss terms that incorporate domain knowledge into the training of a tree-based machine learning classification model, and demonstrate that the predicted segmentation can be tuned without modifying the training labels. |
Nina Prakash; Paul Gasper; Francois L. E. Usseglio-Viretta; | ECS Meeting Abstracts | 2025-11-24 |
| 48 | SAM3-Adapter: Efficient Adaptation of Segment Anything 3 for Camouflage Object Segmentation, Shadow Detection, and Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: With the emergence of Segment Anything 3 (SAM3)-a more efficient and higher-performing evolution with a redesigned architecture and improved training pipeline-we revisit these long-standing challenges. In this work, we present SAM3-Adapter, the first adapter framework tailored for SAM3 that unlocks its full segmentation capability. |
TIANRUN CHEN et. al. | arxiv-cs.CV | 2025-11-24 |
| 49 | Vision–Language Enhanced Foundation Model for Semi-supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we integrate VLM-based segmentation into semi-supervised medical image segmentation by introducing a Vision-Language Enhanced Semi-supervised Segmentation Assistant (VESSA) that incorporates foundation-level visual-semantic understanding into SSL frameworks. |
JIAQI GUO et. al. | arxiv-cs.CV | 2025-11-24 |
| 50 | DiffSeg30k: A Multi-Turn Diffusion Editing Benchmark for Localized AIGC Detection Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce DiffSeg30k, a publicly available dataset of 30k diffusion-edited images with pixel-level annotations, designed to support fine-grained detection. |
Hai Ci; Ziheng Peng; Pei Yang; Yingxin Xuan; Mike Zheng Shou; | arxiv-cs.CV | 2025-11-24 |
| 51 | MedSAM3: Delving Into Segment Anything with Medical Concepts Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Here, we propose MedSAM-3, a text promptable medical segmentation model for medical image and video segmentation. |
ANGLIN LIU et. al. | arxiv-cs.CV | 2025-11-24 |
| 52 | SegSplat: Feed-forward Gaussian Splatting and Open-Set Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We have introduced SegSplat, a novel framework designed to bridge the gap between rapid, feed-forward 3D reconstruction and rich, open-vocabulary semantic understanding. |
Peter Siegel; Federico Tombari; Marc Pollefeys; Daniel Barath; | arxiv-cs.CV | 2025-11-23 |
| 53 | Improved COOT Optimization: An Approach to Multilevel Thresholding in Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes the application of an improved COOT (ICOOT) optimization algorithm for multilevel image thresholding. |
SIMRANDEEP SINGH et. al. | Scientific Reports | 2025-11-21 |
| 54 | Analysis of Pedestrian Semantic Segmentation Technology in Autonomous Driving Scenarios Under Occlusion Conditions Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents a comprehensive survey of both traditional and occlusion-aware semantic segmentation approaches, with a structured analysis of their evolution, strengths, and limitations. |
Yingxin He; | Scientific Journal of Technology | 2025-11-21 |
| 55 | Graph Neural Networks for Surgical Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Methods: We propose two segmentation models integrating Vision Transformer (ViT) feature encoders with Graph Neural Networks (GNNs) to explicitly model spatial relationships between anatomical regions. |
Yihan Li; Nikhil Churamani; Maria Robu; Imanol Luengo; Danail Stoyanov; | arxiv-cs.CV | 2025-11-20 |
| 56 | VideoSeg-R1:Reasoning Video Object Segmentation Via Reinforcement Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Traditional video reasoning segmentation methods rely on supervised fine-tuning, which limits generalization to out-of-distribution scenarios and lacks explicit reasoning. To address this, we propose \textbf{VideoSeg-R1}, the first framework to introduce reinforcement learning into video reasoning segmentation. |
Zishan Xu; Yifu Guo; Yuquan Lu; Fengyu Yang; Junxin Li; | arxiv-cs.CV | 2025-11-20 |
| 57 | MaskMed: Decoupled Mask and Class Prediction for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose a unified decoupled segmentation head that separates multi-class prediction into class-agnostic mask prediction and class label prediction using shared object queries. |
Bin Xie; Gady Agam; | arxiv-cs.CV | 2025-11-19 |
| 58 | Re-purposing SAM Into Efficient Visual Projectors for MLLM-Based Referring Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Inspired by text tokenizers, we propose a novel semantic visual projector that leverages semantic superpixels generated by SAM to identify “visual words” in an image. |
Xiaobo Yang; Xiaojin Gong; | ACM Transactions on Multimedia Computing, Communications, … | 2025-11-19 |
| 59 | InfoCLIP: Bridging Vision-Language Pretraining and Open-Vocabulary Semantic Segmentation Via Information-Theoretic Alignment Transfer Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To stabilize modality alignment during fine-tuning, we propose InfoCLIP, which leverages an information-theoretic perspective to transfer alignment knowledge from pretrained CLIP to the segmentation task. |
MUYAO YUAN et. al. | arxiv-cs.CV | 2025-11-19 |
| 60 | DiffPixelFormer: Differential Pixel-Aware Transformer for RGB-D Indoor Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Although RGB-D fusion leverages complementary appearance and geometric cues, existing methods often depend on computationally intensive cross-attention mechanisms and insufficiently model intra- and inter-modal feature relationships, resulting in imprecise feature alignment and limited discriminative representation. To address these challenges, we propose DiffPixelFormer, a differential pixel-aware Transformer for RGB-D indoor scene segmentation that simultaneously enhances intra-modal representations and models inter-modal interactions. |
YAN GONG et. al. | arxiv-cs.CV | 2025-11-17 |
| 61 | Analysis of Various Image Segmentation Techniques on Retinal Oct Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To reduce the speckle noise during preprocessing, the wiener filter approach is used. |
G. Vyshnavi; Dr. G. Jhansi Reddy; T. Suchitra; M. Sharanya; | INTERNATIONAL JOURNAL OF SCIENTIFIC RESEARCH IN ENGINEERING … | 2025-11-17 |
| 62 | DBGroup: Dual-Branch Point Grouping for Weakly Supervised 3D Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, these approaches still encounter limitations, including labor-intensive annotation processes, high complexity, and reliance on expert annotators. To address these challenges, we propose \textbf{DBGroup}, a two-stage weakly supervised 3D instance segmentation framework that leverages scene-level annotations as a more efficient and scalable alternative. |
Xuexun Liu; Xiaoxu Xu; Qiudan Zhang; Lin Ma; Xu Wang; | arxiv-cs.CV | 2025-11-13 |
| 63 | TrueCity: Real and Simulated Urban Data for Cross-Domain 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce TrueCity,the first urban semantic segmentation benchmark with cm-accurate annotatedreal-world point clouds, semantic 3D city models, and annotated simulated pointclouds representing the same city. |
DUC NGUYEN et. al. | arxiv-cs.CV | 2025-11-10 |
| 64 | Enhancing Semantic Segmentation with A Boundary-sensitive Loss Function: A Novel Approach Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes a novel boundary-sensitive loss function, which combines region loss and boundary loss, to enhance both region consistency and edge delineation in segmentation tasks. |
Ganesh R. Padalkar; Madhuri B. Khambete; | International Journal of Electrical and Computer … | 2025-11-08 |
| 65 | Seg4Diff: Unveiling Open-Vocabulary Semantic Segmentation in Text-to-Image Diffusion Transformers Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce Seg4Diff (Segmentation for Diffusion), a systematic framework for analyzing the attention structures of MM-DiT, with a focus on how specific layers propagate semantic information from text to image. |
CHAEHYUN KIM et. al. | nips | 2025-11-07 |
| 66 | GMM-based VAE Model with Normalising Flow for Effective Stochastic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This work propose a novel framework by integrating Gaussian Mixture Model (GMM) with Normalizing Flow (NF) in CVAE for stochastic segmentation. |
Conghui Li; Chern Hong Lim; Xin Wang; | nips | 2025-11-07 |
| 67 | $\epsilon$-Seg: Sparsely Supervised Semantic Segmentation of Microscopy Data Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce $\epsilon$-Seg, a method based on hierarchical variational autoencoders (HVAEs), employing center-region masking, sparse label contrastive learning (CL), a Gaussian mixture model (GMM) prior, and clustering-free label prediction. |
Sheida RahnamaiKordasiabi; Damian Dalle Nogare; Florian Jug; | nips | 2025-11-07 |
| 68 | UniMRSeg: Unified Modality-Relax Segmentation Via Hierarchical Self-Supervised Compensation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose a unified modality-relax segmentation network (UniMRSeg) through hierarchical self-supervised compensation (HSSC). |
XIAOQI ZHAO et. al. | nips | 2025-11-07 |
| 69 | ARGenSeg: Image Segmentation with Autoregressive Image Generation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: These methods rely on discrete representations or semantic prompts fed into task-specific decoders, which limits the ability of the MLLM to capture fine-grained visual details. To address these challenges, we introduce a segmentation framework for MLLM based on image generation, which naturally produces dense masks for target objects. |
XIAOLONG WANG et. al. | nips | 2025-11-07 |
| 70 | UFO: A Unified Approach to Fine-grained Visual Perception Via Open-ended Language Interface IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This is primarily because these tasks often rely heavily on task-specific designs and architectures that can complicate the modeling process. To address this challenge, we present UFO, a framework that unifies fine-grained visual perception tasks through an open-ended language interface. |
HAO TANG et. al. | nips | 2025-11-07 |
| 71 | FOCUS: Unified Vision-Language Modeling for Interactive Editing Driven By Referential Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To enable accurate and controllable image editing, we propose a progressive multi-stage training pipeline, where segmentation masks are jointly optimized and used as spatial condition prompts to guide the diffusion decoder. |
FAN YANG et. al. | nips | 2025-11-07 |
| 72 | Pancakes: Consistent Multi-Protocol Image Segmentation Across Biomedical Domains Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce _Pancakes_, a framework that, given a new image from a previously unseen domain, automatically generates multi-label segmentation maps for _multiple_ plausible protocols, while maintaining semantic consistency across related images. |
Marianne Rakic; Siyu Gai; Etienne Chollet; John Guttag; Adrian V Dalca; | nips | 2025-11-07 |
| 73 | No Object Is An Island: Enhancing 3D Semantic Segmentation Generalization with Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a novel cross-modal learning framework based on diffusion models to enhance the generalization of 3D semantic segmentation, named XDiff3D. |
Fan Li; Xuan Wang; Xuanbin Wang; Zhaoxiang Zhang; Yuelei Xu; | nips | 2025-11-07 |
| 74 | Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We introduce Vireo, a novel single-stage framework for OV-DGSS that unifies the strengths of OVSS and DGSS for the first time. |
SIYU CHEN et. al. | nips | 2025-11-07 |
| 75 | LangHOPS: Language Grounded Hierarchical Open-Vocabulary Part Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose LangHOPS, the first Multimodal Large Language Model (MLLM)-based framework for open-vocabulary object–part instance segmentation. |
YANG MIAO et. al. | nips | 2025-11-07 |
| 76 | Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In response, we propose a novel TTA method tailored to adapting VLMs for segmentation during test time. |
MEHRDAD NOORI et. al. | nips | 2025-11-07 |
| 77 | Autonomous Semantic Mapping for SLAM Systems IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We proposed an autonomous semantic mapping approach that integrates multimodal semantic segmentation and SLAM techniques to construct a dense 3D semantic map in real time. |
YONG HE et. al. | ISPRS Annals of the Photogrammetry, Remote Sensing and … | 2025-11-03 |
| 78 | MicroAUNet: Boundary-Enhanced Multi-scale Fusion with Knowledge Distillation for Colonoscopy Polyp Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, current deep learning-based polyp segmentation modelseither compromise clinical decision-making by providing ambiguous polyp marginsin segmentation outputs or rely on heavy architectures with high computationalcomplexity, resulting in insufficient inference speeds for real-time colorectalendoscopic applications. To address this problem, we propose MicroAUNet, alight-weighted attention-based segmentation network that combinesdepthwise-separable dilated convolutions with a single-path, parameter-sharedchannel-spatial attention block to strengthen multi-scale boundary features. |
Ziyi Wang; Yuanmei Zhang; Dorna Esrafilzadeh; Ali R. Jalili; Suncheng Xiang; | arxiv-cs.CV | 2025-11-02 |
| 79 | RM2Occ: Re-Projection Multi-Task Multi-Sensor Fusion for Autonomous Driving 3D Object Detection and Occupancy Perception Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Occupancy prediction plays a crucial role in supporting autonomous driving planning and decision-making. Existing methods typically rely on modular stacking and fusion techniques … |
YILONG REN et. al. | IEEE Transactions on Intelligent Transportation Systems | 2025-11-01 |
| 80 | Bridging The Semantic Gap in Medical Image Segmentation Via Multi-scale Dependency and Attention-guided Enhancement Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
MINGRONG LI et. al. | Scientific Reports | 2025-10-30 |
| 81 | Key Technologies for Real-time Localization and Scene Semantic Segmentation of Mobile Robots in Dynamic Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Specifically, the team investigate vision- and lidar-based SLAM methods, lightweight deep neural networks for semantic segmentation, and a multi-sensor fusion framework in various dynamic scenarios. |
Long-Xue Cheng; Jun-Xia Han; | Journal of Computers | 2025-10-28 |
| 82 | A Wheat Spike Image Segmentation Method Based on Improved U-Net Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To address issues such as small color differences in the background of wheat spike images in the field and low segmentation accuracy, this article proposes a wheat spike segmentation method called SAU-Net (Striped Pooling and Attention Mechanism optimized U-Net). |
XIAOLEI WANG et. al. | PeerJ Computer Science | 2025-10-27 |
| 83 | ACS-SegNet: An Attention-Based CNN-SegFormer Segmentation Network for Tissue Segmentation in Histopathology Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In thisstudy, we propose a novel approach based on attention-driven feature fusion ofconvolutional neural networks (CNNs) and vision transformers (ViTs) within aunified dual-encoder model to improve semantic segmentation performance.Evaluation on two publicly available datasets showed that our model achieved{\mu}IoU/{\mu}Dice scores of 76.79%/86.87% on the GCPS dataset and64.93%/76.60% on the PUMA dataset, outperforming state-of-the-art and baselinebenchmarks. |
Nima Torbati; Anastasia Meshcheryakova; Ramona Woitek; Diana Mechtcheriakova; Amirreza Mahbod; | arxiv-cs.CV | 2025-10-23 |
| 84 | IMPROVING UNDERWATER SEMANTIC SEGMENTATION VIA ADAPTIVE FREQUENCY-AWARE ENHANCEMENT AND THE SEGMENT ANYTHING MODEL Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This study proposes a hybrid deep learning framework that combines image enhancement with robust segmentation. |
Abhisheka Thumbesara Eshwara; | International Journal of Applied Mathematics | 2025-10-22 |
| 85 | Communication-Efficient Multi-Vehicle Collaborative Semantic Segmentation Via Sparse 3D Gaussian Sharing Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing methods share the entire dense, scene-level BEV feature, which contains significant redundancy and lacks height information, ultimately leading to unavoidable bandwidth waste and performance degradation. To address these challenges, we present GSCOOP, the first collaborative semantic segmentation framework that leverages sparse, object-centric 3D Gaussians to fundamentally overcome communication bottlenecks. |
TIANYU HONG et. al. | iccv | 2025-10-20 |
| 86 | EVOLVE: Event-Guided Deformable Feature Transfer and Dual-Memory Refinement for Low-Light Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Video Object Segmentation (VOS) in low-light scenarios remains highly challenging due to significant texture loss and severe noise, which often lead to unreliable image feature generation and degraded segmentation performance. To address this issue, we propose EVOLVE, a novel event-guided deformable feature transfer and dual-memory refinement framework for low-light VOS. |
Jong-Hyeon Baek; Jiwon Oh; Yeong Jun Koh; | iccv | 2025-10-20 |
| 87 | AM-Adapter: Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis In-the-Wild Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: During inference, we introduce an automated exemplar retrieval method for selecting exemplar image-segmentation pairs efficiently. |
SIYOON JIN et. al. | iccv | 2025-10-20 |
| 88 | SPA: Efficient User-Preference Alignment Against Uncertainty in Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: While prior uncertainty-aware and interactive methods offer adaptability, they are inefficient at test time: uncertainty-aware models require users to choose from numerous similar outputs, while interactive models demand significant user input through click or box prompts to refine segmentation. To address these challenges, we propose SPA, a new Segmentation Preference Alignment framework that efficiently adapts to diverse test-time preferences with minimal human interaction. |
Jiayuan Zhu; Junde Wu; Cheng Ouyang; Konstantinos Kamnitsas; J. Alison Noble; | iccv | 2025-10-20 |
| 89 | Revisiting Efficient Semantic Segmentation: Learning Offsets for Better Spatial and Class Feature Alignment Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: With experimental analysis, we find that this paradigm results in a highly challenging assumption for efficient scenarios: Image pixel features should not vary for the same category in different images. To address this dilemma, we propose a coupled dual-branch offset learning paradigm that explicitly learns feature and class offsets to dynamically refine both class representations and spatial image features. |
Shi-Chen Zhang; Yunheng Li; Yu-Huan Wu; Qibin Hou; Ming-Ming Cheng; | iccv | 2025-10-20 |
| 90 | Correspondence As Video: Test-Time Adaption on SAM2 for Reference Segmentation in The Wild Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we propose a novel approach by representing the inherent correspondence between reference-target image pairs as a pseudo video. |
Haoran Wang; Zekun Li; Jian Zhang; Lei Qi; Yinghuan Shi; | iccv | 2025-10-20 |
| 91 | Identity-aware Language Gaussian Splatting for Open-vocabulary 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This inconsistency highly results in mis-labeling where different language embeddings are assigned to the same part of an object. To address this issue, we propose a simple yet powerful method that aligns language embeddings via the identity information. |
SungMin Jang; Wonjun Kim; | iccv | 2025-10-20 |
| 92 | Feature Purification Matters: Suppressing Outlier Propagation for Training-Free Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a Self-adaptive Feature Purifier framework (SFP) to suppress propagated outliers and enhance semantic representations for open-vocabulary semantic segmentation. |
SHUO JIN et. al. | iccv | 2025-10-20 |
| 93 | MaskSAM: Auto-prompt SAM with Mask Classification for Volumetric Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, SAM is not directly applicable to medical image segmentation due to its inability to predict semantic labels, reliance on additional prompts, and suboptimal performance in this domain. To address these limitations, we propose MaskSAM, a novel prompt-free SAM adaptation framework for medical image segmentation based on mask classification. |
BIN XIE et. al. | iccv | 2025-10-20 |
| 94 | Unsupervised Histopathological Image Semantic Segmentation with Overlapping Patches Consistency Constraint Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a context-based Overlapping Patches Consistency Constraint (OPCC), which employs the consistency constraint between the local overlapping region’s similarity and global context similarity, achieving consistent class representation in similar environments. |
WENTIAN CAI et. al. | iccv | 2025-10-20 |
| 95 | CorrCLIP: Reconstructing Patch Correlations in CLIP for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Accordingly, we propose CorrCLIP, which reconstructs the scope and value of patch correlations. |
Dengke Zhang; Fagui Liu; Quan Tang; | iccv | 2025-10-20 |
| 96 | Seeing The Unseen: A Semantic Alignment and Context-Aware Prompt Framework for Open-Vocabulary Camouflaged Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Despite existing open-vocabulary methods exhibit strong segmentation capabilities, they still have a major limitation in camouflaged scenarios: semantic confusion, which leads to incomplete segmentation and class shift in the model. To mitigate the above limitation, we propose a framework for OVCOS, named SuCLIP. |
Peng Ren; Tian Bai; Jing Sun; Fuming Sun; | iccv | 2025-10-20 |
| 97 | Intelligent Communication Mixture-of-Experts Boosted-Medical Image Segmentation Foundation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: 2) We propose a semantic-guided contrastive learning method to addressthe issue of weak supervision in contrastive learning. |
Xinwei Zhang; Hu Chen; Zhe Yuan; Sukun Tian; Peng Feng; | arxiv-cs.CV | 2025-10-20 |
| 98 | DiSCO-3D : Discovering and Segmenting Sub-Concepts from Open-vocabulary Queries in NeRF Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose DiSCO-3D, the first method addressing the broader problem of 3D Open-Vocabulary Sub-concepts Discovery, which aims to provide a 3D semantic segmentation that adapts to both the scene and user queries. |
Doriand Petit; Steve Bourgeois; Vincent Gay-Bellile; Florian Chabot; Loïc Barthe; | iccv | 2025-10-20 |
| 99 | Auto-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce Auto-Vocabulary Semantic Segmentation (AVS), advancing open-ended image understanding by eliminating the necessity to predefine object categories for segmentation. |
Osman Ülger; Maksymilian Kulicki; Yuki Asano; Martin R. Oswald; | iccv | 2025-10-20 |
| 100 | LIRA: Inferring Segmentation in Large Multi-modal Models with Local Interleaved Region Assistance Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: These challenges stem primarily from constraints in weak visual comprehension and a lack of fine-grained perception. To alleviate these limitations, we propose LIRA, a framework that capitalizes on the complementary relationship between visual comprehension and segmentation via two key components: (1) Semantic-Enhanced Feature Extractor (SEFE) improves object attribute inference by fusing semantic and pixel-level features, leading to more accurate segmentation; (2) Interleaved Local Visual Coupling (ILVC) autoregressively generates local descriptions after extracting local features based on segmentation masks, offering fine-grained supervision to mitigate hallucinations. |
ZHANG LI et. al. | iccv | 2025-10-20 |
| 101 | HyPiDecoder: Hybrid Pixel Decoder for Efficient Segmentation and Detection Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, the inefficiency of MSDeformAttn has become a performance bottleneck for segmenters. To address this, we propose the Hyper Pixel Decoder (HyPiDecoder), an improved Pixel Decoder design that replaces parts of the MSDeformAttn layers with convolution-based FPN layers, introducing explicit locality information and significantly boosting inference speed. |
Fengzhe Zhou; Humphrey Shi; | iccv | 2025-10-20 |
| 102 | How Do Optical Flow and Textual Prompts Collaborate to Assist in Audio-Visual Semantic Segmentation? Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce a novel collaborative framework, Stepping Stone Plus (SSP), which integrates optical flow and textual prompts to assist the segmentation process. |
Yujian Lee; Peng Gao; Yongqi Xu; Wentao Fan; | iccv | 2025-10-20 |
| 103 | Alleviating Textual Reliance in Medical Language-guided Segmentation Via Prototype-driven Semantic Approximation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, its inherent reliance on paired image-text input, which we refer to as "textual reliance", presents two fundamental limitations: 1) many medical segmentation datasets lack paired reports, leaving a substantial portion of image-only data underutilized for training; and 2) inference is limited to retrospective analysis of cases with paired reports, limiting its applicability in most clinical scenarios where segmentation typically precedes reporting. To address these limitations, we propose ProLearn, the first Prototype-driven Learning framework for language-guided segmentation that fundamentally alleviates textual reliance. |
Shuchang Ye; Usman Naseem; Mingyuan Meng; Jinman Kim; | iccv | 2025-10-20 |
| 104 | CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we present a novel hierarchical framework, named CLIPer, that hierarchically improves spatial representation of CLIP. |
Lin Sun; Jiale Cao; Jin Xie; Xiaoheng Jiang; Yanwei Pang; | iccv | 2025-10-20 |
| 105 | Images As Noisy Labels: Unleashing The Potential of The Diffusion Model for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a DEnoising learning framework based on the Diffusion model for Open-vocabulary semantic Segmentation, called DEDOS, which is aimed at constructing the scene skeleton. |
Fan Li; Xuanbin Wang; Xuan Wang; Zhaoxiang Zhang; Yuelei Xu; | iccv | 2025-10-20 |
| 106 | Kestrel: 3D Multimodal LLM for Part-Aware Grounded Description Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce Part-Aware Point Grounded Description (PaPGD), a challenging task aimed at advancing 3D multimodal learning for fine-grained, part-aware segmentation grounding and detailed explanation of 3D objects. |
Mahmoud Ahmed; Junjie Fei; Jian Ding; Eslam Mohamed Bakr; Mohamed Elhoseiny; | iccv | 2025-10-20 |
| 107 | Efficient Track Anything Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: The high computation complexity of image encoder and memory module has limited its applications in real-world tasks, e.g., video object segmentation on mobile devices. To address this limitation, we propose EfficientTAMs, lightweight end-to-end track anything models that produce high-quality results with low latency and small model size. |
YUNYANG XIONG et. al. | iccv | 2025-10-20 |
| 108 | Exploring Weather-aware Aggregation and Adaptation for Semantic Segmentation Under Adverse Conditions Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a novel weather-aware aggregation and adaptation network that leverages characteristic knowledge to achieve weather homogenization and enhance scene perception. |
Yuwen Pan; Rui Sun; Wangkai Li; Tianzhu Zhang; | iccv | 2025-10-20 |
| 109 | HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The remarkable performance of large multimodal models (LMMs) has attracted significant interest from the image segmentation community.To align with the next-token-prediction paradigm, current LMM-driven segmentation methods either use object boundary points to represent masks or introduce special segmentation tokens, whose hidden states are decoded by a segmentation model requiring the original image as input.However, these approaches often suffer from inadequate mask representation and complex architectures, limiting the potential of LMMs.In this work, we propose the Hierarchical Mask Tokenizer (HiMTok), which represents segmentation masks with up to 32 tokens and eliminates the need for the original image during mask de-tokenization. |
Tao Wang; Changxu Cheng; Lingfeng Wang; Senda Chen; Wuyue Zhao; | iccv | 2025-10-20 |
| 110 | Neuro-Symbolic Spatial Reasoning in Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In contrast to contemporaryVLM correlation-based approaches, we propose Relational Segmentor (RelateSeg)to impose explicit spatial relational constraints by first order logic (FOL)formulated in a neural network architecture. |
Jiayi Lin; Jiabo Huang; Shaogang Gong; | arxiv-cs.CV | 2025-10-17 |
| 111 | Semantic Segmentation with Coarse Annotations Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes a regularizationmethod for models with an encoder-decoder architecture with superpixel basedupsampling. |
Jort de Jong; Mike Holenderski; | arxiv-cs.CV | 2025-10-17 |
| 112 | A Symmetry-Aware BAS for Improved Fuzzy Intra-Class Distance-Based Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: At present, the Beetle Antennae Search (BAS) algorithm has achieved remarkable success in image segmentation. |
Yazhi Wang; Lei Ding; Qing Zhang; | Symmetry | 2025-10-17 |
| 113 | Multi-Task Traffic Scene Perception Algorithm Based on Multi-Scale Prompter Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The algorithm introduces a multi-scale prompt learning method to obtain rich multi-scale feature maps and prompt words. |
Kaibo Yang; Mingen Zhong; Kang Fan; Jiawei Tan; | Engineering Research Express | 2025-10-15 |
| 114 | The Application and Challenges of Deep Learning in Semantic Segmentation of High-resolution Remote Sensing Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper investigates the application of deep learning technologies in remote sensing image semantic segmentation, based on Convolutional Neural Networks (CNN) and Transformer-based semantic segmentation methods. |
Shijing Hu; | Advances in Engineering Innovation | 2025-10-14 |
| 115 | A Framework for Low-Effort Training Data Generation for Urban Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To addressthis, we present a new framework that adapts an off-the-shelf diffusion modelto a target domain using only imperfect pseudo-labels. |
DENIS ZAVADSKI et. al. | arxiv-cs.CV | 2025-10-13 |
| 116 | EEMS: Edge-Prompt Enhanced Medical Image Segmentation Based on Learnable Gating Mechanism Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce EEMS, a new model for segmentation,combining an Edge-Aware Enhancement Unit (EAEU) and a Multi-scale PromptGeneration Unit (MSPGU). |
HAN XIA et. al. | arxiv-cs.CV | 2025-10-13 |
| 117 | DTEA: Dynamic Topology Weaving and Instability-Driven Entropic Attenuation for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Extensive experiments on threebenchmark datasets show our framework achieves superior segmentation accuracyand better generalization across various clinical settings. |
WEIXUAN LI et. al. | arxiv-cs.CV | 2025-10-13 |
| 118 | Semantic Segmentation Algorithm Based on Light Field and LiDAR Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation serves as a cornerstone of scene understanding inautonomous driving but continues to face significant challenges under complexconditions such as occlusion. … |
Jie Luo; Yuxuan Jiang; Xin Jin; Mingyu Liu; Yihui Fan; | arxiv-cs.CV | 2025-10-08 |
| 119 | MMFNet: A Mamba-Based Multimodal Fusion Network for Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes MMFNet, a novel multimodal fusion network that leverages the Mamba architecture to efficiently capture long-range dependencies for semantic segmentation tasks. |
Jingting Qiu; Wei Chang; Wei Ren; Shanshan Hou; Ronghao Yang; | Sensors | 2025-10-08 |
| 120 | Temporal Prompting Matters: Rethinking Referring Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Most existing methods requireend-to-end training with dense mask annotations, which could becomputation-consuming and less scalable. In this work, we rethink the RVOSproblem and aim to investigate the key to this task. |
CI-SIANG LIN et. al. | arxiv-cs.CV | 2025-10-08 |
| 121 | SquareNet: Multi-scale Progressive Difference and Scale-cross Attention Network for Volumetric Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Specifically, we propose a dual encoder-decoder network architecture comprising a multi-scale progressive difference (MSPD) branch and a group scale-cross attention (GSCA) branch. |
HUAXIANG LIU et. al. | Engineering Research Express | 2025-10-08 |
| 122 | Semantic Knowledge Transfer for Semi-supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Shiwei Zhou; Haifeng Zhao; Leilei Ma; Dengdi Sun; | Eng. Appl. Artif. Intell. | 2025-10-01 |
| 123 | Video Object Segmentation-Aware Audio Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Inparticular, these models focus on the entire video and do not provide precisemethods for prioritizing a specific object within a scene, generatingunnecessary background sounds, or focusing on the wrong objects. To addressthis gap, we introduce the novel task of video object segmentation-aware audiogeneration, which explicitly conditions sound synthesis on object-levelsegmentation maps. |
Ilpo Viertola; Vladimir Iashin; Esa Rahtu; | arxiv-cs.CV | 2025-09-30 |
| 124 | CORE-3D: Context-aware Open-vocabulary Retrieval By Embeddings in 3D Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, these methods oftenproduce fragmented masks and inaccurate semantic assignments due to the directuse of raw masks, limiting their effectiveness in complex environments. Toaddress this, we leverage SemanticSAM with progressive granularity refinementto generate more accurate and numerous object-level masks, mitigating theover-segmentation commonly observed in mask generation models such as vanillaSAM, and improving downstream 3D semantic segmentation. |
Mohamad Amin Mirzaei; Pantea Amoie; Ali Ekhterachian; Matin Mirzababaei; Babak Khalaj; | arxiv-cs.CV | 2025-09-29 |
| 125 | 2nd Place Report of MOSEv2 Challenge 2025: Concept Guided Video Object Segmentation Via SeC Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we evaluate itszero-shot performance on the challenging coMplex video Object SEgmentation v2(MOSEv2) dataset. |
ZHIXIONG ZHANG et. al. | arxiv-cs.CV | 2025-09-28 |
| 126 | SSVIF: Self-Supervised Segmentation-Oriented Visible and Infrared Image Fusion Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However,compared to traditional methods, application-oriented VIF methods requiredatasets labeled for downstream tasks (e.g., semantic segmentation or objectdetection), making data acquisition labor-intensive and time-consuming. Toaddress this issue, we propose a self-supervised training framework forsegmentation-oriented VIF methods (SSVIF). |
Zixian Zhao; Xingchen Zhang; | arxiv-cs.CV | 2025-09-26 |
| 127 | SwinMamba: A Hybrid Local-global Mamba Framework for Enhancing Semantic Segmentation of Remotely Sensed Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of remote sensing imagery is a fundamental task incomputer vision, supporting a wide range of applications such as land useclassification, urban planning, … |
Qinfeng Zhu; Han Li; Liang He; Lei Fan; | arxiv-cs.CV | 2025-09-25 |
| 128 | Boosting LiDAR-Based Localization with Semantic Insight: Camera Projection Versus Direct LiDAR Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Semantic segmentation of LiDAR data presents considerable challenges,particularly when dealing with diverse sensor types and configurations.However, incorporating semantic information can significantly enhance theaccuracy and robustness of LiDAR-based localization techniques for autonomousmobile systems. We propose an approach that integrates semantic camera datawith LiDAR segmentation to address this challenge. |
Sven Ochs; Philip Schörner; Marc René Zofka; J. Marius Zöllner; | arxiv-cs.RO | 2025-09-24 |
| 129 | Seg4Diff: Unveiling Open-Vocabulary Segmentation in Text-to-Image Diffusion Transformers Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce Seg4Diff(Segmentation for Diffusion), a systematic framework for analyzing theattention structures of MM-DiT, with a focus on how specific layers propagatesemantic information from text to image. |
CHAEHYUN KIM et. al. | arxiv-cs.CV | 2025-09-22 |
| 130 | DyGLNet: Hybrid Global-Local Feature Fusion with Dynamic Upsampling for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes the DyGLNet, which achievesefficient and accurate segmentation by fusing global and local features with adynamic upsampling mechanism. |
Yican Zhao; Ce Wang; You Hao; Lei Li; Tianli Liao; | arxiv-cs.CV | 2025-09-16 |
| 131 | MSGFusion: Multimodal Scene Graph-Guided Infrared and Visible Image Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Infrared and visible image fusion has garnered considerable attention owingto the strong complementarity of these two modalities in complex, harshenvironments. While deep … |
Guihui Li; Bowei Dong; Kaizhi Dong; Jiayi Li; Haiyong Zheng; | arxiv-cs.CV | 2025-09-16 |
| 132 | Instance-Guided Class Activation Mapping for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Our approach demonstrates superior localization accuracy, withcomplete object coverage and precise boundary delineation, while maintainingcomputational efficiency. |
Ali Torabi; Sanjog Gaihre; MD Mahbubur Rahman; Yaqoob Majeed; | arxiv-cs.CV | 2025-09-15 |
| 133 | MAFS: Masked Autoencoder for Infrared-Visible Image Fusion and Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Infrared-visible image fusion methods aim at generating fused images withgood visual quality and also facilitate the performance of high-level tasks.Indeed, existing semantic-driven methods have considered semantic informationinjection for downstream applications. |
Liying Wang; Xiaoli Zhang; Chuanmin Jia; Siwei Ma; | arxiv-cs.CV | 2025-09-15 |
| 134 | Microsurgical Instrument Segmentation for Robot-Assisted Surgery Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose Microsurgery Instrument Segmentation for RoboticAssistance(MISRA), a segmentation framework that augments RGB input withluminance channels, integrates skip attention to preserve elongated features,and employs an Iterative Feedback Module(IFM) for continuity restoration acrossmultiple passes. |
Tae Kyeong Jeong; Garam Kim; Juyoun Park; | arxiv-cs.CV | 2025-09-15 |
| 135 | OpenUrban3D: Annotation-Free Open-Vocabulary Semantic Segmentation of Large-Scale Urban Point Clouds Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The mainobstacles are the frequent absence of high-quality, well-aligned multi-viewimagery in large-scale urban point cloud datasets and the poor generalizationof existing three-dimensional (3D) segmentation pipelines across diverse urbanenvironments with substantial variation in geometry, scale, and appearance. Toaddress these challenges, we present OpenUrban3D, the first 3D open-vocabularysemantic segmentation framework for large-scale urban scenes that operateswithout aligned multi-view images, pre-trained point cloud segmentationnetworks, or manual annotations. |
Chongyu Wang; Kunlei Jing; Jihua Zhu; Di Wang; | arxiv-cs.CV | 2025-09-13 |
| 136 | SCOPE: Speech-guided COllaborative PErception Framework for Surgical Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce a speech-guided collaborativeperception (SCOPE) framework that integrates reasoning capabilities of largelanguage model (LLM) with perception capabilities of open-set VFMs to supporton-the-fly segmentation, labeling and tracking of surgical instruments andanatomy in intraoperative video streams. |
Jecia Z. Y. Mao; Francis X Creighton; Russell H Taylor; Manish Sahu; | arxiv-cs.CV | 2025-09-12 |
| 137 | Point Linguist Model: Segment Any Object Via Bridged Large 3D-Language Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: At the output stage, predictions depend only on dense featureswithout explicit geometric cues, leading to a loss of fine-grained accuracy. Toaddress these limitations, we present the Point Linguist Model (PLM), a generalframework that bridges the representation gap between LLMs and dense 3D pointclouds without requiring large-scale pre-alignment between 3D-text or3D-images. |
Zhuoxu Huang; Mingqi Gao; Jungong Han; | arxiv-cs.CV | 2025-09-09 |
| 138 | SEEC: Segmentation-Assisted Multi-Entropy Models for Learned Lossless Image Compression Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, most existingapproaches employ a single entropy model to estimate the probabilitydistribution of pixel values across the entire image, which limits theirability to capture the diverse statistical characteristics of differentsemantic regions. To overcome this limitation, we propose Segmentation-AssistedMulti-Entropy Models for Lossless Image Compression (SEEC). |
Chunhang Zheng; Zichang Ren; Dou Li; | arxiv-cs.CV | 2025-09-09 |
| 139 | Text4Seg++: Advancing Image Segmentation Via Generative Language Modeling Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose anovel text-as-mask paradigm that casts image segmentation as a text generationproblem, eliminating the need for additional decoders and significantlysimplifying the segmentation process. |
MENGCHENG LAN et. al. | arxiv-cs.CV | 2025-09-08 |
| 140 | Unleashing Hierarchical Reasoning: An LLM-Driven Framework for Training-Free Referring Video Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Referring Video Object Segmentation (RVOS) aims to segment an object ofinterest throughout a video based on a language description. The prominentchallenge lies in aligning static … |
BINGRUI ZHAO et. al. | arxiv-cs.CV | 2025-09-06 |
| 141 | CSFAFormer: Category-selective Feature Aggregation Transformer for Multimodal Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Yue Ni; Donglin Xue; Weijian Chi; Ji Luan; Jiahang Liu; | Inf. Fusion | 2025-09-01 |
| 142 | Multiview Space Function Classification in Apartment Buildings Using Image Deep-Learning Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Amir Ziaee; Georg Suter; | J. Comput. Civ. Eng. | 2025-09-01 |
| 143 | Guided Model-based LiDAR Super-Resolution for Resource-Efficient Automotive Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To overcomethis, we introduce the first end-to-end framework that jointly addresses LiDARsuper-resolution (SR) and semantic segmentation. |
Alexandros Gkillas; Nikos Piperigkos; Aris S. Lalos; | arxiv-cs.CV | 2025-09-01 |
| 144 | Domain Consistency Learning for Continual Test-time Adaptation in Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Yanyu Ye; Wei Wei; Lei Zhang; Chen Ding; Yanning Zhang; | Pattern Recognit. | 2025-09-01 |
| 145 | VoCap: Video Object Captioning and Segmentation from Any Prompt Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Understanding objects in videos in terms of fine-grained localization masksand detailed semantic properties is a fundamental task in video understanding.In this paper, we propose VoCap, a flexible video model that consumes a videoand a prompt of various modalities (text, box or mask), and produces aspatio-temporal masklet with a corresponding object-centric caption. |
JASPER UIJLINGS et. al. | arxiv-cs.CV | 2025-08-29 |
| 146 | GS: Generative Segmentation Via Label Diffusion Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, wepropose GS (Generative Segmentation), a novel framework that formulatessegmentation itself as a generative task via label diffusion. |
Yuhao Chen; Shubin Chen; Liang Lin; Guangrun Wang; | arxiv-cs.CV | 2025-08-27 |
| 147 | LabelGS: Label-Aware 3D Gaussian Splatting for 3D Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The identification andisolating of specific object components is crucial. To address this limitation,we propose Label-aware 3D Gaussian Splatting (LabelGS), a method that augmentsthe Gaussian representation with object label.LabelGS introduces cross-viewconsistent semantic masks for 3D Gaussians and employs a novel OcclusionAnalysis Model to avoid overfitting occlusion during optimization, MainGaussian Labeling model to lift 2D semantic prior to 3D Gaussian and GaussianProjection Filter to avoid Gaussian label conflict. |
YUPENG ZHANG et. al. | arxiv-cs.CV | 2025-08-27 |
| 148 | ArgusCogito: Chain-of-Thought for Cross-Modal Synergy and Omnidirectional Reasoning in Camouflaged Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Camouflaged Object Segmentation (COS) poses a significant challenge due tothe intrinsic high similarity between targets and backgrounds, demanding modelscapable of profound … |
JIANWEN TAN et. al. | arxiv-cs.CV | 2025-08-25 |
| 149 | Unlocking Robust Semantic Segmentation Performance Via Label-only Elastic Deformations Against Implicit Label Noise Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Typical dataaugmentation methods, which apply identical transformations to the image andits label, risk amplifying these subtle imperfections and limiting the model’sgeneralization capacity. In this paper, we introduce NSegment+, a novelaugmentation framework that decouples image and label transformations toaddress such realistic noise for semantic segmentation. |
YECHAN KIM et. al. | arxiv-cs.CV | 2025-08-14 |
| 150 | Semantic-aware DropSplat: Adaptive Pruning of Redundant Gaussians for 3D Aerial-View Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This limits theirsegmentation accuracy and consistency. To tackle these challenges, we propose anovel 3D-AVS-SS approach named SAD-Splat. |
Xu Tang; Junan Jia; Yijing Wang; Jingjing Ma; Xiangrong Zhang; | arxiv-cs.CV | 2025-08-13 |
| 151 | Multi-Sequence Parotid Gland Lesion Segmentation Via Expert Text-Guided Segment Anything Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Besides, current medical image segmentation methods areautomatically generated, ignoring the domain knowledge of medical experts whenperforming segmentation. To address these limitations, we propose the parotidgland segment anything model (PG-SAM), an expert diagnosis text-guided SAMincorporating expert domain knowledge for cross-sequence parotid gland lesionsegmentation. |
ZHONGYUAN WU et. al. | arxiv-cs.CV | 2025-08-13 |
| 152 | SAGOnline: Segment Any Gaussians Online Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Current methods suffer from prohibitive computationalcosts, limited 3D spatial reasoning, and an inability to track multiple objectssimultaneously. We present Segment Any Gaussians Online (SAGOnline), alightweight and zero-shot framework for real-time 3D segmentation in Gaussianscenes that addresses these limitations through two key innovations: (1) adecoupled strategy that integrates video foundation models (e.g., SAM2) forview-consistent 2D mask propagation across synthesized views; and (2) aGPU-accelerated 3D mask generation and Gaussian-level instance labelingalgorithm that assigns unique identifiers to 3D primitives, enabling losslessmulti-object tracking and segmentation across views. |
WENTAO SUN et. al. | arxiv-cs.CV | 2025-08-11 |
| 153 | Instance Segmentation of Scene Sketches Using Natural Image Priors Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce InkLayer, a method for instance segmentation of raster scene sketches. |
Mia Tang; Yael Vinker; Chuan Yan; Lvmin Zhang; Maneesh Agrawala; | siggraph | 2025-08-10 |
| 154 | A Semantic Segmentation Algorithm for Pleural Effusion Based on DBIF-AUNet Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing methods often struggle with diverse image variations andcomplex edges, primarily because direct feature concatenation causes semanticgaps. To address these challenges, we propose the Dual-Branch InteractiveFusion Attention model (DBIF-AUNet). |
RUIXIANG TANG et. al. | arxiv-cs.CV | 2025-08-08 |
| 155 | TEFormer: Texture-Aware and Edge-Guided Transformer for Semantic Segmentation of Urban Remote Sensing Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However,geospatial objects often exhibit subtle texture differences and similar spatialstructures, which can easily lead to semantic ambiguity and misclassification.Moreover, challenges such as irregular object shapes, blurred boundaries, andoverlapping spatial distributions of semantic objects contribute to complex anddiverse edge morphologies, further complicating accurate segmentation. Totackle these issues, we propose a texture-aware and edge-guided Transformer(TEFormer) that integrates texture awareness and edge-guidance mechanisms forsemantic segmentation of URSIs. |
Guoyu Zhou; Jing Zhang; Yi Yan; Hui Zhang; Li Zhuo; | arxiv-cs.CV | 2025-08-08 |
| 156 | Open-world Point Cloud Semantic Segmentation: A Human-in-the-loop Framework Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To address theselimitations, we propose HOW-Seg, the first human-in-the-loop framework forOW-Seg. |
Peng Zhang; Songru Yang; Jinsheng Sun; Weiqing Li; Zhiyong Su; | arxiv-cs.CV | 2025-08-06 |
| 157 | Dynamic Robot-Assisted Surgery with Hierarchical Class-Incremental Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work,we build upon the recently introduced Taxonomy-Oriented Poincar\’e-regularizedIncremental Class Segmentation (TOPICS) approach and propose an enhancedvariant, termed TOPICS+, specifically tailored for robust segmentation ofsurgical scenes. |
JULIA HINDEL et. al. | arxiv-cs.CV | 2025-08-03 |
| 158 | Uncertainty-Aware Segmentation Quality Prediction Via Deep Learning Bayesian Modeling: Comprehensive Evaluation and Interpretation on Skin Cancer and Liver Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We present Bayesian adaptations of two benchmarksegmentation models-SwinUNet and Feature Pyramid Network with ResNet50-usingMonte Carlo Dropout, Ensemble, and Test Time Augmentation to quantifyuncertainty. |
SIKHA O K et. al. | arxiv-cs.CV | 2025-08-02 |
| 159 | Transferring Prior Thermal Knowledge for Snowy Urban Scene Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: RGB-thermal (RGB-T) semantic segmentation enables intelligent vehicles to understand environments while operating in urban scenes. However, the research encounters two main … |
XIAODONG GUO et. al. | IEEE Transactions on Intelligent Transportation Systems | 2025-08-01 |
| 160 | A Survey of Deep Learning in Histopathological Nuclear Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Histopathological images contain rich information that can be used to diagnose and monitor disease progression and to predict patient survival. Accurate morphological … |
Lulu Qin; Xiao Yang; Xianhong Xu; Zexuan Zhu; | IEEE Computational Intelligence Magazine | 2025-08-01 |
| 161 | PointGauss: Point Cloud-Guided Multi-Object Segmentation for Gaussian Splatting Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce PointGauss, a novel point cloud-guided framework for real-timemulti-object segmentation in Gaussian Splatting representations. |
WENTAO SUN et. al. | arxiv-cs.CV | 2025-07-31 |
| 162 | Learning Semantic Directions for Feature Augmentation in Domain-Generalized Medical Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This uniquecharacteristic makes medical image segmentation particularly challenging. To address this challenge, we propose a domain generalization frameworktailored for medical image segmentation. |
YINGKAI WANG et. al. | arxiv-cs.CV | 2025-07-31 |
| 163 | Graph-Guided Dual-Level Augmentation for 3D Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, most augmentation strategies only focus on localtransformations or semantic recomposition, lacking the consideration of globalstructural dependencies within scenes. To address this limitation, we propose agraph-guided data augmentation framework with dual-level constraints forrealistic 3D scene synthesis. |
HONGBIN LIN et. al. | arxiv-cs.CV | 2025-07-30 |
| 164 | Dual Cross-image Semantic Consistency with Self-aware Pseudo Labeling for Semi-supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Ingeneral, current approaches, which rely on intra-image pixel-wise consistencytraining via pseudo-labeling, overlook the consistency at more comprehensivesemantic levels (e.g., object region) and suffer from severe discrepancy ofextracted features resulting from an imbalanced number of labeled and unlabeleddata. To overcome these limitations, we present a new \underline{Du}al\underline{C}ross-\underline{i}mage \underline{S}emantic\underline{C}onsistency (DuCiSC) learning framework, for semi-supervisedmedical image segmentation. |
Han Wu; Chong Wang; Zhiming Cui; | arxiv-cs.CV | 2025-07-28 |
| 165 | Solving Scene Understanding for Autonomous Navigation in Unstructured Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The paper discusses thedataset, exploratory data analysis, preparation, implementation of the fivemodels and studies the performance and compares the results achieved in theprocess. |
Naveen Mathews Renji; Kruthika K; Manasa Keshavamurthy; Pooja Kumari; S. Rajarajeswari; | arxiv-cs.CV | 2025-07-27 |
| 166 | Object Segmentation in The Wild with Foundation Models: Application to Vision Assisted Neuro-prostheses for Upper Limbs Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we address the problem of semantic object segmentation usingfoundation models. |
Bolutife Atoki; Jenny Benois-Pineau; Renaud Péteri; Fabien Baldacci; Aymar de Rugy; | arxiv-cs.CV | 2025-07-24 |
| 167 | Robust Noisy Pseudo-label Learning for Semi-supervised Medical Image Segmentation Using Diffusion Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a novel diffusion-based framework forsemi-supervised medical image segmentation. |
LIN XI et. al. | arxiv-cs.CV | 2025-07-22 |
| 168 | Label Tree Semantic Losses for Rich Multi-class Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we proposetwo tree-based semantic loss functions which take advantage of a hierarchicalorganisation of the labels. |
JUNWEN WANG et. al. | arxiv-cs.CV | 2025-07-21 |
| 169 | SeC: Advancing Complex Video Object Segmentation Via Progressive Concept Construction Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This limitation arises from their reliance on appearancematching, neglecting the human-like conceptual understanding of objects thatenables robust identification across temporal dynamics. Motivated by this gap,we propose Segment Concept (SeC), a concept-driven segmentation framework thatshifts from conventional feature matching to the progressive construction andutilization of high-level, object-centric representations. |
ZHIXIONG ZHANG et. al. | arxiv-cs.CV | 2025-07-21 |
| 170 | Improved Semantic Segmentation from Ultra-Low-Resolution RGB Images Applied to Privacy-Preserving Object-Goal Navigation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we introduce a novel fullyjoint-learning method that integrates an agglomerative feature extractor and asegmentation-aware discriminator to solve ultra-low-resolution semanticsegmentation, thereby enabling privacy-preserving, semantic object-goalnavigation. |
Xuying Huang; Sicong Pan; Olga Zatsarynna; Juergen Gall; Maren Bennewitz; | arxiv-cs.RO | 2025-07-21 |
| 171 | AMMNet: An Asymmetric Multi-Modal Network for Remote Sensing Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: These issuesundermine the efficiency and robustness of semantic segmentation, particularlyin complex urban environments where precise multi-modal integration isessential. To overcome these limitations, we propose Asymmetric Multi-ModalNetwork (AMMNet), a novel asymmetric architecture that achieves robust andefficient semantic segmentation through three designs tailored for RGB-DSMinput pairs. |
Hui Ye; Haodong Chen; Zeke Zexi Hu; Xiaoming Chen; Yuk Ying Chung; | arxiv-cs.CV | 2025-07-21 |
| 172 | DiSCO-3D : Discovering and Segmenting Sub-Concepts from Open-vocabulary Queries in NeRF Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose DiSCO-3D, thefirst method addressing the broader problem of 3D Open-Vocabulary Sub-conceptsDiscovery, which aims to provide a 3D semantic segmentation that adapts to boththe scene and user queries. |
Doriand Petit; Steve Bourgeois; Vincent Gay-Bellile; Florian Chabot; Loïc Barthe; | arxiv-cs.CV | 2025-07-19 |
| 173 | A Novel Downsampling Strategy Based on Information Complementarity for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The results show that the HPDmodule provides an efficient solution for semantic segmentation tasks. |
Wenbo Yue; Chang Li; Guoping Xu; | arxiv-cs.CV | 2025-07-19 |
| 174 | Semantic Segmentation Based Scene Understanding in Autonomous Vehicles Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Inthis work, we propose several efficient models to investigate sceneunderstanding through semantic segmentation. |
Ehsan Rassekh; | arxiv-cs.CV | 2025-07-18 |
| 175 | A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a privacy-preserving semantic-segmentation method for applyingperceptual encryption to images used for model training in addition to testimages. |
Homare Sueyoshi; Kiyoshi Nishikawa; Hitoshi Kiya; | arxiv-cs.CV | 2025-07-16 |
| 176 | On Splitting Lightweight Semantic Image Segmentation for Wireless Communications Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes anovel approach to implementing semantic communication based on splitting thesemantic image segmentation process between a resource constrained transmitterand the receiver. |
Ebrahim Abu-Helalah; Jordi Serra; Jordi Perez-Romero; | arxiv-cs.NI | 2025-07-14 |
| 177 | Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: It has been shown that, in comparison to regularDNN training, training with stylized images reduces texture biases in imageclassification and improves robustness with respect to image corruptions. In aneffort to advance this line of research, we examine whether style transfer canlikewise deliver these two effects in semantic segmentation. |
Ben Hamscher; Edgar Heinert; Annika Mütze; Kira Maag; Matthias Rottmann; | arxiv-cs.CV | 2025-07-14 |
| 178 | Segmentation Similarity Enhanced Semantic Related Entity Fusion for Multi-modal Knowledge Graph Completion Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The segmentation of semantic data, including image segmentation and word-level descriptions, often contain implicit relationships between entities that are frequently overlooked by existing methodologies, thus limiting the effectiveness of reasoning tasks. Therefore, we propose a novel completion inference method based on fine-grained semantic segmentation, which enhances reasoning capability by utilizing implicit relationships between entities. |
Yunpeng Wang; Bo Ning; Xin Wang; Chengfei Liu; Guanyu Li; | sigir | 2025-07-13 |
| 179 | Image Translation with Kernel Prediction Networks for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a novel imagetranslation method, Domain Adversarial Kernel Prediction Network (DA-KPN), thatguarantees semantic matching between the synthetic label and translation.DA-KPN estimates pixel-wise input transformation parameters of a lightweightand simple translation function. |
Cristina Mata; Michael S. Ryoo; Henrik Turbell; | arxiv-cs.CV | 2025-07-11 |
| 180 | MUVOD: A Novel Multi-view Video Object Segmentation Dataset and A Benchmark for 3D Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we present MUVOD, a new multi-view video dataset fortraining and evaluating object segmentation in reconstructed real-worldscenarios. |
BANGNING WEI et. al. | arxiv-cs.CV | 2025-07-10 |
| 181 | StixelNExT++: Lightweight Monocular Scene Segmentation and Representation for Collective Perception Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents StixelNExT++, a novel approach to scene representationfor monocular perception systems. |
Marcel Vosshans; Omar Ait-Aider; Youcef Mezouar; Markus Enzweiler; | arxiv-cs.CV | 2025-07-09 |
| 182 | RSRefSeg 2: Decoupling Referring Remote Sensing Image Segmentation with Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Referring Remote Sensing Image Segmentation provides a flexible andfine-grained framework for remote sensing scene analysis via vision-languagecollaborative interpretation. |
KEYAN CHEN et. al. | arxiv-cs.CV | 2025-07-08 |
| 183 | CoPT: Unsupervised Domain Adaptive Segmentation Using Domain-Agnostic Text Embeddings Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Despiterecent advances in large-scale vision-language representation learning, UDAmethods for segmentation have not taken advantage of the domain-agnosticproperties of text. To address this, we present a novel Covariance-basedPixel-Text loss, CoPT, that uses domain-agnostic text embeddings to learndomain-invariant features in an image segmentation encoder. |
Cristina Mata; Kanchana Ranasinghe; Michael S. Ryoo; | arxiv-cs.CV | 2025-07-08 |
| 184 | MOSU: Autonomous Long-range Robot Navigation with Multi-modal Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We present MOSU, a novel autonomous long-range navigation system thatenhances global navigation for mobile robots through multimodal perception andon-road scene understanding. |
JING LIANG et. al. | arxiv-cs.RO | 2025-07-07 |
| 185 | CoT-Segmenter: Enhancing OOD Detection in Dense Road Scenes Via Chain-of-Thought Reasoning Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To address the presentedchallenges, we propose a novel CoT-based framework targeting OOD detection inroad anomaly scenes. |
Jeonghyo Song; Kimin Yun; DaeUng Jo; Jinyoung Kim; Youngjoon Yoo; | arxiv-cs.CV | 2025-07-05 |
| 186 | No Time to Train! Training-Free Reference-Based Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We find that correspondences enable automaticgeneration of instance-level segmentation masks for downstream tasks andinstantiate our ideas via a multi-stage, training-free method incorporating (1)memory bank construction; (2) representation aggregation and (3) semantic-awarefeature matching. |
Miguel Espinosa; Chenhongyi Yang; Linus Ericsson; Steven McDonagh; Elliot J. Crowley; | arxiv-cs.CV | 2025-07-03 |
| 187 | A Gift from The Integration of Discriminative and Diffusion-based Generative Learning: Boundary Refinement Remote Sensing Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Our theoretical analysis confirmsthat the diffusion denoising process significantly enhances the model’s abilityto learn high-frequency features; however, we also observe that these modelsexhibit insufficient semantic inference for low-frequency features when guidedsolely by the original image. Therefore, we integrate the strengths of bothdiscriminative and generative learning, proposing the Integration ofDiscriminative and diffusion-based Generative learning for Boundary Refinement(IDGBR) framework. |
Hao Wang; Keyan Hu; Xin Guo; Haifeng Li; Chao Tao; | arxiv-cs.CV | 2025-07-02 |
| 188 | SCDF: Seeing Clearly Through Dark and Fog, An Adaptive Semantic Segmentation Scheme for Autonomous Vehicle Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is a pivotal research area in the advancement of autonomous driving, with a particular focus on addressing adverse weather conditions such as night, rain, … |
Zuobing Ying; Zhengcheng Lin; Zhenyu Li; Xiaochun Huang; Weiping Ding; | IEEE Transactions on Intelligent Transportation Systems | 2025-07-01 |
| 189 | PGOV3D: Open-Vocabulary 3D Semantic Segmentation with Partial-to-Global Curriculum Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We aggregate the partial vocabularies associated with each scene andgenerate pseudo labels using the pre-trained model, effectively bridging thesemantic gap between dense partial observations and large-scale 3Denvironments. |
SHIQI ZHANG et. al. | arxiv-cs.CV | 2025-06-30 |
| 190 | PlantSegNeRF: A Few-shot, Cross-dataset Method for Plant 3D Instance Point Cloud Reconstruction Via Joint-channel NeRF with Multi-view Image Instance Matching Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we proposed anovel approach called plant segmentation neural radiance fields (PlantSegNeRF),aiming to directly generate high-precision instance point clouds frommulti-view RGB image sequences for a wide range of plant species. |
XIN YANG et. al. | arxiv-cs.CV | 2025-06-30 |
| 191 | High-quality Pseudo-labeling for Point Cloud Segmentation with Scene-level Annotation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Consequently, toenhance accuracy, this paper proposes a high-quality pseudo-label generationframework by exploring contemporary multi-modal information and region-pointsemantic consistency. |
Lunhao Duan; Shanshan Zhao; Xingxing Weng; Jing Zhang; Gui-Song Xia; | arxiv-cs.CV | 2025-06-29 |
| 192 | FA-Seg: A Fast and Accurate Diffusion-Based Method for Open-Vocabulary Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we present FA-Seg, aFast and Accurate training-free framework for open-vocabulary segmentationbased on diffusion models. |
Quang-Huy Che; Vinh-Tiep Nguyen; | arxiv-cs.CV | 2025-06-29 |
| 193 | Layer Decomposition and Morphological Reconstruction for Task-Oriented Infrared Image Enhancement Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Furthermore, achievingcontrast enhancement without amplifying noise and losing important informationremains a challenge. To address these challenges, we propose a task-orientedinfrared image enhancement method. |
Siyuan Chai; Xiaodong Guo; Tong Liu; | arxiv-cs.CV | 2025-06-29 |
| 194 | Dual Atrous Separable Convolution for Improving Agricultural Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Thisstudy proposes an efficient image segmentation method for precisionagriculture, focusing on accurately delineating farmland anomalies to supportinformed decision-making and proactive interventions. |
Chee Mei Ling; Thangarajah Akilan; Aparna Ravinda Phalke; | arxiv-cs.CV | 2025-06-27 |
| 195 | SDRNET: Stacked Deep Residual Network for Accurate Semantic Segmentation of Fine-Resolution Remotely Sensed Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This article presents a stackeddeep residual network (SDRNet) for semantic segmentation from FRRS images. |
NAFTALY WAMBUGU et. al. | arxiv-cs.CV | 2025-06-27 |
| 196 | Better to Teach Than to Give: Domain Generalized Semantic Segmentation Via Agent Queries with Diffusion Model Guidance Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a novel agent \textbf{Query}-driven learning framework based on \textbf{Diff}usion model guidance for DGSS, named QueryDiff. |
Fan Li; Xuan Wang; Min Qi; Zhaoxiang Zhang; yuelei xu; | icml | 2025-06-25 |
| 197 | Open-Vocabulary Camouflaged Object Segmentation with Cascaded Vision Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Open-Vocabulary Camouflaged Object Segmentation (OVCOS) seeks to segment andclassify camouflaged objects from arbitrary categories, presenting uniquechallenges due to visual ambiguity and unseen categories.Recent approachestypically adopt a two-stage paradigm: first segmenting objects, thenclassifying the segmented regions using Vision Language Models (VLMs). |
KAI ZHAO et. al. | arxiv-cs.CV | 2025-06-24 |
| 198 | A Global-Local Cross-Attention Network for Ultra-high Resolution Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To address theseissues, we propose GLCANet (Global-Local Cross-Attention Network), alightweight segmentation framework designed for UHR remote sensingimagery.GLCANet employs a dual-stream architecture to efficiently fuse globalsemantics and local details while minimizing GPU usage. |
Chen Yi; Shan LianLei; | arxiv-cs.CV | 2025-06-24 |
| 199 | DepthSeg: Depth Prompting in Remote Sensing Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce a depth prompting two-dimensional (2D) remote sensing semantic segmentation framework (DepthSeg). |
NING ZHOU et. al. | arxiv-cs.CV | 2025-06-17 |
| 200 | Image Segmentation with Large Language Models: A Survey with Perspectives for Intelligent Transportation Systems Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Forintelligent transportation systems (ITS), where accurate scene understanding iscritical for safety and efficiency, this new paradigm offers unprecedentedcapabilities. This survey systematically reviews the emerging field ofLLM-augmented image segmentation, focusing on its applications, challenges, andfuture directions within ITS. |
Sanjeda Akter; Ibne Farabi Shihab; Anuj Sharma; | arxiv-cs.CV | 2025-06-16 |
| 201 | A Comprehensive Survey on Video Scene Parsing:Advances, Challenges, and Prospects Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this survey, we present a holistic review of recent advances in VSP, covering a wide array of vision tasks, including Video Semantic Segmentation (VSS), Video Instance Segmentation (VIS), Video Panoptic Segmentation (VPS), as well as Video Tracking and Segmentation (VTS), and Open-Vocabulary Video Segmentation (OVVS). |
GUOHUAN XIE et. al. | arxiv-cs.CV | 2025-06-16 |
| 202 | InceptionMamba: Efficient Multi-Stage Feature Enhancement with Selective State Space Model for Microscopic Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Moreover, their reliance on the availability of large datasets for improved performance, along with the high computational cost, limit their practicality. To address these issues, we propose an efficient framework for the segmentation task, named InceptionMamba, which encodes multi-stage rich features and offers both performance and computational efficiency. |
DANIYA NAJIHA ABDUL KAREEM et. al. | arxiv-cs.CV | 2025-06-13 |
| 203 | Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Current RRSIS methods rely on multi-modal fusion backbones and semantic segmentation heads but face challenges like dense annotation requirements and complex scene interpretation. To address these issues, we propose a framework named \textit{prompt-generated semantic localization guiding Segment Anything Model}(PSLG-SAM), which decomposes the RRSIS task into two stages: coarse localization and fine segmentation. |
Shuyang Li; Shuang Wang; Zhuangzhuang Sun; Jing Xiao; | arxiv-cs.CV | 2025-06-12 |
| 204 | Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This work introduces Symmetrical Flow Matching (SymmFlow), a new formulation that unifies semantic segmentation, classification, and image generation within a single model. |
Francisco Caetano; Christiaan Viviers; Peter H. N. De With; Fons van der Sommen; | arxiv-cs.CV | 2025-06-12 |
| 205 | Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20$^{th}$ Century Urban Landscapes with Satellite Imageries Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, severe quality degradation (e.g., distortion, misalignment, and spectral scarcity) and annotation absence have long hindered semantic segmentation on such historical RS imagery. To bridge this gap and enhance understanding of urban development, we introduce $\textbf{Urban1960SatBench}$, an annotated segmentation dataset based on historical satellite imagery with the earliest observation time among all existing segmentation datasets, along with a benchmark framework for unsupervised segmentation tasks, $\textbf{Urban1960SatUSM}$. |
TIANXIANG HAO et. al. | arxiv-cs.CV | 2025-06-11 |
| 206 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, acquiring high-quality labeled data is often costly and time-consuming. To address this challenge, we proposes a multi-modal self-supervised learning framework that leverages high-resolution RGB images, multi-spectral data, and digital surface models (DSM) for pre-training. |
TONG WANG et. al. | arxiv-cs.CV | 2025-06-10 |
| 207 | Segment Any Architectural Facades (SAAF):An Automatic Segmentation Model for Building Facades, Walls and Windows Based on Multimodal Semantics Guidance Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This study proposes anautomatic segmentation model for building facade walls and windows based onmultimodal semantic guidance, called Segment Any Architectural Facades (SAAF). |
PEILIN LI et. al. | arxiv-cs.CV | 2025-06-09 |
| 208 | PIG: Physically-based Multi-Material Interaction with 3D Gaussians Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, in a scene represented by 3D Gaussian primitives, interactions between objects suffer from inaccurate 3D segmentation, imprecise deformation among different materials, and severe rendering artifacts. To address these challenges, we introduce PIG: Physically-Based Multi-Material Interaction with 3D Gaussians, a novel approach that combines 3D object segmentation with the simulation of interacting objects in high precision. |
ZEYU XIAO et. al. | arxiv-cs.GR | 2025-06-09 |
| 209 | Stepwise Decomposition and Dual-stream Focus: A Novel Approach for Training-free Camouflaged Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: While promptable segmentation (\textit{e.g.}, SAM) has shown promise forvarious segmentation tasks, it still requires manual visual prompts for eachobject to be segmented. In … |
CHAO YIN et. al. | arxiv-cs.CV | 2025-06-07 |
| 210 | Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper explores scene affinity (AIScene), namely intra-scene consistency and inter-scene correlation, for semi-supervised LiDAR semantic segmentation in driving scenes. |
CHUANDONG LIU et. al. | cvpr | 2025-06-07 |
| 211 | RelationField: Relate Anything in Radiance Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, current method primarily focus on object-centric representations, supporting object segmentation or detection, while understanding semantic relationships between objects remains largely unexplored. To address this gap, we propose RelationField, the first method to extract inter-object relationships directly from neural radiance fields. |
SEBASTIAN KOCH et. al. | cvpr | 2025-06-07 |
| 212 | SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper introduces SUM Parts, the first large-scale dataset for urban textured meshes with part-level semantic labels, covering about 2.5km^2 with 21 classes. |
Weixiao Gao; Liangliang Nan; Hugo Ledoux; | cvpr | 2025-06-07 |
| 213 | A Semantic Knowledge Complementarity Based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose SKCDF, a semantic knowledge complementarity based decoupling framework for multi-organ segmentation in class-imbalanced medical images. |
ZHENG ZHANG et. al. | cvpr | 2025-06-07 |
| 214 | EntitySAM: Segment Everything in Video Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Specifically, we introduce an entity decoder to facilitate inter-object communication and an automatic prompt generator using learnable object queries. |
Mingqiao Ye; Seoung Wug Oh; Lei Ke; Joon-Young Lee; | cvpr | 2025-06-07 |
| 215 | High Temporal Consistency Through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose a lightweight video semantic segmentation approach–suited to onboard real-time inference–achieving high temporal consistency on aerial data through Semantic Similarity Propagation across frames. |
Cédric Vincent; Taehyoung Kim; Henri Meeß; | cvpr | 2025-06-07 |
| 216 | BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we revisit 3D semantic segmentation through a more granular lens, shedding light on subtle complexities that are typically overshadowed by broader performance metrics. |
Weiguang Zhao; Rui Zhang; Qiufeng Wang; Guangliang Cheng; Kaizhu Huang; | cvpr | 2025-06-07 |
| 217 | DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To improve the FSS pipeline, we propose a novel framework that utilizes large language models (LLMs) to adapt general class semantic information to the query image. |
Amin Karimi; Charalambos Poullis; | cvpr | 2025-06-07 |
| 218 | MaSS13K: A Matting-level Semantic Segmentation Benchmark Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we build a large-scale, matting-level semantic segmentation dataset, named MaSS13K, which consists of 13,348 real-world images, all at 4K resolution. |
Chenxi Xie; Minghan Li; Hui Zeng; Jun Luo; Lei Zhang; | cvpr | 2025-06-07 |
| 219 | DocSAM: Unified Document Image Segmentation Via Query Decomposition and Heterogeneous Mixed Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Document image segmentation is crucial in document analysis and recognition but remains challenging due to the heterogeneity of document formats and diverse segmentation tasks. … |
Xiao-Hui Li; Fei Yin; Cheng-Lin Liu; | cvpr | 2025-06-07 |
| 220 | NightAdapter: Learning A Frequency Adapter for Generalizable Night-time Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Night-time scene segmentation is a critical yet challenging task in the real-world applications, primarily due to the complicated lighting conditions. However, existing methods … |
QI BI et. al. | cvpr | 2025-06-07 |
| 221 | Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: While freezing the text encoder preserves its powerful embeddings, recent studies show that fine-tuning both the text and image encoders jointly significantly enhances segmentation performance, especially for classes from open sets. In this work, we explain this phenomenon from the perspective of hierarchical alignment, since during fine-tuning, the hierarchy level of image embeddings shifts from image-level to pixel-level. |
ZELIN PENG et. al. | cvpr | 2025-06-07 |
| 222 | Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper introduces MPEC, a novel Masked Point-Entity Contrastive learning method for open-vocabulary 3D semantic segmentation that leverages both 3D entity-language alignment and point-entity consistency across different point cloud views to foster entity-specific feature representations. |
Yan Wang; Baoxiong Jia; Ziyu Zhu; Siyuan Huang; | cvpr | 2025-06-07 |
| 223 | Dr. Splat: Directly Referring 3D Gaussian Splatting Via Direct Language Embedding Registration IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce Dr. Splat, a novel approach for open-vocabulary 3D scene understanding leveraging 3D Gaussian Splatting. |
KIM JUN-SEONG et. al. | cvpr | 2025-06-07 |
| 224 | An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose an end-to-end robust semantic Segmentation Network based on a Conditional-Noise Framework (CNF) of DDPMs, named CDSegNet. |
Wentao Qu; Jing Wang; YongShun Gong; Xiaoshui Huang; Liang Xiao; | cvpr | 2025-06-07 |
| 225 | FALCON: Fairness Learning Via Contrastive Attention Approach to Continual Semantic Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This work presents a novel Fairness Learning via Contrastive Attention Approach to continual learning in semantic scene understanding. |
Thanh-Dat Truong; Utsav Prabhu; Bhiksha Raj; Jackson Cothren; Khoa Luu; | cvpr | 2025-06-07 |
| 226 | VidSeg: Training-free Video Semantic Segmentation Based on Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce the first training-free approach for Video Semantic Segmentation (VSS) based on pre-trained diffusion models. |
QIAN WANG et. al. | cvpr | 2025-06-07 |
| 227 | Efficient Decoupled Feature 3D Gaussian Splatting Via Hierarchical Compression Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing 3DGS-based methods embed both color and high-dimensional semantic features into a single field, leading to significant storage and computational overhead. To mitigate this, we propose Decoupled Feature 3D Gaussian Splatting (DF-3DGS), a novel method that decouples the color and semantic fields, thereby reducing the number of 3D Gaussians required for semantic representation. |
Zhenqi Dai; Ting Liu; Yanning Zhang; | cvpr | 2025-06-07 |
| 228 | Zero-Shot 4D Lidar Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, the primary challenge in advancing research and developing generalized, versatile methods for spatio-temporal scene understanding in Lidar lies in the scarcity of datasets that provide the necessary diversity and scale of annotations. To overcome these challenges, we propose SAL-4D (Segment Anything in Lidar–4D), a method that utilizes multi-modal robotic sensor setups as a bridge to distill recent developments in Video Object Segmentation (VOS) in conjunction with off-the-shelf Vision-Language foundation models to Lidar. |
Yushan Zhang; Aljoša Ošep; Laura Leal-Taixé; Tim Meinhardt; | cvpr | 2025-06-07 |
| 229 | FFR: Frequency Feature Rectification for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we identify that attenuated high-frequency features mislead the decoder of ViT-based WSSS models, resulting in over-smoothed false segmentation. To address this, we propose a Frequency Feature Rectification (FFR) framework to rectify the false segmentations caused by attenuated high-frequency features and enhance the learning of high-frequency features in the decoder. |
Ziqian Yang; Xinqiao Zhao; Xiaolei Wang; Quan Zhang; Jimin Xiao; | cvpr | 2025-06-07 |
| 230 | CALICO: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce the new task of part-focused semantic co-segmentation, which involves identifying and segmenting common objects and their constituent common and unique parts across images. |
Kiet A. Nguyen; Adheesh Juvekar; Tianjiao Yu; Muntasir Wahed; Ismini Lourentzou; | cvpr | 2025-06-07 |
| 231 | PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose PSA-SSL, a novel extension to point cloud SSL that learns object pose and size-aware (PSA) features. |
Barza Nisar; Steven L. Waslander; | cvpr | 2025-06-07 |
| 232 | Segment Any Motion in Videos Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a novel approach for moving object segmentation that combines long-range trajectory motion cues with DINO-based semantic features and leverages SAM2 for pixel-level mask densification through an iterative prompting strategy. |
NAN HUANG et. al. | cvpr | 2025-06-07 |
| 233 | Convex Combination Star Shape Prior for Data-driven Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes a convex combination star (CCS) shape, possessing multi-center star shape properties, and has the advantage of effectively controlling the shape of the region through a smooth field function. |
Xinyu Zhao; Jun Xie; Shengzhe Chen; Jun Liu; | cvpr | 2025-06-07 |
| 234 | Benchmarking Large Vision-Language Models Via Directed Scene Graph for Comprehensive Image Captioning IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce a detailed caption benchmark, termed as CompreCap, to evaluate the visual context from a directed scene graph view. |
FAN LU et. al. | cvpr | 2025-06-07 |
| 235 | Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Applying this pipeline to multiple 3D scene datasets, we create Mosaic3D-5.6M, a dataset of more than 30K annotated scenes with 5.6M mask-text pairs – significantly larger than existing datasets. Building on these data, we propose Mosaic3D, a 3D visiual foundation model (3D-VFM) combining a 3D encoder trained with contrastive learning and a lightweight mask decoder for open-vocabulary 3D semantic and instance segmentation. |
JUNHA LEE et. al. | cvpr | 2025-06-07 |
| 236 | Scaling Up Image Segmentation Across Data and Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Traditional segmentation models, while effective in isolated tasks, often fail to generalize to more complex and open-ended segmentation problems, such as free-form, open-vocabulary, and in-the-wild scenarios. To bridge this gap, we propose to scale up image segmentation across diverse datasets and tasks such that the knowledge across different tasks and datasets can be integrated while improving the generalization ability. |
PEI WANG et. al. | cvpr | 2025-06-07 |
| 237 | Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we design Unified-Lift, a new end-to-end object-aware lifting approach that aims for high-quality 3D segmentation based on our object-aware 3D Gaussian representation. |
RUNSONG ZHU et. al. | cvpr | 2025-06-07 |
| 238 | A Dataset for Semantic Segmentation in The Presence of Unknowns Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing datasets allow evaluation of only either knowns or unknowns – but not both, which is required to establish "in the wild" suitability of deep neural network models. To bridge this gap, we propose a novel anomaly segmentation dataset, ISSU, featuring a diverse set of anomaly inputs from cluttered real-world environments. |
ZAKARIA LASKAR et. al. | cvpr | 2025-06-07 |
| 239 | COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Unlike existing approaches that remove ambiguous Gaussians and sacrifice visual quality, COB-GS, as a 3DGS refinement method, jointly optimizes semantic and visual information, allowing the two different levels to cooperate with each other effectively. Specifically, for the semantic guidance, we introduce a boundary-adaptive Gaussian splitting technique that leverages semantic gradient statistics to identify and split ambiguous Gaussians, aligning them closely with object boundaries. |
Jiaxin Zhang; Junjun Jiang; Youyu Chen; Kui Jiang; Xianming Liu; | cvpr | 2025-06-07 |
| 240 | Using Diffusion Priors for Video Amodal Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we propose to tackle video amodal segmentation by formulating it as a conditional generation task, thereby capitalizing on the foundational knowledge in video generative models. |
Kaihua Chen; Deva Ramanan; Tarasha Khurana; | cvpr | 2025-06-07 |
| 241 | Real-Time Image Semantic Segmentation Based on Improved DeepLabv3+ Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: To improve the performance of the image semantic segmentation algorithm and make the algorithm achieve a better balance between accuracy and real-time performance when segmenting … |
Peibo Li; Jiangwu Zhou; Xiaohua Xu; | Big Data Cogn. Comput. | 2025-06-06 |
| 242 | Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: These qualities, which ensure consistent performance under diverse conditions (robustness) and well-calibrated model confidences as well as meaningful uncertainties (reliability), are essential for safety-critical applications like autonomous driving, where models must handle unpredictable environments and avoid sudden failures at all costs. To address this gap, we introduce the Reliable Segmentation Score (RSS), a novel metric that combines predictive accuracy, calibration, and uncertainty quality measures via a harmonic mean. |
Steven Landgraf; Markus Hillemann; Markus Ulrich; | arxiv-cs.CV | 2025-06-06 |
| 243 | U-NetMN and SegNetMN: Modified U-Net and SegNet Models for Bimodal SAR Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we evaluate the impact of mode normalization on two widely used semantic segmentation models, U-Net and SegNet. |
MARWANE KZADRI et. al. | arxiv-cs.CV | 2025-06-05 |
| 244 | A Large-Scale Referring Remote Sensing Image Segmentation Dataset and Benchmark Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing datasets for RRSIS suffer from critical limitations in resolution, scene diversity, and category coverage, which hinders the generalization and real-world applicability of refer segmentation models. To facilitate the development of this field, we introduce NWPU-Refer, the largest and most diverse RRSIS dataset to date, comprising 15,003 high-resolution images (1024-2048px) spanning 30+ countries with 49,745 annotated targets supporting single-object, multi-object, and non-object segmentation scenarios. |
ZHIGANG YANG et. al. | arxiv-cs.CV | 2025-06-04 |
| 245 | Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: These models often struggle with thin structures and fine boundaries, leading to poor segmentation quality. We propose Talk2SAM, a novel approach that integrates textual guidance to improve segmentation of such challenging objects. |
Luka Vetoshkin; Dmitry Yudin; | arxiv-cs.CV | 2025-06-03 |
| 246 | 3DLST: 3D Learnable Supertoken Transformer for LiDAR Point Cloud Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Dening Lu; Linlin Xu; Jun Zhou; Kyle Gao; Jonathan Li; | Int. J. Appl. Earth Obs. Geoinformation | 2025-06-01 |
| 247 | Thermal Image-guided Complementary Masking with Multiscale Fusion for Multi-spectral Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Zeyang Chen; Mingnan Hu; Bo Chen; | Eng. Appl. Artif. Intell. | 2025-06-01 |
| 248 | Cascading Attention Enhancement Network for RGB-D Indoor Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
XU TANG et. al. | Comput. Vis. Image Underst. | 2025-06-01 |
| 249 | HMFENet: Hierarchical Matching Guided Feature Enhancement Network for Few-Shot RGB-Thermal Urban Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: RGB-Thermal semantic segmentation provides reliable support for intelligent traffic perception systems, such as road safety monitoring and autonomous driving perception, by fusing … |
XIANGYU ZHOU et. al. | IEEE Transactions on Intelligent Transportation Systems | 2025-06-01 |
| 250 | AR-Light: Enabling Fast and Lightweight Multi-User Augmented Reality Via Semantic Segmentation and Collaborative View Synchronization Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Multi-user Augmented Reality (MuAR) allows multiple users to interact with shared virtual objects, facilitated by exchanging environment information. Current MuAR systems rely on … |
YU WEN et. al. | IEEE Transactions on Computers | 2025-06-01 |
| 251 | A Novel Hierarchical Generative Model for Semi-Supervised Semantic Segmentation of Biomedical Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In biomedical vision research, a significant challenge is the limited availability of pixel-wise labeled data. Data augmentation has been identified as a solution to this issue … |
Lu Chai; Zidong Wang; Yuheng Shao; Qinyuan Liu; | IEEE Transactions on Emerging Topics in Computational … | 2025-06-01 |
| 252 | WCMamba: Enhancing High-resolution Remote Sensing Image Semantic Segmentation with Pyramid Wavelet Convolution and SS2D Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Chao Zhan; Kui Yang; | Knowl. Based Syst. | 2025-06-01 |
| 253 | Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We present a unified, adaptive framework for automatic scene detection and keyframe selection that handles formats ranging from short-form media to long-form films, archival content, and surveillance footage. |
Vasilii Korolkov; | arxiv-cs.CV | 2025-05-31 |
| 254 | Federated Unsupervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Extending these ideas to federated settings requires feature representation and cluster centroid alignment across distributed clients — an inherently difficult task under heterogeneous data distributions in the absence of supervision. To address this, we propose FUSS Federated Unsupervised image Semantic Segmentation) which is, to our knowledge, the first framework to enable fully decentralized, label-free semantic segmentation training. |
Evangelos Charalampakis; Vasileios Mygdalis; Ioannis Pitas; | arxiv-cs.CV | 2025-05-29 |
| 255 | Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Compared to the traditional methods, Deep Learning models improve accuracy by extracting informative and discriminative features, but often fall short in capturing the aforementioned complexities. To address these challenges, we propose PerceptiveNet, a novel model incorporating a Logarithmic Gabor-parameterised convolutional layer with trainable filter parameters, alongside a backbone that extracts salient features while capturing extensive context and spatial information through a wider receptive field. |
Georgios Voulgaris; | arxiv-cs.CV | 2025-05-29 |
| 256 | LiDAR Based Semantic Perception for Forklifts in Outdoor Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we present a novel LiDAR-based semantic segmentation framework tailored for autonomous forklifts operating in complex outdoor environments. |
Benjamin Serfling; Hannes Reichert; Lorenzo Bayerlein; Konrad Doll; Kati Radkhah-Lens; | arxiv-cs.RO | 2025-05-28 |
| 257 | What You Perceive Is What You Conceive: A Cognition-Inspired Framework for Open Vocabulary Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing paradigms typically perform class-agnostic region segmentation followed by category matching, which deviates from the human visual system’s process of recognizing objects based on semantic concepts, leading to poor alignment between region segmentation and target concepts. To bridge this gap, we propose a novel Cognition-Inspired Framework for open vocabulary image segmentation that emulates the human visual recognition process: first forming a conceptual understanding of an object, then perceiving its spatial extent. |
JIANGHANG LIN et. al. | arxiv-cs.CV | 2025-05-26 |
| 258 | The Missing Point in Vision Transformers for Universal Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we introduce ViT-P, a novel two-stage segmentation framework that decouples mask generation from classification. |
SAJJAD SHAHABODINI et. al. | arxiv-cs.CV | 2025-05-26 |
| 259 | Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this approach, the accuracy of the semantic segmentation model depends on the quality of the pseudo labels, and the quality of the pseudo labels depends on the performance of the model to be trained and the amount of data with annotated labels. In this paper, we generate pseudo labels using zero-shot annotation with the Segment Anything Model (SAM) and Contrastive Language-Image Pretraining (CLIP), improve the accuracy of the pseudo labels using the Unified Dual-Stream Perturbations Approach (UniMatch), and use them as enhanced labels to train a semantic segmentation model. |
Nagito Saito; Shintaro Ito; Koichi Ito; Takafumi Aoki; | arxiv-cs.CV | 2025-05-26 |
| 260 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing methods that employ semantic segmentation or object detection for dynamic identification and filtering typically rely on predefined categorical priors, while discarding dynamic scene information crucial for robotic applications such as dynamic obstacle avoidance and environmental interaction. To overcome these challenges, we propose ADD-SLAM: an Adaptive Dynamic Dense SLAM framework based on Gaussian splitting. |
WENHUA WU et. al. | arxiv-cs.CV | 2025-05-25 |
| 261 | Semantic Segmentation with Reward Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Sometimes, we need a semantic segmentation network, and even a visual encoder can have a high compatibility, and can be trained using various types of feedback beyond traditional labels, such as feedback that indicates the quality of the parsing results. To tackle this issue, we proposed RSS (Reward in Semantic Segmentation), the first practical application of reward-based reinforcement learning on pure semantic segmentation offered in two granular levels (pixel-level and image-level). |
Xie Ting; Ye Huang; Zhilin Liu; Lixin Duan; | arxiv-cs.CV | 2025-05-23 |
| 262 | EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: High-resolution remote sensing (HRRS) image segmentation is challenging due to complex spatial layouts and diverse object appearances. While CNNs excel at capturing local … |
YICHUN YU et. al. | arxiv-cs.CV | 2025-05-23 |
| 263 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To facilitate research towards robust model design in segmentation and detection, our primary objective is to provide benchmarking tools regarding robustness to distribution shifts and adversarial manipulations. |
SHASHANK AGNIHOTRI et. al. | arxiv-cs.CV | 2025-05-23 |
| 264 | TextureSAM: Towards A Texture Aware Foundation Model for Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we investigate SAM’s bias toward semantics over textures and introduce a new texture-aware foundation model, TextureSAM, which performs superior segmentation in texture-dominant scenarios. |
Inbal Cohen; Boaz Meivar; Peihan Tu; Shai Avidan; Gal Oren; | arxiv-cs.CV | 2025-05-22 |
| 265 | Multi-View Projection for Unsupervised Domain Adaptation in 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a multi-view projectionframework for unsupervised domain adaptation (UDA). |
Andrew Caunes; Thierry Chateau; Vincent Fremont; | arxiv-cs.CV | 2025-05-21 |
| 266 | From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This review offers a holistic view of DL-based SS for RS, highlighting key advancements, comparative insights, and open challenges to guide future research. |
Quanwei Liu; Tao Huang; Yanni Dong; Jiaqi Yang; Wei Xiang; | arxiv-cs.CV | 2025-05-21 |
| 267 | Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We present Scan, Materialize, Simulate (SMS), a unified framework that combines 3D Gaussian Splatting for accurate scene reconstruction, visual foundation models for semantic segmentation, vision-language models for material property inference, and physics simulation for reliable prediction of action outcomes. |
Amine Elhafsi; Daniel Morton; Marco Pavone; | arxiv-cs.RO | 2025-05-20 |
| 268 | Self-Supervised Learning for Image Segmentation: A Comprehensive Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This survey thoroughly investigates over 150 recent image segmentation articles, particularly focusing on SSL. |
Thangarajah Akilan; Nusrat Jahan; Wandong Zhang; | arxiv-cs.CV | 2025-05-19 |
| 269 | Enhancing Shape Perception and Segmentation Consistency for Industrial Image Inspection Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, a Shape-Aware Efficient Network (SPENet) is proposed, which focuses on the shapes of objects to achieve excellent segmentation consistency by separately supervising the extraction of boundary and body information from images. |
Guoxuan Mao; Ting Cao; Ziyang Li; Yuan Dong; | arxiv-cs.CV | 2025-05-19 |
| 270 | MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we explore the potential of a pure visual foundation model as an alternative to widely used vision-language models for universal visual anomaly segmentation. |
Bin-Bin Gao; | arxiv-cs.CV | 2025-05-14 |
| 271 | FedSaaS: Class-Consistency Federated Semantic Segmentation Via Global Prototype Supervision and Local Adversarial Harmonization Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This oversight results in ambiguities between class representation. To overcome this challenge, we propose a novel federated segmentation framework that strikes class consistency, termed FedSaaS. |
XIAOYANG YU et. al. | arxiv-cs.CV | 2025-05-14 |
| 272 | MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of An Urban Environment Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents a Multi-Elevation Semantic Segmentation Image (MESSI) dataset comprising 2525 images taken by a drone flying over dense urban environments. |
Barak Pinkovich; Boaz Matalon; Ehud Rivlin; Hector Rotstein; | arxiv-cs.CV | 2025-05-13 |
| 273 | Method for Semantic Image Segmentation Based on The Neural Network with Gabor Filters Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
E. Murin; D. V. Sorokin; A. S. Krylov; | Program. Comput. Softw. | 2025-05-12 |
| 274 | Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This report presents our semantic segmentation framework developed by team ACVLAB for the ICRA 2025 GOOSE 2D Semantic Segmentation Challenge, which focuses on parsing outdoor scenes into nine semantic categories under real-world conditions. |
CHIH-CHUNG HSU et. al. | arxiv-cs.CV | 2025-05-11 |
| 275 | Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we present a comprehensive study on cross-spectral UDA for thermal image semantic segmentation. |
Seokjun Kwon; Jeongmin Shin; Namil Kim; Soonmin Hwang; Yukyung Choi; | arxiv-cs.CV | 2025-05-11 |
| 276 | MultiTaskVIF: Segmentation-oriented Visible and Infrared Image Fusion Via Multi-task Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, most existing segmentation-oriented VIF methods adopt a cascade structure comprising separate fusion and segmentation models, leading to increased network complexity and redundancy. This raises a critical question: can we design a more concise and efficient structure to integrate semantic information directly into the fusion model during training-Inspired by multi-task learning, we propose a concise and universal training framework, MultiTaskVIF, for segmentation-oriented VIF models. |
Zixian Zhao; Andrew Howes; Xingchen Zhang; | arxiv-cs.CV | 2025-05-10 |
| 277 | CLIMS++: Cross Language Image Matching with Automatic Context Discovery for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
JINHENG XIE et. al. | Int. J. Comput. Vis. | 2025-05-09 |
| 278 | Segment Any RGB-Thermal Model with Language-aided Distillation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Given that RGB-T provides a robust solution for scene understanding in adverse weather and lighting conditions, such as low light and overexposure, we propose a novel framework, SARTM, which customizes the powerful SAM for RGB-T semantic segmentation. |
DONG XING et. al. | arxiv-cs.CV | 2025-05-03 |
| 279 | Parallel Segmentation Network for Real-time Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Guanke Chen; Haibin Li; Yaqian Li; Wenming Zhang; Tao Song; | Eng. Appl. Artif. Intell. | 2025-05-01 |
| 280 | SegTrackDetect: A Window-based Framework for Tiny Object Detection Via Semantic Segmentation and Tracking Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Aleksandra Kos; Karol Majek; Dominik Belter; | SoftwareX | 2025-05-01 |
| 281 | NTRENet++: Unleashing The Power of Non-Target Knowledge for Few-Shot Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Few-shot semantic segmentation (FSS) aims to segment the target object under the condition of a few annotated samples. However, current studies on FSS primarily concentrate on … |
YUANWEI LIU et. al. | IEEE Transactions on Circuits and Systems for Video … | 2025-05-01 |
| 282 | Improving RGB-Thermal Semantic Scene Understanding With Synthetic Data Augmentation for Autonomous Driving Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic scene understanding is an important capability for autonomous vehicles. Despite recent advances in RGB-Thermal (RGB-T) semantic segmentation, existing methods often rely … |
Haotian Li; H. K. Chu; Yuxiang Sun; | IEEE Robotics and Automation Letters | 2025-05-01 |
| 283 | Mamba Based Feature Extraction And Adaptive Multilevel Feature Fusion For 3D Tumor Segmentation From Multi-modal Medical Image Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a Mamba based feature extraction and adaptive multilevel feature fusion for 3D tumor segmentation using multi-modal medical image. |
ZEXIN JI et. al. | arxiv-cs.CV | 2025-04-29 |
| 284 | Segmenting Objectiveness and Task-awareness Unknown Region for Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper,we propose a novel framework termed Segmenting Objectiveness and Task-Awareness(SOTA) for autonomous driving scenes. |
MI ZHENG et. al. | arxiv-cs.CV | 2025-04-27 |
| 285 | A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper introduces two targeted data augmentation methods designed to improve segmentation performance on the railway-specific OSDaR23 dataset. |
NICOLAS MÜNGER et. al. | arxiv-cs.CV | 2025-04-25 |
| 286 | SAIP-Net: Enhancing Remote Sensing Image Segmentation Via Spectral Adaptive Information Propagation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To address limitations arising from spatial domain featurefusion and insufficient receptive fields, this paper introduces SAIP-Net, anovel frequency-aware segmentation framework that leverages Spectral AdaptiveInformation Propagation. |
Zhongtao Wang; Xizhe Cao; Yisong Chen; Guoping Wang; | arxiv-cs.CV | 2025-04-23 |
| 287 | Lightweight Road Environment Segmentation Using Vector Quantization Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: (3) Vector quantization encourages the latent space to form coarse clusters of continuous features, forcing the model to group similar features, making the learned representations more structured for the decoding process. In this work, we combined vector quantization with the lightweight image segmentation model MobileUNETR and used it as a baseline model for comparison to demonstrate its efficiency. |
Jiyong Kwag; Alper Yilmaz; Charles Toth; | arxiv-cs.CV | 2025-04-18 |
| 288 | Occlusion-Ordered Semantic Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose to solve the joint task of relative depth ordering and segmentation of instances based on occlusions. |
Soroosh Baselizadeh; Cheuk-To Yu; Olga Veksler; Yuri Boykov; | arxiv-cs.CV | 2025-04-18 |
| 289 | DC-SAM: In-Context Segment Anything in Images and Videos Via Dual Consistency Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we propose the Dual Consistency SAM (DC-SAM) method based on prompt-tuning to adapt SAM and SAM2 for in-context segmentation of both images and videos. |
MENGSHI QI et. al. | arxiv-cs.CV | 2025-04-16 |
| 290 | FCoDT-Net: A Novel Framework for High-Precision Medical Image Segmentation Using Contextual Distillation Transformer Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The unused information leads to suboptimal segmentation results. In this paper, we propose the Feature Context Distillation Transformer Network (FCoDT-Net), a deep learning model designed to address these limitations by leveraging the rich contextual information within the skip connections. |
Q. YuTao; Y. SiZhe; H. Bang; R. Wei; | icassp | 2025-04-15 |
| 291 | Harnessing Light Field Angular Cues and Spatial Geometries for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce a novel backbone network called the Light Field Extraction Interaction Network (LFEI-Net). |
C. Jia; F. Shi; X. Cheng; | icassp | 2025-04-15 |
| 292 | Text-Guided Few-Shot Semantic Segmentation with Training-Free Multimodal Feature Matching Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a training-free approach using multimodal feature matching that performs segmentation by identifying regions in a target image that match the features from both the image and text references. |
G. Buthmann; T. Sakai; H. Qiu; T. Katsuki; D. Kimura; | icassp | 2025-04-15 |
| 293 | PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, PraNet-V1 struggles with multi-class segmentation tasks. To address this limitation, we propose PraNet-V2, which, compared to PraNet-V1, effectively performs a broader range of tasks including multi-class segmentation. |
Bo-Cheng Hu; Ge-Peng Ji; Dian Shao; Deng-Ping Fan; | arxiv-cs.CV | 2025-04-15 |
| 294 | Dual-Path Consistency Unsupervised Domain Adaptation for Nighttime Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, it is often hindered by the lack of annotations due to interference caused by inadequate lighting or exposure. To overcome these difficulties, we propose a Dual-Path Consistency (DPC) unsupervised domain adaptation (UDA) approach. |
Y. Lu; J. Lang; M. Ding; | icassp | 2025-04-15 |
| 295 | A Weakly Supervised Semantic Segmentation Model with Enhanced CLIP Feature Extraction Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper addresses the limitations of the Contrastive Language-Image Pre-training (CLIP) model’s image encoder and proposes a segmentation model WSSS-ECFE with enhanced CLIP feature extraction, aiming to improve the performance of the Weakly Supervised Semantic Segmentation (WSSS) task. |
F. Kong; J. Lu; | icassp | 2025-04-15 |
| 296 | U-SAM: Upgrade Segment Anything Model With Semantic-Aware and Memory-Efficient Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: (2) SAM’s inefficient use of instance-independent visual features and tokens necessitates maintaining unique features and tokens for each instance, leading to excessive GPU memory consumption and diminished segmentation efficiency. To address these issues, we propose the Universal Segment Anything Model (U-SAM), a semantic-aware and memory-efficient segmentation model designed to perform both promptable and traditional segmentation tasks within a compact and unified framework. |
X. Jin; J. Hu; J. Lin; S. Zhang; L. Cao; | icassp | 2025-04-15 |
| 297 | PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in The Wild Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This report provides a comprehensive overview of the 4th Pixel-level Video Understanding in the Wild (PVUW) Challenge, held in conjunction with CVPR 2025. |
HENGHUI DING et. al. | arxiv-cs.CV | 2025-04-15 |
| 298 | Joint Semantic Segmentation of Optical and SAR Image in Hazy Environments Via Cross-modal Information Rectification and Cross-attention Fusion Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents a joint semantic segmentation of optical and SAR in hazy environments network that incorporates channel fusion for feature enhancement and cross-attention for feature fusion, enabling efficient segmentation of hazy optical images. |
X. Fan; L. Zhang; | icassp | 2025-04-15 |
| 299 | SPT: Sequence Prompt Transformer for Interactive Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing methods typically process one image at a time, failing to consider the sequential nature of the images. To overcome this limitation, we propose a novel method called Sequence Prompt Transformer (SPT), the first to utilize sequential image information for interactive segmentation. |
S. Cheng; | icassp | 2025-04-15 |
| 300 | ES-NeRF: Enhancing Segmentation in NeRF with CLIP Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, they face the challenge of accurately and consistently segmenting objects in complex scenarios. To address this issue, we introduce the Enhancing Segmentation in NeRF with CLIP(ES-NeRF), which aims to improve the segmentation quality through feature fusion with the help of CLIP’s powerful semantic comprehension. |
C. ZHAO et. al. | icassp | 2025-04-15 |
| 301 | Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, these methods require massive parameter updates and computational effort during the feature extraction and fusion. To address this issue, we propose a novel multimodal fusion network (EFNet) based on an early fusion strategy and a simple but effective feature clustering for training efficient RGB-T semantic segmentation. |
Z. Shen; Y. Li; H. Zhang; Y. Weng; J. Wang; | icassp | 2025-04-15 |
| 302 | Hazy Remote Sensing Image Semantic Segmentation with Weak Annotations Via Pre-training Optimization and Co-training Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Despite the numerous haze removal methods developed for remote sensing images, their efficacy in the subsequent task of semantic segmentation remains inadequate. To address these issues, this paper aims to enhance the robustness of the segmentation network against haze interference by proposing a weakly supervised semantic segmentation framework based on pre-training optimization and dual-network co-training. |
J. Xu; L. Zhang; | icassp | 2025-04-15 |
| 303 | UMSSS: A Visual Scene Semantic Segmentation Dataset for Underground Mines Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes a challenging semantic segmentation dataset focusing on underground mines, named the underground mine scenes semantic segmentation (UMSSS) dataset, which contains 4200 high-quality annotated images and 18 annotated categories. |
J. Wang; | icassp | 2025-04-15 |
| 304 | MASSeg : 2nd Technical Report for 4th PVUW MOSE Track Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This report presents our solution, which ranked second in the MOSE track of CVPR 2025 PVUW Challenge. |
XUQIANG CAO et. al. | arxiv-cs.CV | 2025-04-14 |
| 305 | IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework Under Limited Annotation Scheme Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing methods struggle to balance global semantic representation with fine-grained local feature extraction. To address this challenge, we propose a novel tri-branch semi-supervised segmentation framework incorporating a dual-teacher strategy, named IGL-DT. |
DINH DAI QUAN TRAN et. al. | arxiv-cs.CV | 2025-04-13 |
| 306 | AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This necessitates the development of OVS approaches specifically tailored for remote sensing. In this context, we propose AerOSeg, a novel OVS approach for remote sensing data. |
Saikat Dutta; Akhil Vasim; Siddhant Gole; Hamid Rezatofighi; Biplab Banerjee; | arxiv-cs.CV | 2025-04-12 |
| 307 | ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This semantic understanding is a crucial prerequisite for animation tools that seek to modify figures while preserving their unique style. To help achieve this, we propose a novel hierarchical segmentation model, built upon the architecture and pre-trained SAM, to quickly and accurately obtain these semantic labels. |
Astitva Srivastava; Harrison Jesse Smith; Thu Nguyen-Phuoc; Yuting Ye; | arxiv-cs.GR | 2025-04-10 |
| 308 | PathSegDiff: Pathology Segmentation Using Diffusion Model Representations Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose PathSegDiff, a novel approach for histopathology image segmentation that leverages Latent Diffusion Models (LDMs) as pre-trained featured extractors. |
Sachin Kumar Danisetty; Alexandros Graikos; Srikar Yellapragada; Dimitris Samaras; | arxiv-cs.CV | 2025-04-09 |
| 309 | MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, segmenting moving objects from a single image remains challenging for existing methods due to the absence of temporal cues. To address this gap, we propose MovSAM, the first framework for single-image moving object segmentation. |
CHANG NIE et. al. | arxiv-cs.CV | 2025-04-09 |
| 310 | InvNeRF-Seg: Fine-Tuning A Pre-Trained NeRF for 3D Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose Invariant NeRF for Segmentation (InvNeRFSeg), a two step, zero change fine tuning strategy for 3D segmentation. |
Jiangsan Zhao; Jakob Geipel; Krzysztof Kusnierek; Xuean Cui; | arxiv-cs.CV | 2025-04-08 |
| 311 | Zero-Shot 4D Lidar Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, the primary challenge in advancing research and developing generalized, versatile methods for spatio-temporal scene understanding in Lidar lies in the scarcity of datasets that provide the necessary diversity and scale of annotations.To overcome these challenges, we propose SAL-4D (Segment Anything in Lidar–4D), a method that utilizes multi-modal robotic sensor setups as a bridge to distill recent developments in Video Object Segmentation (VOS) in conjunction with off-the-shelf Vision-Language foundation models to Lidar. We utilize VOS models to pseudo-label tracklets in short video sequences, annotate these tracklets with sequence-level CLIP tokens, and lift them to the 4D Lidar space using calibrated multi-modal sensory setups to distill them to our SAL-4D model. |
Yushan Zhang; Aljoša Ošep; Laura Leal-Taixé; Tim Meinhardt; | arxiv-cs.CV | 2025-04-01 |
| 312 | CCANet: Cross-Modality Comprehensive Feature Aggregation Network for Indoor Scene Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The semantic segmentation of indoor scenes based on RGB and depth information has been a persistent and enduring research topic. However, how to fully utilize the complementarity … |
ZHANG ZIHAO et. al. | IEEE Transactions on Cognitive and Developmental Systems | 2025-04-01 |
| 313 | Hierarchical Context Learning of Object Components for Unsupervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save |
Dong Bao; Jun Zhou; Gervase Tuxworth; Jue Zhang; Yongsheng Gao; | Pattern Recognit. | 2025-04-01 |
| 314 | Combining Feature Compensation and GCN-based Reconstruction for Multimodal Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Zhen Wang; Jiayuan Li; Nan Xu; Zhuhong You; | Inf. Fusion | 2025-04-01 |
| 315 | HSPFormer: Hierarchical Spatial Perception Transformer for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: Semantic perception in driving scenarios plays a crucial role in intelligent transportation systems. However, existing Transformer-based semantic segmentation methods often do not … |
SIYU CHEN et. al. | IEEE Transactions on Intelligent Transportation Systems | 2025-04-01 |
| 316 | Domain-Incremental Semantic Segmentation for Traffic Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Traffic scene segmentation is an important visual perception process to provide strong support for the decision-making of autonomous driving systems. The traffic scene is an open … |
Yazhou Liu; Haoqi Chen; P. Lasang; Zheng Wu; | IEEE Transactions on Intelligent Transportation Systems | 2025-04-01 |
| 317 | Knowledge Distillation for Reduced Footprint Semantic Segmentation with The U-Net Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Model compression techniques such as knowledge distillation, pruning, and quantization are well documented in the computer vision literature for image classification and … |
Ciro Rosa; Nina Hirata; | Proceedings of the 40th ACM/SIGAPP Symposium on Applied … | 2025-03-31 |
| 318 | Improving Underwater Semantic Segmentation with Underwater Image Quality Attention and Muti-scale Aggregation Attention Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, the low illumination in underwater environments degrades the imaging quality, which in turn seriously deteriorates the performance of underwater semantic segmentation, particularly for outlining the object region boundaries. To tackle this issue, we present UnderWater SegFormer (UWSegFormer), a transformer-based framework for semantic segmentation of low-quality underwater images. |
Xin Zuo; Jiaran Jiang; Jifeng Shen; Wankou Yang; | arxiv-cs.CV | 2025-03-30 |
| 319 | Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Additionally, most existing methods ignore the uncertainty of the scene recognition problem, leading to low success rates, particularly in ambiguous and complex environments. To address these challenges, we propose an open-vocabulary scene semantic segmentation and detection pipeline leveraging Vision Language Models (VLMs) and Large Language Models (LLMs). |
Yifan Xu; Vineet Kamat; Carol Menassa; | arxiv-cs.CV | 2025-03-29 |
| 320 | A Dataset for Semantic Segmentation in The Presence of Unknowns Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing datasets allow evaluation of only knowns or unknowns – but not both, which is required to establish in the wild suitability of deep neural network models. To bridge this gap, we propose a novel anomaly segmentation dataset, ISSU, that features a diverse set of anomaly inputs from cluttered real-world environments. |
ZAKARIA LASKAR et. al. | arxiv-cs.CV | 2025-03-28 |
| 321 | Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, it often overfits and memorizes training data, limiting their ability to generate diverse and well-aligned samples. To overcome these issues, we propose Concept-Aware LoRA (CA-LoRA), a novel fine-tuning approach that selectively identifies and updates only the weights associated with necessary concepts (e.g., style or viewpoint) for domain alignment while preserving the pretrained knowledge of the T2I model to produce informative samples. |
MINHO PARK et. al. | arxiv-cs.CV | 2025-03-28 |
| 322 | Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we propose a novel approach able to generate 3D semantic scene-scale data without relying on any projection or decoupled trained multi-resolution models, achieving more realistic semantic scene data generation compared to previous state-of-the-art methods. |
Lucas Nunes; Rodrigo Marcuzzi; Jens Behley; Cyrill Stachniss; | arxiv-cs.CV | 2025-03-27 |
| 323 | A Deep Learning Framework for Boundary-Aware Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, they still struggle with blurred target boundaries and insufficient recognition of small targets. To address these issues, this study proposes a Mask2Former-based semantic segmentation algorithm incorporating a boundary enhancement feature bridging module (BEFBM). |
TAI AN et. al. | arxiv-cs.CV | 2025-03-27 |
| 324 | OpenLex3D: A Tiered Evaluation Benchmark for Open-Vocabulary 3D Scene Representations Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: By introducing an open-set 3D semantic segmentation task andan object retrieval task, we evaluate various existing 3D open-vocabularymethods on OpenLex3D, showcasing failure cases, and avenues for improvement.Our experiments provide insights on feature precision, segmentation, anddownstream capabilities. |
CHRISTINA KASSAB et. al. | arxiv-cs.CV | 2025-03-25 |
| 325 | The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We benchmark a wide range of semantic segmentation models, and find that transfer learning from Coralscapes to existing smaller datasets consistently leads to state-of-the-art performance. |
JONATHAN SAUDER et. al. | arxiv-cs.CV | 2025-03-25 |
| 326 | OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: 3D scene understanding has been transformed by open-vocabulary language models that enable interaction via natural language. However, the evaluation of these representations is … |
CHRISTINA KASSAB et. al. | ArXiv | 2025-03-25 |
| 327 | BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Nevertheless, existing RGB-T semantic segmentation models typically depend on simple addition or concatenation strategies or ignore the differences between information at different levels. To address these issues, we proposed a novel RGB-T road scene semantic segmentation network called Brain-Inspired Multi-Iteration Interaction Network (BIMII-Net). |
Hanshuo Qiu; Jie Jiang; Ruoli Yang; Lixin Zhan; Jizhao Liu; | arxiv-cs.CV | 2025-03-24 |
| 328 | Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Current models, such as CNN and Transformer-based architectures, excel at identifying pixel-level features but fail to distinguish semantically similar objects (e.g., doctor vs. nurse in a hospital scene) or understand complex contextual scenarios (e.g., differentiating a running child from a regular pedestrian in autonomous driving). To address these limitations, we proposed a novel Context-Aware Semantic Segmentation framework that integrates Large Language Models (LLMs) with state-of-the-art vision backbones. |
Ben Rahman; | arxiv-cs.CV | 2025-03-24 |
| 329 | Seg2Box: 3D Object Detection By Point-Wise Semantics Supervision Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, the challenge arises due to the incomplete geometry structure and boundary ambiguity of point-cloud instances, leading to inaccurate pseudo labels and poor detection results. To address these challenges, we propose a novel method, named Seg2Box. |
MAOJI ZHENG et. al. | arxiv-cs.CV | 2025-03-20 |
| 330 | SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Extending these capabilities to 3D segmentation introduces challenges, as CLIP’s image-based embeddings often lack the geometric detail necessary for 3D scene segmentation. Recent methods tend to address this by introducing additional segmentation models or replacing CLIP with variations trained on segmentation data, which lead to redundancy or loss on CLIP’s general language capabilities. |
WEIWEN HU et. al. | arxiv-cs.CV | 2025-03-19 |
| 331 | High Temporal Consistency Through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose a lightweight videosemantic segmentation approach-suited to onboard real-time inference-achievinghigh temporal consistency on aerial data through Semantic SimilarityPropagation across frames. |
Cédric Vincent; Taehyoung Kim; Henri Meeß; | arxiv-cs.CV | 2025-03-19 |
| 332 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation Using Features from A Pre-trained Image Segmentation Network Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The increasing demand for high-accuracy depth estimation in autonomous driving and augmented reality applications necessitates advanced neural architectures capable of effectively leveraging multiple data modalities. In this context, we introduce the Unified Segmentation Attention Mechanism Network (USAM-Net), a novel convolutional neural network that integrates stereo image inputs with semantic segmentation maps and attention to enhance depth estimation performance. |
Joseph Emmanuel DL Dayo; Prospero C. Naval Jr; | arxiv-cs.CV | 2025-03-19 |
| 333 | 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We accordingly propose the \textit{3D-AffordanceLLM} (3D-ADLLM), a framework designed for reasoning affordance detection in 3D open-scene. |
HENGSHUO CHU et. al. | iclr | 2025-03-17 |
| 334 | Adaptive Transformer Attention and Multi-Scale Fusion for Spine 3D Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This study proposes a 3D semantic segmentation method for the spine based on the improved SwinUNETR to improve segmentation accuracy and robustness. |
YANLIN XIANG et. al. | arxiv-cs.CV | 2025-03-17 |
| 335 | Adaptive Transformer Attention and Multi -Scale Fusion for Spine 3D Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This study proposes a 3D semantic segmentation method for the spine based on the improved SwinUNETR to improve segmentation accuracy and robustness. Aiming at the complex … |
YANLIN XIANG et. al. | 2025 5th International Conference on Artificial … | 2025-03-17 |
| 336 | Text4Seg: Reimagining Image Segmentation As Text Generation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce Text4Seg, a novel text-as-mask paradigm that casts image segmentation as a text generation problem, eliminating the need for additional decoders and significantly simplifying the segmentation process. |
MENGCHENG LAN et. al. | iclr | 2025-03-17 |
| 337 | DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, these models often struggle with domain-specific nuances and underrepresented fine-grained categories. To address these challenges, we introduce DynAlign, a two-stage framework that integrates UDA with foundation models to bridge both the image-level and label-level domain gaps. |
Han Sun; Rui Gong; Ismail Nejjar; Olga Fink; | iclr | 2025-03-17 |
| 338 | Clustering Is Back: Reaching State-of-the-art LiDAR Instance Segmentation Without Training Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we demonstrate that competitive panoptic segmentation can be achieved using only semantic labels, with instances predicted without any training or annotations. |
Corentin Sautier; Gilles Puy; Alexandre Boulch; Renaud Marlet; Vincent Lepetit; | arxiv-cs.CV | 2025-03-17 |
| 339 | Class Distribution-induced Attention Map for Open-vocabulary Semantic Segmentations Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we argue that CLIP-based prior works yield patch-wise noisy class predictions while having highly correlated class distributions for each object. |
Dong Un Kang; Hayeon Kim; Se Young Chun; | iclr | 2025-03-17 |
| 340 | Point Cloud Based Scene Segmentation: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To inspire future research, in this review paper, we provide a comprehensive overview of the current state-of-the-art methods in the field of Point Cloud Semantic Segmentation for autonomous driving. |
Dan Halperin; Niklas Eisl; | arxiv-cs.CV | 2025-03-16 |
| 341 | LangDA: Building Context-Awareness Via Language for Domain Adaptive Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Two key approaches in DASS are (1) vision-only approaches using masking or multi-resolution crops, and (2) language-based approaches that use generic class-wise prompts informed by target domain (e.g. a {snowy} photo of a {class}). |
CHANG LIU et. al. | arxiv-cs.CV | 2025-03-16 |
| 342 | OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper introduces adynamically configurable and highly automated LLM/LVLM-powered pipeline forevaluating OSM solutions called OSMa-Bench (Open Semantic Mapping Benchmark). |
Maxim Popov; Regina Kurkova; Mikhail Iumanov; Jaafar Mahmoud; Sergey Kolyubin; | arxiv-cs.CV | 2025-03-13 |
| 343 | Entropy Guidance Hierarchical Rich-scale Feature Network for Remote Sensing Image Semantic Segmentation of High Resolution Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
HAOXUE ZHANG et. al. | Appl. Intell. | 2025-03-13 |
| 344 | Zero-shot Image Segmentation for Scene Objects Based on The L0 Gradient Minimization and Adaptive Superpixel Method Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Hailong Yan; Junjiang Huang; Mao Zheng; Yijie Tang; | Neural Comput. Appl. | 2025-03-12 |
| 345 | MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Low-resolution image segmentation is crucial in real-world applications such as robotics, augmented reality, and large-scale scene understanding, where high-resolution data is often unavailable due to computational constraints. To address this challenge, we propose MaskAttn-UNet, a novel segmentation framework that enhances the traditional U-Net architecture via a mask attention mechanism. |
ANZHE CHENG et. al. | arxiv-cs.CV | 2025-03-11 |
| 346 | Aligning Instance-Semantic Sparse Representation Towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Driven by the tendency of high-dimensional semantically similar features to lie in or near low-dimensional subspaces, we introduce a one-stage, fully unsupervised framework towards semantic-aware shape representation. |
Jiaxin Li; Hongxing Wang; Jiawei Tan; Zhilong Ou; Junsong Yuan; | arxiv-cs.CV | 2025-03-10 |
| 347 | Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Our ideas are validated on PASCAL VOC using our new human annotations of approximate object sizes. |
Xingye Fan; Yuri Boykov; | arxiv-cs.CV | 2025-03-10 |
| 348 | OmniSAM: Omnidirectional Segment Anything Model for UDA in Panoramic Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Two major concerns for this application includes 1)inevitable distortion and object deformation brought by the large FoV disparitybetween domains; 2) the lack of pixel-level semantic understanding that theoriginal SAM2 cannot provide. To address these issues, we propose a novelOmniSAM framework, which makes the first attempt to apply SAM2 for panoramicsemantic segmentation. |
DING ZHONG et. al. | arxiv-cs.CV | 2025-03-10 |
| 349 | MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Inspired by cross-frame correlation in videos, we propose to treat multi-modal data as a sequence of frames representing the same scene. |
CHENFEI LIAO et. al. | arxiv-cs.CV | 2025-03-09 |
| 350 | Dynamically Evolving Segment Anything Model with Continuous Learning for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, in practical applications, the diversity of scenarios and tasks in medical image segmentation continues to expand, necessitating models that can dynamically evolve to meet the demands of various segmentation tasks. Here, we introduce EvoSAM, a dynamically evolving medical image segmentation model that continuously accumulates new knowledge from an ever-expanding array of scenarios and tasks, enhancing its segmentation capabilities. |
ZHAORI LIU et. al. | arxiv-cs.CV | 2025-03-08 |
| 351 | EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing mapping methods often suffer fromoverconfident semantic predictions, and sparse and noisy depth sensing, leadingto inconsistent map representations. In this paper, we therefore introduceEvidMTL, a multi-task learning framework that uses evidential heads for depthestimation and semantic segmentation, enabling uncertainty-aware inference frommonocular RGB images. |
Rohit Menon; Nils Dengler; Sicong Pan; Gokul Krishna Chenchani; Maren Bennewitz; | arxiv-cs.RO | 2025-03-06 |
| 352 | BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Conversely, LiDAR and radar sensors remain almost unaffected in these scenarios, and radar provides key velocity information of the objects. Therefore, we introduce BEVMOSNet, to our knowledge, the first end-to-end multimodal fusion leveraging cameras, LiDAR, and radar to precisely predict the moving objects in BEV. |
HIEP TRUONG CONG et. al. | arxiv-cs.CV | 2025-03-05 |
| 353 | GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing methods primarily focus on embedding compressed CLIP features to 3D Gaussians, suffering from low object segmentation accuracy and lack spatial reasoning capabilities. To address these limitations, we propose GaussianGraph, a novel framework that enhances 3DGS-based scene understanding by integrating adaptive semantic clustering and scene graph generation. |
XIHAN WANG et. al. | arxiv-cs.CV | 2025-03-05 |
| 354 | SurgiSAM2: Fine-tuning A Foundational Model for Surgical Video Anatomy Segmentation and Detection Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Methods: We utilized five public datasets to evaluate and fine-tune SAM 2 for segmenting anatomical tissues in surgical videos/images. |
DEVANISH N. KAMTAM et. al. | arxiv-cs.CV | 2025-03-05 |
| 355 | Exploring The Better Correlation for Few-Shot Video Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Few-shot video object segmentation (FSVOS) aims to achieve accurate segmentation of novel objects in given video sequences, where the target objects are specified by limited … |
NAISONG LUO et. al. | IEEE Transactions on Circuits and Systems for Video … | 2025-03-01 |
| 356 | Pseudo 5D Hyperspectral Light Field for Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Ruixuan Cong; Hao Sheng; Da Yang; Rongshan Chen; Zhenglong Cui; | Inf. Fusion | 2025-03-01 |
| 357 | TS‐Net: Trans‐Scale Network for Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Accurate medical image segmentation is crucial for clinical diagnosis and disease treatment. However, there are still great challenges for most existing methods to extract … |
HuiFang Wang; Yatong Liu; Jiongyao Ye; Dawei Yang; Yu Zhu; | International Journal of Imaging Systems and Technology | 2025-03-01 |
| 358 | Attention Guided Filter and Refinement Feature Network for Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
SHUSHENG LI et. al. | Knowl. Based Syst. | 2025-03-01 |
| 359 | CMAA: Channel-wise Multi-scale Adaptive Attention Network for Metallographic Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Yongliang Sun; Xiangyang Huang; | Expert Syst. Appl. | 2025-03-01 |
| 360 | Adaptive Sparse Lightweight Multi-scale Hybrid Network for Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
HAONAN SUN et. al. | Expert Syst. Appl. | 2025-03-01 |
| 361 | DCSSGA-UNet: Biomedical Image Segmentation with DenseNet Channel Spatial and Semantic Guidance Attention IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Tahir Hussain; Hayaru Shouno; M. A. Mohammed; Haydar Abdulameer Marhoon; Taukir Alam; | Knowl. Based Syst. | 2025-03-01 |
| 362 | CGViT: Cross-image GroupViT for Zero-shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Jie Jiang; Xingjian He; Xinxin Zhu; Weining Wang; Jing Liu; | Pattern Recognit. | 2025-03-01 |
| 363 | Boundaries Matters: A Novel Multibranch Semisupervised Semantic Segmentation Method Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In recent years, semisupervised semantic segmentation (SSS) research has been progressing rapidly. Existing methods usually ignore the classification of detailed pixels, such as … |
Yitong Li; Changlun Zhang; Hengyou Wang; | IEEE Intelligent Systems | 2025-03-01 |
| 364 | Realistic Evaluation of Deep Active Learning for Image Classification and Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
SUDHANSHU MITTAL et. al. | Int. J. Comput. Vis. | 2025-02-28 |
| 365 | FuseForm: Multimodal Transformer for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: For semantic segmentation, integrating multimodal data can vastly improve segmentation performance at the cost of increased model complexity. We introduce FuseForm, a multimodal … |
Justin McMillen; Yasin Yilmaz; | 2025 IEEE/CVF Winter Conference on Applications of Computer … | 2025-02-28 |
| 366 | Learning Under Noisy Labels, Spurious Points, and Diverse Structures: TS40K, A 3D Point Cloud Dataset of Rural Terrain and Electrical Transmission Systems Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Research in 3D scene understanding, particularly in autonomous driving and indoor segmentation, has made significant strides. However, most available datasets focus on urban … |
DIOGO MATEUS LAVADO et. al. | 2025 IEEE/CVF Winter Conference on Applications of Computer … | 2025-02-26 |
| 367 | Enhanced Neuromorphic Semantic Segmentation Latency Through Stream Event Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Traditional frame-based methods often struggle to balance latency, accuracy, and energy efficiency. To address these challenges, we leverage event streams from event-based cameras-bio-inspired sensors that trigger events in response to changes in the scene. |
D. Hareb; J. Martinet; B. Miramond; | arxiv-cs.CV | 2025-02-26 |
| 368 | SegBuilder: A Semi-Automatic Annotation Tool for Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This paper addresses the problem of image annotation for segmentation tasks. Semantic segmentation involves la-beling each pixel in an image with predefined categories, such as … |
Md. Alimoor Reza; Eric Manley; Sean Chen; Sameer Chaudhary; Jacob Elafros; | 2025 IEEE/CVF Winter Conference on Applications of Computer … | 2025-02-26 |
| 369 | Multi-Granularity Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we aim to generate multi-granularity video segmentation dataset that is annotated for both salient and non-salient masks. |
SANGBEOM LIM et. al. | aaai | 2025-02-25 |
| 370 | Every Component Counts: Rethinking The Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We present Connected-Component (CC)-Metrics, a novel semantic segmentation evaluation protocol, targeted to align existing semantic segmentation metrics to a multi-instance detection scenario in which each connected component matters. |
ALEXANDER JAUS et. al. | aaai | 2025-02-25 |
| 371 | SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we take a closer look at attention mechanisms of Stable Diffusion, from which we draw connections with classical seeded segmentation approaches. |
Joon Hyun Park; Kumju Jo; Sungyong Baik; | aaai | 2025-02-25 |
| 372 | Structural Pruning Via Spatial-aware Information Redundancy for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Within this framework, we introduce a spatial-aware redundancy metric based on feature maps, thus endowing the pruning process with location sensitivity to better adapt to pruning segmentation networks. |
Dongyue Wu; Zilin Guo; Li Yu; Nong Sang; Changxin Gao; | aaai | 2025-02-25 |
| 373 | InvSeg: Test-Time Prompt Inversion for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This discrepancy hinders diffusion models from capturing accurate visual-textual correlations. To solve this, we propose InvSeg, a test-time prompt inversion method that tackles open-vocabulary semantic segmentation by inverting image-specific visual context into text prompt embedding space, leveraging structure information derived from the diffusion model’s reconstruction process to enrich text prompts so as to associate each class with a structure-consistent mask. |
Jiayi Lin; Jiabo Huang; Jian Hu; Shaogang Gong; | aaai | 2025-02-25 |
| 374 | Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Domain randomization-based methods frequently incorporate domain-irrelevant noise due to the uncontrollability of style transformations, resulting in segmentation ambiguity. To address these challenges, we introduce a novel framework, named SCSD for Semantic Consistency prediction and Style Diversity generalization. |
Hongwei Niu; Linhuang Xie; Jianghang Lin; Shengchuan Zhang; | aaai | 2025-02-25 |
| 375 | Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Specifically, for class consistency, we propose Consistency Correlation Attention (CCA) to encourage the network to focus on the contribution of class features to semantic dependencies. |
SIYANG FENG et. al. | aaai | 2025-02-25 |
| 376 | Efficient Event-Based Semantic Segmentation Via Exploiting Frame-Event Fusion: A Hybrid Neural Network Approach Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing event-based semantic segmentation methods often fail to fully exploit the complementary information provided by frames and events, resulting in complex training strategies and increased computational costs. To address these challenges, we propose an efficient hybrid framework for image semantic segmentation, comprising a Spiking Neural Network branch for events and an Artificial Neural Network branch for frames. |
HEBEI LI et. al. | aaai | 2025-02-25 |
| 377 | SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Even worse, most of the existing approaches pay much attention to image-level information and ignore semantic features, resulting in the inability to perceive weak boundaries. To address these issues, we propose a novel Semantic-Guided Triplet Co-training (SGTC) framework, which achieves high-end medical image segmentation by only annotating three orthogonal slices of a few volumetric samples, significantly alleviating the burden of radiologists. |
Ke Yan; Qing Cai; Fan Zhang; Ziyan Cao; Zhi Liu; | aaai | 2025-02-25 |
| 378 | Holistic Correction with Object Prototype for Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we propose a Holistic Correction Network (HCNet) to adaptively acquire concise object prototypes for holistic correction at semantic, spatial and temporal aspects. |
Shengye Qiao; Changqun Xia; Yanjie Liang; Gongjin Lan; Jia Li; | aaai | 2025-02-25 |
| 379 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose a unique neural model, leveraging advances from the state space and diffusion generative modeling to achieve remarkable 3D semantic scene completion performance with monocular image input. |
Li Liang; Naveed Akhtar; Jordan Vice; Xiangrui Kong; Ajmal Saeed Mian; | aaai | 2025-02-25 |
| 380 | S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In response, we introduce a novel, domain-agnostic, add-on, and data-driven strategy inspired by image stacking in image denoising. |
Yimu Pan; Sitao Zhang; Alison D. Gernand; Jeffery A. Goldstein; James Z. Wang; | aaai | 2025-02-25 |
| 381 | Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: As a result, optimization typically lacks awareness of semantic category information, which can result in floaters with ambiguous segmentation. To address these challenges, we introduce CCGS, a method designed to achieve both view consistent 2D segmentation and a compact 3D Gaussian segmentation field. |
WENHAO HU et. al. | arxiv-cs.CV | 2025-02-22 |
| 382 | Multimodal Deep Learning Framework for Enhanced Semantic Scene Classification Using RGB-D Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Typically, an abstract scene is made up of multiple less abstract elements, such themes or objects. The semantic gap between low-level visual characteristics and abstract scenes … |
Aysha Naseer; Ahmad Jalal; | 2025 6th International Conference on Advancements in … | 2025-02-18 |
| 383 | Hybrid Deep Learning Aerial Framework for Road Scene Objects Segmentation and Classification Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This research proposes an advanced approach of object segmentation and categorization using aerial image sequences for enhancing intelligent traffic monitoring systems. … |
Aysha Naseer; Ahmad Jalal; | 2025 6th International Conference on Advancements in … | 2025-02-18 |
| 384 | From Open-Vocabulary to Vocabulary-Free Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This work proposes a Vocabulary-Free Semantic Segmentation pipeline, eliminating the need for predefined class vocabularies. |
KLARA REICHARD et. al. | arxiv-cs.CV | 2025-02-17 |
| 385 | NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this thesis, we introduce a novel approach named NPSim, which enables the simulation of realistic nighttime images from real daytime counterparts with monocular inverse rendering and ray tracing. |
Shutong Zhang; | arxiv-cs.CV | 2025-02-15 |
| 386 | Prototype Contrastive Consistency Learning for Semi-Supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, although previous contrastive learning methods can mine semantic information from partial pixels within images, they ignore the whole context information of unlabeled images, which is very important to precise segmentation. In order to solve this problem, we propose a novel prototype contrastive learning method called Prototype Contrastive Consistency Segmentation (PCCS) for semi-supervised medical image segmentation. |
Shihuan He; Zhihui Lai; Ruxin Wang; Heng Kong; | arxiv-cs.CV | 2025-02-10 |
| 387 | Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The study applies the U-Net model for effective feature extraction by using Convolutional Neural Network (CNN) segmentation techniques. |
Mitul Goswami; Sainath Dey; Aniruddha Mukherjee; Suneeta Mohanty; Prasant Kumar Pattnaik; | arxiv-cs.CV | 2025-02-08 |
| 388 | Deep Unfolding Multi-modal Image Fusion Network Via Attribution Analysis IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Although some approaches attempt to jointly optimize image fusion and downstream tasks, these efforts often lack direct guidance or interaction, serving only to assist with a predefined fusion loss. To address this, we propose an “Unfolding Attribution Analysis Fusion network” (UAAFusion), using attribution analysis to tailor fused images more effectively for semantic segmentation, enhancing the interaction between the fusion and segmentation. |
HAOWEN BAI et. al. | arxiv-cs.CV | 2025-02-03 |
| 389 | Image-text Aggregation for Open-vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Shengyang Cheng; Jianyong Huang; Xiaodong Wang; Lei Huang; Zhiqiang Wei; | Neurocomputing | 2025-02-01 |
| 390 | Image-point Cloud Embedding Network for Simultaneous Image-based Farmland Instance Extraction and Point Cloud-based Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Jinpeng Li; Yuan Li; Shuhang Zhang; Yiping Chen; | Int. J. Appl. Earth Obs. Geoinformation | 2025-02-01 |
| 391 | Increase The Sensitivity of Moderate Examples for Semantic Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
QUAN TANG et. al. | Image Vis. Comput. | 2025-02-01 |
| 392 | LBFormer: Scene Perception Segmentation Transformer Based on Local Block Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Scene perception for autonomous vehicles and vessels is crucial for autonomous navigation. Current mainstream transformer methods typically split the feature map into windows, … |
YONGJIE ZHANG et. al. | IEEE Transactions on Industrial Informatics | 2025-02-01 |
| 393 | INF-PCA: Implicit Neural Field-Based Interactive Point Cloud Semantic Annotation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Point cloud semantic segmentation helps Intelligent Transportation Systems understand traffic scenes by assigning semantic label to each point in the point cloud, and it relies on … |
CHONG LIU et. al. | IEEE Transactions on Intelligent Transportation Systems | 2025-02-01 |
| 394 | Removing Visual Occlusion of Construction Scaffolds Via A Two-step Method Combining Semantic Segmentation and Image Inpainting Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Yuexiong Ding; Muyang Liu; Ming Zhang; Xiaowei Luo; | Eng. Appl. Artif. Intell. | 2025-02-01 |
| 395 | Boundary Semantic Interactive Aggregation Network for Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Fan Zhang; Qijun Lv; Binrong Pan; Yun Wang; | Expert Syst. Appl. | 2025-02-01 |
| 396 | Lifting By Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce Lifting By Gaussians (LBG), a novel approach for open-world instance segmentation of 3D Gaussian Splatted Radiance Fields (3DGS). |
Rohan Chacko; Nicolai Haeni; Eldar Khaliullin; Lin Sun; Douglas Lee; | arxiv-cs.CV | 2025-01-31 |
| 397 | SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we present a modified CARLA simulator designed with LiDAR semantic segmentation in mind, with new classes, more consistent object labeling with their counterparts from real datasets such as SemanticKITTI, and the possibility to adjust the object class distribution. |
Javier Montalvo; Pablo Carballeira; Álvaro García-Martín; | arxiv-cs.CV | 2025-01-31 |
| 398 | Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Our approach introduces an innovative sketch-guided interactive segmentation framework, allowing users to intuitively annotate objects with freehand sketches (drawing a rough contour of the object) instead of the traditional bounding boxes or points used in classic interactive segmentation models like SAM. |
YING ZANG et. al. | arxiv-cs.CV | 2025-01-31 |
| 399 | Side Information-driven Image Coding for Hybrid Machine–human Vision Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: With the development of machine learning, advanced photography and image transmission systems, images are being processed more and more by machines, so image coding for machines … |
Zhongpeng Zhang; Ying Liu; Wen-Hsiao Peng; | EURASIP Journal on Image and Video Processing | 2025-01-28 |
| 400 | Beyond-Labels: Advancing Open-Vocabulary Segmentation With Vision-Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Our research proposesBeyond-Labels, a lightweight transformer-based fusion module that uses asmall amount of image segmentation data to fuse frozen visual representationswith language concepts. |
Muhammad Atta ur Rahman; Dooseop Choi; Seung-Ik Lee; KyoungWook Min; | arxiv-cs.CV | 2025-01-28 |
| 401 | Freestyle Sketch-in-the-Loop Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we expand the domain of sketch research into the field of image segmentation, aiming to establish freehand sketches as a query modality for subjective image segmentation. |
SUBHADEEP KOLEY et. al. | arxiv-cs.CV | 2025-01-27 |
| 402 | Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose LangSeg, a novel LLM-guided semantic segmentation method that leverages context-sensitive, fine-grained subclass descriptors generated by LLMs. |
Philip Hughes; Larry Burns; Luke Adams; | arxiv-cs.CV | 2025-01-27 |
| 403 | D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper introduces a novel approach to 4D Panoptic LiDAR Segmentation that decouples semantic and instance segmentation, leveraging single-scan semantic predictions as prior information for instance segmentation. |
Maik Steinhauser; Laurenz Reichardt; Nikolas Ebert; Oliver Wasenmüller; | arxiv-cs.CV | 2025-01-27 |
| 404 | Improved Gated Recurrent Units Together with Fusion for Semantic Segmentation of Remote Sensing Images Based on Parallel Hybrid Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: : Transformer together with convolutional neural network (CNN) has achieved better performance than the pure module-based methods. However, the advantages of both coding styles … |
Tongchi Zhou; Hongyu He; Yanzhao Wang; Yuan Liao; | Multim. Syst. | 2025-01-20 |
| 405 | Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, these methods require massive parameter updates and computational effort during the feature extraction and fusion. To address this issue, we propose a novel multimodal fusion network (EFNet) based on an early fusion strategy and a simple but effective feature clustering for training efficient RGB-T semantic segmentation. |
Zhengwen Shen; Yulian Li; Han Zhang; Yuchen Weng; Jun Wang; | arxiv-cs.CV | 2025-01-19 |
| 406 | LSSMask: A Lightweight Semantic Segmentation Network for Dynamic Object Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Xiaofeng Lian; Maomao Kang; Li Tan; Xiao Sun; Yanli Wang; | Signal Image Video Process. | 2025-01-17 |
| 407 | Surface-SOS: Self-Supervised Object Segmentation Via Neural Surface Representation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Under conditions of multi-camera inputs, the structural, textural and geometrical consistency among each view can be leveraged to achieve fine-grained object segmentation. To make better use of the above information, we propose Surface representation based Self-supervised Object Segmentation (Surface-SOS), a new framework to segment objects for each view by 3D surface representation from multi-view images of a scene. |
Xiaoyun Zheng; Liwei Liao; Jianbo Jiao; Feng Gao; Ronggang Wang; | arxiv-cs.CV | 2025-01-16 |
| 408 | Unsupervised Semantic Segmentation of Urban Scenes Via Cross-Modal Distillation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic image segmentation models typically require extensive pixel-wise annotations, which are costly to obtain and prone to biases. Our work investigates learning semantic … |
ANTONÍN VOBECKÝ et. al. | Int. J. Comput. Vis. | 2025-01-15 |
| 409 | Hierarchical Superpixel Segmentation Via Structural Information Theory Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: These approaches do not fully leverage the global information in the graph, leading to suboptimal segmentation quality. To address this limitation, we present SIT-HSS, a hierarchical superpixel segmentation method based on structural information theory. |
MINHUI XIE et. al. | arxiv-cs.CV | 2025-01-13 |
| 410 | Adaptive Noise-Tolerant Network for Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, instead of relying on clean segmentation labels, we study whether and how integrating imperfect or noisy segmentation results from off-the-shelf segmentation algorithms may help achieve better segmentation results through a new Adaptive Noise-Tolerant Network (ANTN) model. |
Weizhi Li; | arxiv-cs.CV | 2025-01-13 |
| 411 | RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, these approaches often struggle to establish robust alignments between fine-grained semantic concepts, leading to inconsistent representations across textual and visual information. To address these limitations, we introduce a referring remote sensing image segmentation foundational model, RSRefSeg. |
Keyan Chen; Jiafan Zhang; Chenyang Liu; Zhengxia Zou; Zhenwei Shi; | arxiv-cs.CV | 2025-01-12 |
| 412 | LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation Via Category-wise Attentive Classifier Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose a new large vocabulary semantic segmentation framework, called LarvSeg. |
HAOJUN YU et. al. | arxiv-cs.CV | 2025-01-12 |
| 413 | Multi-Grained Contrastive Learning for Text-Supervised Open-Vocabulary Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Learning open-vocabulary semantic segmentation (OVSS) from text supervision has recently received increasing attention for its promising potential in real-world applications. … |
Yajie Liu; Pu Ge; Guodong Wang; Qingjie Liu; Di-Wei Huang; | ACM Transactions on Multimedia Computing, Communications … | 2025-01-10 |
| 414 | Image Segmentation: Inducing Graph-based Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We compare our proposed UNet-GNN model against established convolutional neural networks (CNNs) based segmentation models, including U-Net and U-Net++, as well as the transformer-based SwinUNet. |
Aryan Singh; Pepijn Van de Ven; Ciarán Eising; Patrick Denny; | arxiv-cs.CV | 2025-01-07 |
| 415 | BEN: Using Confidence-Guided Matting for Dichotomous Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: As improvements in image segmentation become increasingly challenging to achieve, combining image matting and grayscale segmentation techniques offers promising new directions for architectural innovation. Inspired by the possibility of aligning these two model tasks, we propose a new architectural approach for DIS called Confidence-Guided Matting (CGM). |
Maxwell Meyer; Jack Spruyt; | arxiv-cs.CV | 2025-01-07 |
| 416 | LM-Net: A Light-weight and Multi-scale Network for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This results in over-segmentation, under-segmentation, and blurred segmentation boundaries. To tackle these challenges, we explore multi-scale feature representations from different perspectives, proposing a novel, lightweight, and multi-scale architecture (LM-Net) that integrates advantages of both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to enhance segmentation accuracy. |
Zhenkun Lu; Chaoyin She; Wei Wang; Qinghua Huang; | arxiv-cs.CV | 2025-01-07 |
| 417 | Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, Class Activation Map (CAM)-based methods still suffer from low spatial resolution and unclear boundaries. To address these issues, we propose a multi-level superpixel correction algorithm that refines CAM boundaries using superpixel clustering and floodfill. |
Hongyi Wu; Hong Zhang; | arxiv-cs.CV | 2025-01-07 |
| 418 | 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, these methods often overlook the segmentation consistency in space and time, which may result in point clouds within the same object being predicted as different categories. To handle this issue, our core idea is to generate cluster labels across multiple frames that can reflect the complete spatial structure and temporal information of objects. |
Jiexi Zhong; Zhiheng Li; Yubo Cui; Zheng Fang; | arxiv-cs.CV | 2025-01-06 |
| 419 | The 2nd Place Solution from The 3D Semantic Segmentation Track in The 2024 Waymo Open Dataset Challenge Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this report, we introduce MixSeg3D, a sophisticated combination of the strong point cloud segmentation model with advanced 3D data mixing strategies. |
Qing Wu; | arxiv-cs.CV | 2025-01-06 |
| 420 | Enhancing Semantic Scene Segmentation for Indoor Autonomous Systems Using Advanced Attention-supported Improved UNet Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
HOANG N. TRAN et. al. | Signal Image Video Process. | 2025-01-06 |
| 421 | MedSegDiffNCA: Diffusion Models With Neural Cellular Automata for Skin Lesion Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This work proposes three NCA-based improvements for diffusion-based medical image segmentation. |
Avni Mittal; John Kalkhof; Anirban Mukhopadhyay; Arnav Bhavsar; | arxiv-cs.CV | 2025-01-05 |
| 422 | IAM: Enhancing RGB-D Instance Segmentation with New Benchmarks Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: There is a relative scarcity of instance-level RGB-D segmentation datasets, which restricts current methods to broad category distinctions rather than fully capturing the fine-grained details required for recognizing individual objects. To bridge this gap, we introduce three RGB-D instance segmentation benchmarks, distinguished at the instance level. |
Aecheon Jung; Soyun Choi; Junhong Min; Sungeun Hong; | arxiv-cs.CV | 2025-01-03 |
| 423 | SSDFusion: A Semantic Segmentation Driven Framework for Infrared and Visible Image Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Fusing infrared images with visible images facilitates obtaining more abundant and accurate information content. However, existing infrared and visible image fusion methods often … |
QISHEN LV et. al. | IEEE Access | 2025-01-01 |
| 424 | Low-Light Enhancement and Global-Local Feature Interaction for RGB-T Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The performance of RGB-T semantic segmentation tasks is affected by the quality of visible (VIS) and infrared (IR) images captured by sensor instruments. In low-light … |
Xueyi Guo; Yisha Liu; Weimin Xue; Zhiwei Zhang; Zhuang Yan; | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
| 425 | 3D Scene Segmentation: A Comprehensive Survey and Open Problems Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Slavcho Neshev; Krasimir Tonchev; A. Manolova; Vladimir K. Poulkov; | IEEE Access | 2025-01-01 |
| 426 | L2A: Learning Affinity From Attention for Weakly Supervised Continual Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Despite significant advances in continual semantic segmentation (CSS), they still rely on the pixel-level annotation to train models, which is time-consuming and labor-intensive. … |
Hao Liu; Yong Zhou; Bing Liu; Ming Yan; Joey Tianyi Zhou; | IEEE Transactions on Circuits and Systems for Video … | 2025-01-01 |
| 427 | HPAN: Hierarchical Part-Aware Network for Fine-Grained Segmentation of Street View Imagery Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Street view imagery (SVI) has become a valuable geospatial data source for urban analysis, offering rich information about urban environments from a human-centric perspective. … |
LEIYANG ZHONG et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 428 | DSMF-Net: Dual Semantic Metric Learning Fusion Network for Few-Shot Aerial Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of aerial images is crucial yet resource-intensive. Inspired by human ability to learn rapidly, few-shot semantic segmentation offers a promising solution by … |
XIYU QI et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 429 | Tuning A SAM-Based Model With Multicognitive Visual Adapter to Remote Sensing Instance Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The segment anything model (SAM), a foundational model designed for promptable segmentation tasks, demonstrates exceptional generalization capabilities, making it highly promising … |
Linghao Zheng; Xinyang Pu; Su Zhang; Feng Xu; | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 430 | A Lightweight Semantic Segmentation Network Based on Self-Attention Mechanism and State Space Model for Efficient Urban Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In the semantic segmentation of remote sensing images, methods based on convolutional neural networks (CNNs) and Transformers have been extensively studied. Nevertheless, CNN … |
Langping Li; Jizheng Yi; Hui Fan; Hui Lin; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 431 | A Nested Self-supervised Learning Framework for 3-D Semantic Segmentation-driven Multi-modal Medical Image Fusion Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Ying Zhang; Ren-qi Nie; Jinde Cao; Chaozhen Ma; Mingchuan Tan; | Biomed. Signal Process. Control. | 2025-01-01 |
| 432 | Hierarchical Super-Pixels Graph Neural Networks for Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Xavier Hoarau; Julien Mille; Hugo Raguet; Romain Raveaux; | Workshop on Graph Based Representations in Pattern … | 2025-01-01 |
| 433 | Deep Merge: Deep-Learning-Based Region Merging for Remote Sensing Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Image segmentation represents a fundamental step in analyzing very high-spatial-resolution (VHR) remote sensing imagery. Its objective is to partition an image into segments that … |
XIANWEI LV et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 434 | Efficient Semantic Segmentation of Remote Sensing Images Through Global-Local Feature Integration Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The rapid acquisition of remote sensing information plays a significant role in the development of image semantic segmentation methods for remote sensing image interpretation … |
Fengyi Zhang; Xiuyu Xia; | IEEE Access | 2025-01-01 |
| 435 | UM2Former: U-Shaped Multimixed Transformer Network for Large-Scale Hyperspectral Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Transformer-based deep learning (DL) methods have gradually been advocated for remote sensing (RS) image semantic segmentation due to the great global modeling capability. … |
AIJUN XU et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 436 | Grid Point Serialized Transformer for LiDAR Point Cloud Semantic Segmentation in Various Densities and Heights Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Point cloud semantic segmentation is among the important tasks to achieve comprehensive perception of 3-D environments. However, current segmentation methods suffer from limited … |
Huchen Li; Wu-da Huang; Jiacheng Liu; Ke Chen; Fei Deng; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 437 | Boosting Few-Shot Semantic Segmentation With Prior-Driven Edge Feature Enhancement Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Few-shot semantic segmentation (FSS) focuses on segmenting objects of novel classes with only a small number of annotated samples and has achieved great development. However, … |
Jingkai Ma; Shuang Bai; Wenchao Pan; | IEEE Transactions on Artificial Intelligence | 2025-01-01 |
| 438 | Geographical Scenario Knowledge-Informed Graph Structure Attention for Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Deep learning methods, renowned for their ability to discern physical features from images, are frequently used in the semantic segmentation of remote sensing images. However, … |
HUILING ZHAO et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 439 | Tissue Segmentation for Traumatic Brain Injury Based on Multimodal MRI Image Fusion-semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
YAO XU et. al. | Biomed. Signal Process. Control. | 2025-01-01 |
| 440 | SIT-SAM: A Semantic-integration Transformer That Adapts The Segment Anything Model to Zero-shot Medical Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Wentao Shi; Junjun He; Yiqing Shen; | Biomed. Signal Process. Control. | 2025-01-01 |
| 441 | Multiview Integration Network for Multitask Robotic Surgical Scene Analysis Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Surgical scene analysis holds a pivotal role in robot-assisted surgery. However, existing methods often suffer from a single or few views, leading to erroneous scene analysis … |
WENTING SHEN et. al. | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
| 442 | FBINet: Few-Shot Semantic Segmentation With Foreground and Background Iteration Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Defect detection methods based on few-shot segmentation are becoming more and more popular in industrial applications, and few-shot segmentation methods need to use only a limited … |
Zhifu Huang; Ziwei Chen; Yu Liu; | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
| 443 | SalsaNext+: A Multimodal-Based Point Cloud Semantic Segmentation With Range and RGB Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Advances in sensor fusion techniques are redefining the landscape of 3D point cloud semantic segmentation, particularly for autonomous driving applications. We propose an enhanced … |
FABIO SÁNCHEZ-GARCÍA et. al. | IEEE Access | 2025-01-01 |
| 444 | Multitask Analysis Method for Tongue Image Based on Edge Computing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In response to the application scenarios of modernized Traditional Chinese Medicine (TCM) diagnostic and treatment equipment moving towards the user end, an effort has been made … |
TINGTING SONG et. al. | IEEE Access | 2025-01-01 |
| 445 | Dense Segmentation Techniques Using Deep Learning for Urban Scene Parsing: A Review Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Dense segmentation tasks, including semantic, instance, and panoptic segmentation, are essential for improving our comprehension of urban landscapes. This paper examines various … |
Rajesh Ankareddy; Radhakrishnan Delhibabu; | IEEE Access | 2025-01-01 |
| 446 | HMAFNet: Hybrid Mamba-Attention Fusion Network for Remote Sensing Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Remote sensing (RS) images have rich ground information, diverse object types, and large-scale differences, and these characteristics make difficulties in achieving precise … |
Haoyue Sun; Jianjun Liu; Jinlong Yang; Zebin Wu; | IEEE Geoscience and Remote Sensing Letters | 2025-01-01 |
| 447 | SAM Enhanced Semantic Segmentation for Remote Sensing Imagery Without Additional Training Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is a critical process in remote sensing image analysis, supporting various applications. The recent development of the segment anything model (SAM), a visual … |
YANG QIAO et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 448 | FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: As a result, information extracted directly from VLMs can’t meet the requirements of segmentation tasks. To address this limitation, we propose FGAseg, a model designed for fine-grained pixel-text alignment and category boundary supplementation. |
Bingyu Li; Da Zhang; Zhiyuan Zhao; Junyu Gao; Xuelong Li; | arxiv-cs.CV | 2025-01-01 |
| 449 | INVITATION: A Framework for Enhancing UAV Image Semantic Segmentation Accuracy Through Depth Information Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: With the increasing use of uncrewed aerial vehicles (UAVs), improving the accuracy of semantic segmentation is becoming critical. Depth information preserves geometric structure, … |
XIAODONG ZHANG et. al. | IEEE Geoscience and Remote Sensing Letters | 2025-01-01 |
| 450 | Multiscale Semantic Segmentation of Remote Sensing Images Based on Edge Optimization Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of remote sensing images is crucial for disaster monitoring, urban planning, and land use. Due to scene complexity and multiscale features of targets, … |
Wu-da Huang; Fei Deng; Haibing Liu; Mingtao Ding; Qi Yao; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 451 | Open-Vocabulary High-Resolution Remote Sensing Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Open-vocabulary image semantic segmentation (OVS) seeks to segment images into semantic regions across an open set of categories. Existing OVS methods commonly depend on … |
Qinglong Cao; Yuntian Chen; Chao Ma; Xiaokang Yang; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 452 | Semantic Uncertainty-Awared for Semantic Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Remote sensing image segmentation is crucial for applications ranging from urban planning to environmental monitoring. However, traditional approaches struggle with the unique … |
XIANGFENG QIU et. al. | IET Image Process. | 2025-01-01 |
| 453 | SOLSTM: Multisource Information Fusion Semantic Segmentation Network Based on SAR-OPT Matching Attention and Long Short-Term Memory Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: With the significant advancements in deep learning technology and the substantial improvement in remote sensing image resolution, remote sensing semantic segmentation has garnered … |
HAO CHANG et. al. | IEEE Geoscience and Remote Sensing Letters | 2025-01-01 |
| 454 | LHAS: A Lightweight Network Based on Hierarchical Attention for Hyperspectral Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Deep learning has garnered extensive attention in hyperspectral image (HSI) processing. However, its application in HSI semantic segmentation tasks has been relatively limited. … |
LUJIE SONG et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 455 | Peering Into The Heart: A Comprehensive Exploration of Semantic Segmentation and Explainable AI on The MnMs-2 Cardiac MRI Dataset Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Accurate and interpretable segmentation of medical images is crucial for computer-aided diagnosis and image-guided interventions. This study explores the integration of semantic … |
Mohamed Ayoob; Oshan Nettasinghe; Vithushan Sylvester; Helmini Bowala; Hamdaan Mohideen; | Applied Computer Systems | 2025-01-01 |
| 456 | Tuple Perturbation-Based Contrastive Learning Framework for Multimodal Remote Sensing Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Deep learning models exhibit promising potential in multimodal remote sensing image semantic segmentation (MRSISS). However, the constrained access to labeled samples for training … |
Y. YE et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 457 | Wetland Scene Segmentation of Remote Sensing Images Based on Lie Group Feature and Graph Cut Model Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Given the increasingly severe destruction of wetlands in recent years, research and monitoring for wetland protection are urgently needed. However, wetland monitoring still faces … |
Canyu Chen; Guobin Zhu; Xiliang Chen; | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 458 | BaAFN: A Boundary-Aware Attention Fusion Network for Remote Sensing Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The performance of remote sensing semantic segmentation on object boundaries and small objects continues to pose a significant challenge due to the semantics near them being … |
Jiaen Chen; Shengjie Xu; Yuchen Zheng; | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 459 | CSFNet: Cross-Modal Semantic Focus Network for Semantic Segmentation of Large-Scale Point Clouds Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of large-scale point clouds is an indispensable component of outdoor scene perception, providing essential 3-D semantic insights for applications in scene … |
YANG LUO et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 460 | LNFormer: Lightweight Design for Nighttime Semantic Segmentation With Transformer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: General image semantic segmentation methods mainly focus on daytime images with sufficient light, nighttime images have low contrast and blurred details compared to daytime … |
Longsheng Wei; Yuhang Liao; | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
| 461 | Style Adaptation for Avoiding Semantic Inconsistency in Unsupervised Domain Adaptation Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Ziqiang Liu; Zhaomin Chen; Huiling Chen; Shu Teng; Lei Chen; | Biomed. Signal Process. Control. | 2025-01-01 |
| 462 | Data Fusion and Models Integration for Enhanced Semantic Segmentation in Remote Sensing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Remote sensing semantic segmentation is a key research area in the remote sensing domain. Despite advancements, there is still no unified standard dataset such as ImageNet for … |
Xiaorui Dong; Jiansheng Li; Qingfang Chang; Shufeng Miao; Hongxiang Wan; | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 463 | An Improved Method for Zero-Shot Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Zero-shot semantic segmentation continues to face challenges in effectively handling unseen object classes, despite its critical applications in medical imaging, autonomous … |
Kong Kuok Yong; Tan Fong Ang; Chin Soon Ku; Firdaus Sahran; Lip Yee Por; | IEEE Access | 2025-01-01 |
| 464 | Confidence-Guided Joint Complementary Learning for Weakly Annotated Remote Sensing Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Object segmentation from weakly annotated remote sensing images (RSIs) is an essential task that helps substantially reduce pixelwise labeling costs. Although mainstream … |
Yanan Liu; Libao Zhang; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 465 | An Alternating Guidance With Cross-View Teacher–Student Framework for Remote Sensing Semi-Supervised Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The semantic segmentation of remote sensing images is crucial for Earth observation. The semi-supervised semantic segmentation method can effectively reduce the dependence of the … |
Yujia Fu; Mingyang Wang; G. Vivone; Yunhong Ding; Lin Zhang; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 466 | Enhanced BP Algorithm Combined With Semantic Segmentation and Subaperture for Improving Agricultural Scene Image Quality in GEO SAR Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Geosynchronous synthetic aperture radar (GEO SAR) plays a crucial role in various fields, such as crop growth monitoring, irrigation management, terrain and soil analysis, and … |
YIFAN WU et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 467 | Unsupervised Remote Sensing Image Semantic Segmentation Based on Multiscale Contrastive Domain Adaptation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Unsupervised domain adaptation (UDA) for remote sensing image semantic segmentation aims to train a deep model on the labeled source domain and apply it to the unlabeled target … |
Jie Geng; Shuai Song; Zhen Xu; Wen Jiang; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 468 | SiMultiF: A Remote Sensing Multimodal Semantic Segmentation Network With Adaptive Allocation of Modal Weights for Siamese Structures in Multiscene IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of remote sensing images is crucial for resource exploration, precision agriculture, and environmental monitoring. However, conducting semantic segmentation … |
SHICHAO CUI et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 469 | SACU-Net: Shape-Aware U-Net for Biomedical Image Segmentation With Attention Mechanism and Context Extraction Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: With the rapid development of convolutional neural networks in image processing, deep learning has been widely applied to medical image segmentation tasks, including liver, … |
Yinuo Cao; Yong Cheng; | IEEE Access | 2025-01-01 |
| 470 | UAVSeg: Dual-Encoder Cross-Scale Attention Network for UAV Images’ Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Benefiting from the powerful feature extraction and feature correlation modeling capabilities of convolutional neural networks (CNNs) and Transformer models, these techniques have … |
Zhen Wang; Zhuhong You; Nan Xu; Chuanlei Zhang; De-Shuang Huang; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 471 | DiffRSS: A Diffusion-Guided Multi-Scale Features Remote Sensing Image Segmentation Method Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation in remote sensing is a fundamental task with crucial applications across various domains. Traditional approaches primarily utilize bottom-up discriminative … |
Honghao Liu; Ruixia Yang; Yue Xu; Zhengchao Chen; Yuyang Zheng; | IEEE Access | 2025-01-01 |
| 472 | A Retinal Vessel Segmentation Network With Dual-Stage Network and Vessel Pixel Emendation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: An automatic retinal vessel detection method is vital for the computer-aided diagnosis of eye diseases. Nevertheless, high-precision fundus image segmentation still faces some … |
Yanhong Liu; Ji Shen; Chenxu Zhai; Lei Yang; Guibin Bian; | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
| 473 | Dual-Level Masked Semantic Inference for Semi-Supervised Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semi-supervised semantic segmentation pursues a holistic pixel-wise understanding of unseen images with limited annotation. To this end, existing methods focus on regularizing … |
QIANKUN MA et. al. | IEEE Transactions on Multimedia | 2025-01-01 |
| 474 | Robust One-Stop Multi-Modality Image Registration-Fusion-Segmentation Framework Against Misalignments and Adversarial Attacks Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In complex open scenes, multi-modality image fusion and segmentation encounter two challenges: i) Imaging misalignments, manifested as pixel shifts and structural distortions, are … |
Di Wang; Xianghao Jiao; Jinyuan Liu; Xin-Yue Fan; | IEEE Transactions on Multimedia | 2025-01-01 |
| 475 | PSDA: Pyramid Spatial Deformable Aggregation for Building Segmentation in Multiview Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: As increasingly more deep learning models are designed and implemented, the performance of single-view image semantic segmentation is approaching its upper limit. With the … |
XUEJUN HUANG et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 476 | RML: Efficient Representation Mutual Learning Framework for End-to-End Weakly Supervised Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Research on efficient semantic segmentation models is increasing the number of instrumentation and measurement applications. In recent years, there has been significant progress … |
Rongtao Xu; Changwei Wang; Shibiao Xu; Weiliang Meng; Xiaopeng Zhang; | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
| 477 | LEST: Large-Scale LiDAR Semantic Segmentation With Deployment-Friendly Transformer Architecture Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Large-scale LiDAR-based point cloud semantic segmentation is a critical challenge for autonomous driving perception. Most state-of-the-art LiDAR semantic segmentation methods rely … |
CHUANYU LUO et. al. | IEEE Access | 2025-01-01 |
| 478 | DarkSegNet: Low-light Semantic Segmentation Network Based on Image Pyramid Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Jintao Tan; Longyang Huang; Zhonghui Chen; Ruokun Qu; Chenglong Li; | Signal Process. Image Commun. | 2025-01-01 |
| 479 | Boundary-aware Semantic Segmentation of Remote Sensing Images Via Segformer and Snake Convolution Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of remote sensing images remains challenging due to complex object structures and varying scales. This paper proposes a novel hybrid segmentation model that … |
Yanting Xia; Lin Zhang; Ting Guo; Q. Jin; | Comput. Sci. Inf. Syst. | 2025-01-01 |
| 480 | A Fusion Method Incorporating Dual-Attention Mechanism and Transfer Learning Into UNet++ for Remote Sensing Image Coastline Extraction Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The segmentation of land and sea in remote sensing imagery is of great significance for coastline extraction and dynamic monitoring. Traditional coastline recognition and … |
YANRU SONG et. al. | IEEE Access | 2025-01-01 |
| 481 | MCKTNet: Multiscale Cross-Modal Knowledge Transfer Network for Semantic Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Multimodal data fusion can provide valuable and diverse information for remote sensing image segmentation. However, different modal data have different feature distributions, … |
Jian Cui; Jiahang Liu; Yue Ni; Yuan Sun; Mao-yin Guo; | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 482 | SAM2Former: Segment Anything Model 2 Assisting UNet-Like Transformer for Remote Sensing Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Remote sensing semantic segmentation plays a crucial role in the fields of land cover classification, disaster monitoring, and urban planning. However, due to the high complexity … |
XUEWEN LI et. al. | IEEE Access | 2025-01-01 |
| 483 | A Transfer Learning Approach for Landslide Semantic Segmentation Based on Visual Foundation Model Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Landslides are one of the most destructive natural disasters in the world, threatening human life and safety. With excellent performance as a foundation model for image … |
CHANGHONG HOU et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 484 | A Generalized Geodesic Voting Framework for Interactive Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In this article, we introduce a new variational model for addressing the image segmentation problem of minimal user interaction. The proposed variational segmentation model, … |
SHUWANG ZHOU et. al. | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
| 485 | OMRF-HS: Object Markov Random Field With Hierarchical Semantic Regularization for High-Resolution Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: As spatial resolution increases in remote-sensing imagery, the challenge of semantic segmentation intensifies due to the need to discern intricate changes in terrain. Terrain, a … |
HAOYU FU et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
| 486 | EPNet: An Efficient Postprocessing Network for Enhancing Semantic Segmentation in Autonomous Driving Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is of great importance in the field of autonomous driving, as it provides semantic information for a scene that intelligent vehicles need to interact with. … |
Libo Sun; Jiatong Xia; Hui Xie; Changming Sun; | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
| 487 | Cross-Scale Feature Interaction Network for Semantic Segmentation in Side-Scan Sonar Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic Segmentation in side-scan sonar images (SSS-Seg) is an emerging topic and plays an important function in sonar image interpretation. However, due to the interference of … |
Zhen Wang; Zhuhong You; Nan Xu; Buhong Wang; De-Shuang Huang; | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
| 488 | PanoSLAM: Panoptic 3D Scene Reconstruction Via Gaussian SLAM Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce PanoSLAM, the first SLAM system to integrate geometric reconstruction, 3D semantic segmentation, and 3D instance segmentation within a unified framework. |
RUNNAN CHEN et. al. | arxiv-cs.CV | 2024-12-31 |
| 489 | OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose \textbf{OVGaussian}, a generalizable \textbf{O}pen-\textbf{V}ocabulary 3D semantic segmentation framework based on the 3D \textbf{Gaussian} representation. |
RUNNAN CHEN et. al. | arxiv-cs.CV | 2024-12-31 |
| 490 | HisynSeg: Weakly-Supervised Histopathological Image Segmentation Via Image-Mixing Synthesis and Consistency Regularization Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, CAM-based methods are prone to suffer from under-activation and over-activation issues, leading to poor segmentation performance. To address this problem, we propose a novel weakly-supervised semantic segmentation framework for histopathological images based on image-mixing synthesis and consistency regularization, dubbed HisynSeg. |
Zijie Fang; Yifeng Wang; Peizhang Xie; Zhi Wang; Yongbing Zhang; | arxiv-cs.CV | 2024-12-30 |
| 491 | LiDAR-Camera Fusion for Video Panoptic Segmentation Without Video Training Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This work seeks to introduce a feature fusion module that enhances PS and VPS by fusing LiDAR and image data for autonomous vehicles. |
Fardin Ayar; Ehsan Javanmardi; Manabu Tsukada; Mahdi Javanmardi; Mohammad Rahmati; | arxiv-cs.CV | 2024-12-30 |
| 492 | LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we propose a Language-Embedded Surface Field (LangSurf), which accurately aligns the 3D language fields with the surface of objects, facilitating precise 2D and 3D segmentation with text query, widely expanding the downstream tasks such as removal and editing. |
HAO LI et. al. | arxiv-cs.CV | 2024-12-23 |
| 493 | Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we present a multi-scale OOD segmentation method that exploits the confidence information of a foreground-background segmentation model. |
Samuel Marschall; Kira Maag; | arxiv-cs.CV | 2024-12-22 |
| 494 | Imaging Segmentation of Brain Tumors Based on The Modified U-net Method IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Brain tumor segmentation in medical image analysis is a challenging task. Deep learning techniques have recently shown promise in resolving a variety of computer vision problems, … |
Yajie Zhang; Hea Choon Ngo; Yifan Zhang; Noor Fazilla Abd Yusof; Xiaohan Wang; | Inf. Technol. Control. | 2024-12-21 |
| 495 | Leveraging Contrastive Learning for Semantic Segmentation with Consistent Labels Across Varying Appearances Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This paper introduces a novel synthetic dataset that captures urban scenes under a variety of weather conditions, providing pixel-perfect, ground-truth-aligned images to … |
JAVIER MONTALVO et. al. | ArXiv | 2024-12-21 |
| 496 | VerSe: Integrating Multiple Queries As Prompts for Versatile Cardiac MRI Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: Despite the advances in learning-based image segmentation approach, the accurate segmentation of cardiac structures from magnetic resonance imaging (MRI) remains a critical … |
BANGWEI GUO et. al. | ArXiv | 2024-12-20 |
| 497 | VerSe: Integrating Multiple Queries As Prompts for Versatile Cardiac MRI Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, they are semi-automatic and inefficient, due to their reliance on click-based prompts, especially for 3D cardiac MRI volumes. To address these limitations, we propose VerSe, a Versatile Segmentation framework to unify automatic and interactive segmentation through mutiple queries. |
BANGWEI GUO et. al. | arxiv-cs.CV | 2024-12-20 |
| 498 | Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We set a new state-of-the-art for SNNs in various semantic segmentation datasets, with a significant improvement of +12.7% mIoU and 5.0 efficiency on ADE20K, +14.3% mIoU and 5.2 efficiency on VOC2012, and +9.1% mIoU and 6.6 efficiency on CityScapes. |
ZHENXIN LEI et. al. | arxiv-cs.CV | 2024-12-19 |
| 499 | Language-guided Medical Image Segmentation with Target-informed Multi-level Contrastive Alignments Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we propose a language-guided segmentation network with Target-informed Multi-level Contrastive Alignments (TMCA). |
MINGJIAN LI et. al. | arxiv-cs.CV | 2024-12-18 |
| 500 | Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we treat segmentation as tokenizing pixels and study a united perceptual and semantic token compression for all granular understanding and consequently facilitate open vocabulary semantic segmentation. |
Jianyu Zhang; Li Zhang; Shijian Li; | arxiv-cs.CV | 2024-12-18 |
| 501 | Edge-Centric Real-Time Segmentation for Autonomous Underwater Cave Exploration Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This paper addresses the challenge of deploying machine learning (ML)-based segmentation models on edge platforms to facilitate real-time scene segmentation for Autonomous … |
MOHAMMADREZA MOHAMMADI et. al. | 2024 International Conference on Machine Learning and … | 2024-12-18 |
| 502 | Open-World Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this article, we tackle the problem of open-world panoptic segmentation, i.e., the task of discovering new semantic categories and new object instances at test time, while enforcing consistency among the categories that we incrementally discover. |
Matteo Sodano; Federico Magistri; Jens Behley; Cyrill Stachniss; | arxiv-cs.CV | 2024-12-17 |
| 503 | SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: SAM has shown promising binary segmentation performance in natural domains, however, transferring it to the medical domain remains challenging, as medical images often possess substantial inter-category overlaps. To address this, we propose the SEmantic-Guided SAM (SEG-SAM), a unified medical segmentation model that incorporates semantic medical knowledge to enhance medical segmentation performance. |
SHUANGPING HUANG et. al. | arxiv-cs.CV | 2024-12-17 |
| 504 | Classification Drives Geographic Bias in Street Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We examined if instance segmentation models trained on European driving scenes (Eurocentric models) are geo-biased. |
Rahul Nair; Gabriel Tseng; Esther Rolf; Bhanu Tokas; Hannah Kerner; | arxiv-cs.CV | 2024-12-15 |
| 505 | Efficient Image Transmission Using Semantic Communication for Static Environment Surveillance Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic Communication is a novel approach that focuses on transmitting only essential information, leading to significant reductions in the number of bits required and conserving … |
K. R. Nandakishore; Rayani Venkat; Sai Rithvik; Mohammed Zafar; Ali Khan; | 2024 IEEE International Conference on Advanced Networks and … | 2024-12-15 |
| 506 | DCSEG: Decoupled 3D Open-Set Segmentation Using Gaussian Splatting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We present a decoupled 3D segmentation pipeline to ensure modularity and adaptability to novel 3D representations as well as semantic segmentation foundation models. |
Luis Wiedmann; Luca Wiehe; David Rozenberszki; | arxiv-cs.CV | 2024-12-14 |
| 507 | CFSSeg: Closed-Form Solution for Class-Incremental Semantic Segmentation of 2D Images and 3D Point Clouds Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, stochastic gradient descent-based approaches inevitably update the model’s weights for past knowledge, leading to catastrophic forgetting, a problem exacerbated by pixel/point-level granularity. To address these challenges, we propose CFSSeg, a novel exemplar-free approach that leverages a closed-form solution, offering a practical and theoretically grounded solution for continual semantic segmentation tasks. |
JIAXU LI et. al. | arxiv-cs.CV | 2024-12-14 |
| 508 | SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, We introduce SuperGSeg, a novel approach that fosters cohesive, context-aware scene representation by disentangling segmentation and language field distillation. |
SIYUN LIANG et. al. | arxiv-cs.CV | 2024-12-13 |
| 509 | SPT: Sequence Prompt Transformer for Interactive Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing methods typically process one image at a time, failing to consider the sequential nature of the images. To overcome this limitation, we propose a novel method called Sequence Prompt Transformer (SPT), the first to utilize sequential image information for interactive segmentation. |
Senlin Cheng; Haopeng Sun; | arxiv-cs.CV | 2024-12-13 |
| 510 | FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Inspired by the characteristics of frequency domain similarity across different domains, we propose a Frequency-aware Matching Network (FAMNet), which includes two key components: a Frequency-aware Matching (FAM) module and a Multi-Spectral Fusion (MSF) module. |
Yuntian Bo; Yazhou Zhu; Lunbo Li; Haofeng Zhang; | arxiv-cs.CV | 2024-12-12 |
| 511 | A Deep Semantic Segmentation Network with Semantic and Contextual Refinements Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is a fundamental task in multimedia processing, which can be used for analyzing, understanding, editing contents of images and videos, among others. To … |
ZHIYAN WANG et. al. | ArXiv | 2024-12-11 |
| 512 | Superpixel-Based Hierarchical Graph Convolution for Enhanced Interpretability in Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Despite the success of pixel-based black-box models in most semantic segmentation algorithms for autonomous driving technology, identifying the causes of recognition errors … |
Yimeng Dong; Guangyao Liu; Yankai Yin; Mengxuan Wu; Feng Duan; | 2024 IEEE International Conference on Robotics and … | 2024-12-10 |
| 513 | Semantic Segmentation and Spatial Relationship Modeling in Hyperspectral Imagery Using Deep Learning and Graph-Based Representations Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The effective analysis of spatial data from diverse sources, such as satellite imagery and aerial views, remains pivotal for informed decision-making across various domains. This … |
Ravikumar Yenni; Arun P V; | 2024 14th Workshop on Hyperspectral Imaging and Signal … | 2024-12-09 |
| 514 | Channel Selection and Local Attention Transformer Model for Semantic Segmentation on UAV Remote Sensing Scene Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Compared with common urban landscape semantic segmentation, unmanned aerial vehicle (UAV) image semantic segmentation is more challenging because small targets have very low pixel … |
Da Liu; Hao Long; Zhenbao Liu; | IET Image Process. | 2024-12-09 |
| 515 | GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, this prevents the model from accessing information outside of the patches, limiting the performance. To address this issue, we propose GCUNet, a GNN-based contextual learning network for TLS semantic segmentation. |
Lei Su; Yang Du; | arxiv-cs.CV | 2024-12-08 |
| 516 | Efficient Semantic Splatting for Remote Sensing Multiview Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Remote sensing multiview image segmentation is essential for achieving accurate and consistent stereoscopic perception of target scenes. This task involves processing RGB images … |
Zipeng Qi; Hao Chen; Haotian Zhang; Zhengxia Zou; Z. Shi; | IEEE Transactions on Geoscience and Remote Sensing | 2024-12-08 |
| 517 | LULC-SegNet: Enhancing Land Use and Land Cover Semantic Segmentation with Denoising Diffusion Feature Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Deep convolutional networks often encounter information bottlenecks when extracting land object features, resulting in critical geometric information loss, which impedes semantic … |
Zongwen Shi; Junfu Fan; Yujie Du; Yuke Zhou; Yi Zhang; | Remote. Sens. | 2024-12-06 |
| 518 | Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis In-the-Wild Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: During inference, we introduce an automated exemplar retrieval method for selecting exemplar image-segmentation pairs efficiently. |
SIYOON JIN et. al. | arxiv-cs.CV | 2024-12-04 |
| 519 | Point-GR: Graph Residual Point Cloud Network for 3D Object Classification and Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents Point-GR, a novel deep learning architecture designed explicitly to transform unordered raw point clouds into higher dimensions while preserving local geometric features. |
Md Meraz; Md Afzal Ansari; Mohammed Javed; Pavan Chakraborty; | arxiv-cs.CV | 2024-12-04 |
| 520 | MedSegViG: Medical Image Segmentation with A Vision Graph Neural Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Medical image segmentation is a crucial step toward automatic clinical diagnosis, which has received growing interest. Although some existing methods based on convolutional neural … |
XINHONG LI et. al. | 2024 IEEE International Conference on Bioinformatics and … | 2024-12-03 |
| 521 | Progressive Stepwise Diffusion Model with Dual Decoders for Semi-Supervised Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semi-supervised medical image segmentation tasks aim to harness the potential of vast amounts of unlabeled data using a limited amount of annotated data. Denoising Diffusion … |
XIAOLIN HUANG et. al. | 2024 IEEE International Conference on Bioinformatics and … | 2024-12-03 |
| 522 | MFSegDiff: A Multi-Frequency Diffusion Model for Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Medical image segmentation accurately identifies and delineates diagnostic regions, which is a crucial step in the early detection and accurate diagnosis of diseases. Diffusion … |
Zidi Shi; Hua Zou; Fei Luo; Zhiyu Huo; | 2024 IEEE International Conference on Bioinformatics and … | 2024-12-03 |
| 523 | SJTU:Spatial Judgments in Multimodal Models Towards Unified Segmentation Through Coordinate Detection Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper introduces SJTU: Spatial Judgments in multimodal models – Towards Unified segmentation through coordinate detection, a novel framework that leverages spatial coordinate understanding to bridge vision-language interaction and precise segmentation, enabling accurate target identification through natural language instructions. |
Joongwon Chae; Zhenyu Wang; Peiwu Qin; | arxiv-cs.CV | 2024-12-03 |
| 524 | Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Additionally, the same region may have a strong response to more than one prompt and it will lead to semantic ambiguity for image super-resolution. To alleviate the above two issues, in this paper, we propose to consider semantic segmentation as an additional control condition into diffusion-based image super-resolution. |
JIAHUA XIAO et. al. | arxiv-cs.CV | 2024-12-03 |
| 525 | Mamba-SAM: An Adaption Framework for Accurate Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The Segment Anything Model (SAM) shows strong performance in natural images but struggles with medical images due to a significant semantic gap and characteristics like … |
YIFENG WU et. al. | 2024 IEEE International Conference on Bioinformatics and … | 2024-12-03 |
| 526 | Test-Time Medical Image Segmentation Using CLIP-Guided SAM Adaptation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Test-time medical image segmentation is a critical component in clinical practice, enabling pre-trained medical segmentation models to effectively adapt unseen medical samples … |
Haotian Chen; Yonghui Xu; Yanyu Xu; Yixin Zhang; Li-zhen Cui; | 2024 IEEE International Conference on Bioinformatics and … | 2024-12-03 |
| 527 | ECSeg: Edge-Cloud Switched Image Segmentation for Autonomous Vehicles Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Existing autonomous vehicles have not utilized the cloud computing for execution of their deep learning-based driving tasks due to the long vehicle-to-cloud communication latency. … |
Siyuan Zhou; D. V. Le; Rui Tan; | 2024 21st Annual IEEE International Conference on Sensing, … | 2024-12-02 |
| 528 | Advancing Perturbation Space Expansion Based on Information Fusion for Semi-supervised Remote Sensing Image Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Liang Zhou; Keyi Duan; Jinkun Dai; Yuanxin Ye; | Inf. Fusion | 2024-12-01 |
| 529 | RailEINet:A Novel Scene Segmentation Network for Automatic Train Operation Based on Feature Alignment IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Tao Sun; Baoqing Guo; Tao Ruan; Xingfang Zhou; Dingyuan Bai; | Eng. Appl. Artif. Intell. | 2024-12-01 |
| 530 | Domain Adaptation Transformer for Unsupervised Driving-Scene Segmentation in Adverse Conditions Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation in driving scenarios is important for modern autonomous driving technology. While the existing methods have shown promising results in segmenting … |
Wenyu Liu; Song Wang; Jianke Zhu; Xuansong Xie; Lei Zhang; | IEEE Transactions on Intelligent Transportation Systems | 2024-12-01 |
| 531 | Density-aware Global-Local Attention Network for Point Cloud Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The point cloud data collected in real scenes often contain small objects and categories with small sample sizes, which are difficult to handle by existing networks. In this regard, we propose a point cloud segmentation network that fuses local attention based on density perception with global attention. |
Chade Li; Pengju Zhang; Yihong Wu; | arxiv-cs.CV | 2024-11-30 |
| 532 | GradiSeg: Gradient-Guided Gaussian Segmentation with Enhanced 3D Boundary Precision IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: While 3D Gaussian Splatting enables high-quality real-time rendering, existing Gaussian-based frameworks for 3D semantic segmentation still face significant challenges in boundary recognition accuracy. To address this, we propose a novel 3DGS-based framework named GradiSeg, incorporating Identity Encoding to construct a deeper semantic understanding of scenes. |
ZEHAO LI et. al. | arxiv-cs.CV | 2024-11-30 |
| 533 | LMSeg: Unleashing The Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose to alleviate the above-mentioned issues by leveraging multiple large-scale models to enhance the alignment between fine-grained visual features and enriched linguistic features. |
HUADONG TANG et. al. | arxiv-cs.CV | 2024-11-30 |
| 534 | Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we present FreeGS, an unsupervised semantic-embedded 3DGS framework that achieves view-consistent 3D scene understanding without the need for 2D labels. |
WENBO ZHANG et. al. | arxiv-cs.CV | 2024-11-29 |
| 535 | HoliSDiP: Image Super-Resolution Via Holistic Semantics and Diffusion Prior Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: Text-to-image diffusion models have emerged as powerful priors for real-world image super-resolution (Real-ISR). However, existing methods may produce unintended results due to … |
LI-YUAN TSAO et. al. | ArXiv | 2024-11-27 |
| 536 | Enhancing Semantic Segmentation with Synthetic Image Generation: A Novel Approach Using Stable Diffusion and ControlNet Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This paper presents a novel methodology for generating synthetic images that adhere accurately to provided semantic segmentation maps using the Stable Diffusion model with the … |
Austin Bevacqua; Tanmay Singha; Duc-Son Pham; | 2024 International Conference on Digital Image Computing: … | 2024-11-27 |
| 537 | Semantic Image Segmentation of Cell Volumes Using 3D U-Net Convolutional Neural Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics. Traditionally image … |
LAZAR DASIC et. al. | 2024 IEEE 24th International Conference on Bioinformatics … | 2024-11-27 |
| 538 | OUTBACK: A Multimodal Synthetic Dataset for Rural Australian Off-road Robot Navigation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: One of the most important aspects of robot scene understanding is semantic segmentation of external environments. Urban environment semantic segmentation has been extensively … |
Liyana Wijayathunga; Dulitha Dabare; A. Rassau; Douglas Chai; S. Islam; | 2024 International Conference on Digital Image Computing: … | 2024-11-27 |
| 539 | Box for Mask and Mask for Box: Weak Losses for Multi-task Partially Supervised Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose Box-for-Mask and Mask-for-Box strategies, and their combination BoMBo, to distil necessary information from one task annotations to train the other. |
Hoàng-Ân Lê; Paul Berg; Minh-Tan Pham; | arxiv-cs.CV | 2024-11-26 |
| 540 | A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: A representative dataset for emerging countries consists of low-resolution images of poorly maintained roads and includes labels of damage classes; in this scenario, three challenges arise: objects with few pixels, objects with undefined shapes, and highly underrepresented classes. To tackle these challenges, this work proposes the Performance Increment Strategy for Semantic Segmentation (PISSS) as a methodology of 14 training experiments to boost performance. |
Rafael S. Toledo; Cristiano S. Oliveira; Vitor H. T. Oliveira; Eric A. Antonelo; Aldo von Wangenheim; | arxiv-cs.CV | 2024-11-25 |
| 541 | Effective SAM Combination for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose ESC-Net, a novel one-stage open-vocabulary segmentation model that leverages the SAM decoder blocks for class-agnostic segmentation within an efficient inference framework. |
MINHYEOK LEE et. al. | arxiv-cs.CV | 2024-11-21 |
| 542 | BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing 3D benchmarking datasets typically evaluate deep learning models under the assumption that training and test data are independently and identically distributed (IID), which affects the models’ usability for real-world point cloud segmentation. To address these challenges, we introduce the BelHouse3D dataset, a new synthetic point cloud dataset designed for 3D indoor scene semantic segmentation. |
Umamaheswaran Raman Kumar; Abdur Razzaq Fayjie; Jurgen Hannaert; Patrick Vandewalle; | arxiv-cs.CV | 2024-11-20 |
| 543 | SAM Carries The Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: The recently introduced Segment Anything Model (SAM) enables prompt-based segmentation and offers zero-shot generalization to unfamiliar objects. |
RON KEUTH et. al. | arxiv-cs.CV | 2024-11-19 |
| 544 | TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To efficiently integrate temporal information, we propose TP-UNet that utilizes temporal prompts, encompassing organ-construction relationships, to guide the segmentation UNet model. |
Ranmin Wang; Limin Zhuang; Hongkun Chen; Boyan Xu; Ruichu Cai; | arxiv-cs.CV | 2024-11-18 |
| 545 | Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we introduce a sampling-free approach for estimating well-calibrated confidence values for classification tasks, achieving alignment with true classification accuracy and significantly reducing inference time compared to sampling-based methods. |
Hanieh Shojaei Miandashti; Qianqian Zou; Claus Brenner; | arxiv-cs.CV | 2024-11-18 |
| 546 | ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this study, we enhance the semantic segmentation performance of CLIP by introducing new modules and modifications: 1) architectural changes in the last layer of ViT and the incorporation of attention maps from the middle layers with the last layer, 2) Image Engineering: applying data augmentations to enrich input image representations, and 3) using Large Language Models (LLMs) to generate definitions and synonyms for each class name to leverage CLIP’s open-vocabulary capabilities. |
M. Arda Aydın; Efe Mert Çırpar; Elvin Abdinli; Gozde Unal; Yusuf H. Sahin; | arxiv-cs.CV | 2024-11-18 |
| 547 | ULTra: Unveiling Latent Token Interpretability in Transformer-Based Understanding and Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, their complexity makes latent token representations difficult to interpret. We introduce ULTra, a framework for interpreting Transformer embeddings and uncovering meaningful semantic patterns within them. |
Hesam Hosseini; Ghazal Hosseini Mighan; Amirabbas Afzali; Sajjad Amini; Amir Houmansadr; | arxiv-cs.CV | 2024-11-15 |
| 548 | ULTra: Unveiling Latent Token Interpretability in Transformer Based Understanding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Transformers have revolutionized Computer Vision (CV) through self-attention mechanisms. However, their complexity makes latent token representations difficult to interpret. We … |
Hesam Hosseini; Ghazal Hosseini Mighan; Amirabbas Afzali; Sajjad Amini; Amir Houmansadr; | ArXiv | 2024-11-15 |
| 549 | Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Specifically, we introduce Trident, a training-free framework that first splices features extracted by CLIP and DINO from sub-images, then leverages SAM’s encoder to create a correlation matrix for global aggregation, enabling a broadened receptive field for effective segmentation. |
Yuheng Shi; Minjing Dong; Chang Xu; | arxiv-cs.CV | 2024-11-14 |
| 550 | Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a new approach that integrates learnable morphological skeleton prior into deep neural networks using the variational method. |
JUN XIE et. al. | arxiv-cs.CV | 2024-11-13 |
| 551 | Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Vision Transformers (ViT) have recently brought a new wave of research in the field of computer vision. These models have performed particularly well in image classification and segmentation. |
Ashim Dahal; Saydul Akbar Murad; Nick Rahimi; | arxiv-cs.CV | 2024-11-13 |
| 552 | Zero-shot Capability of SAM-family Models for Bone Segmentation in CT Scans Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The Segment Anything Model (SAM) and similar models build a family of promptable foundation models (FMs) for image and video segmentation. |
Caroline Magg; Hoel Kervadec; Clara I. Sánchez; | arxiv-cs.CV | 2024-11-13 |
| 553 | Semantic Segmentation with Attention-Modulated Feature Fusion in HRNET V2 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is pivotal for precise object identification and localization within images, a cornerstone for automated analysis and machine vision. Despite advancements, … |
Weijie Zhang; Shuhei Kaneko; Shuichi Arai; | 2024 International Symposium on Information Theory and Its … | 2024-11-10 |
| 554 | Superpixel Segmentation: A Long-Lasting Ill-Posed Problem Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Concurrently, recent deep learning-based superpixelmethods mainly focus on the object segmentation task at the expense ofregularity. In this ill-posed context, we show that we can achieve competitiveresults using a recent architecture like the Segment Anything Model (SAM),without dedicated training for the superpixel segmentation task. |
Rémi Giraud; Michaël Clément; | arxiv-cs.CV | 2024-11-10 |
| 555 | ZAHA: Introducing The Level of Facade Generalization and The Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In ZAHA, we introduce Level of Facade Generalization (LoFG), novel hierarchical facade classes designed based on international urban modeling standards, ensuring compatibility with real-world challenging classes and uniform methods’ comparison. |
OLAF WYSOCKI et. al. | arxiv-cs.CV | 2024-11-07 |
| 556 | OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To address the task, we propose a plug-and-play approach termed OLAF. |
Pranav Gupta; Rishubh Singh; Pradeep Shenoy; Ravikiran Sarvadevabhatla; | arxiv-cs.CV | 2024-11-05 |
| 557 | PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Subsequently, we mathematically design a padding-based rotation equivariant convolution mode (PreCM), which is not only applicable to multi-scale images and convolutional kernels but can also serve as a replacement component for various types of convolutions, such as dilated convolutions, transposed convolutions, and asymmetric convolution. |
Xinyu Xu; Huazhen Liu; Tao Zhang; Huilin Xiong; Wenxian Yu; | arxiv-cs.CV | 2024-11-03 |
| 558 | Enhanced Scene Understanding and Situation Awareness for Autonomous Vehicles Based on Semantic Segmentation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Accurate visual perception and comprehensive scene understanding are critical for the safety and reliability of autonomous vehicles (AVs). Nevertheless, the efficacy of visual … |
YIYUE ZHAO et. al. | IEEE Transactions on Systems, Man, and Cybernetics: Systems | 2024-11-01 |
| 559 | Panoramic Image Semantic Segmentation Using Channel Attention-based HarDNet and Distorted Boundary Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Xun Jin; Chongyang Zhu; De Li; | Multim. Syst. | 2024-11-01 |
| 560 | Temporal Consistency for RGB-Thermal Data-Based Semantic Scene Understanding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic scene understanding is a fundamental capability for autonomous vehicles. Under challenging lighting conditions, such as nighttime and on-coming headlights, the semantic … |
Haotian Li; Henry K. Chu; Yuxiang Sun; | IEEE Robotics and Automation Letters | 2024-11-01 |
| 561 | L-DeeplabV3+: A Lightweight Semantic Segmentation Algorithm for Complex Scene Perception Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Abstract. Current semantic segmentation algorithms are often burdened by high computational complexity and inadequate boundary localization accuracy in complex scenarios of … |
ZHENGSHUN FEI et. al. | Journal of Electronic Imaging | 2024-11-01 |
| 562 | Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications Via Diffusion-Based Image Editing Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce Cityscape-Adverse, a benchmark that employs diffusion-based image editing to simulate eight adverse conditions, including variations in weather, lighting, and seasons, while preserving the original semantic labels. |
NAUFAL SURYANTO et. al. | arxiv-cs.CV | 2024-11-01 |
| 563 | Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In response, we propose the Class-Aware Semantic Diffusion Model (CASDM), a novel approach which utilizes segmentation maps as conditions for image synthesis to tackle data scarcity and imbalance. |
Yihang Zhou; Rebecca Towning; Zaid Awad; Stamatia Giannarou; | arxiv-cs.CV | 2024-10-31 |
| 564 | Interactive Segmentation By Considering First-Click Intentional Ambiguity Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Given the fact that most of the related algorithms generate a single mask only, the robustness of which might be constrained due to the diversity of user intention in the early interaction stage, namely the vague selection of object part/whole object/adherent object, especially when there’s only one click. To handle this, we propose a novel framework called Diversified Interactive Segmentation Network (DISNet) in which we revisit the peculiarity of first-click: given an input image, DISNet outputs multiple candidate masks under the guidance of first-click only, then a Dual-attentional Mask Correction (DAMC) module is utilized to measure the complex mutual effect within first-click, all-clicks and image features. |
Kangpeng Hu; Quansen Sun; Yinghui Sun; Tao Wang; | mm | 2024-10-30 |
| 565 | Multi-fineness Boundaries and The Shifted Ensemble-aware Encoding for Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: There is limited focus on explicitly addressing semantic segmentation of point cloud boundaries. We introduce a method called Multi-fineness Boundary Constraint (MBC) to tackle this challenge. |
ZIMING WANG et. al. | mm | 2024-10-30 |
| 566 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose S3PT a novel scene semantics and structure guided clustering to provide more scene-consistent objectives for self-supervised training. |
MACIEJ K. WOZNIAK et. al. | arxiv-cs.CV | 2024-10-30 |
| 567 | Generalized Source-Free Domain-adaptive Segmentation Via Reliable Knowledge Propagation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we focus on a more challenging paradigm in semantic segmentation, Generalized SFDA (G-SFDA), aiming to achieve robust performance on both source and target domains. |
QI ZANG et. al. | mm | 2024-10-30 |
| 568 | 3D Scene De-occlusion in Neural Radiance Fields: A Framework for Obstacle Removal and Realistic Inpainting Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, the performance of these works have been validated for data collected in a narrow range of multi-view, while degrade for the wide range of multi-view. To address this problem, we propose a novel NeRF framework to remove the obstacle and reproduce occluded areas in high quality for both wide and narrow range of multi-view. |
Yi Liu; Xinyi Li; Wenjing Shuai; | mm | 2024-10-30 |
| 569 | 3D-GRES: Generalized 3D Referring Expression Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, current approaches are limited to segmenting a single target, restricting the versatility of the task. To overcome this limitation, we introduce Generalized 3D Referring Expression Segmentation (3D-GRES), which extends the capability to segment any number of instances based on natural language instructions. |
CHANGLI WU et. al. | mm | 2024-10-30 |
| 570 | Anatomical Prior Guided Spatial Contrastive Learning for Few-Shot Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose an anatomical prior guided spatial contrastive learning, called APSCL, which exploits anatomical prior knowledge derived from medical images to construct contrastive learning from a spatial perspective for few-shot medical image segmentation. |
Wendong Huang; Jinwu Hu; Xiuli Bi; Bin Xiao; | mm | 2024-10-30 |
| 571 | GS2-GNeSF: Geometry-Semantics Synergy for Generalizable Neural Semantic Fields Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing approaches to generalizable NeSF fall short in fully exploiting the geometric and semantic features as well as their mutual interactions, resulting in suboptimal performance in both novel-view image synthesis and semantic segmentation. To address this limitation, we propose Geometry-Semantics Synergy for Generalized Neural Semantic Fields (GS2-GNeSF), a novel approach aimed at improving the performance of generalizable NeSF through the comprehensive construction and synergistic interaction of geometric and semantic features. |
Chengshun Wang; Na Zhao; | mm | 2024-10-30 |
| 572 | Few-shot Semantic Segmentation Via Perceptual Attention and Spatial Control Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, due to probabilistic noising and denoising processes, it is difficult for them to maintain spatial relationships between inputs and outputs, leading to inaccurate segmentation masks. To address this issue, we propose a Diffusion-based Segmentation network (DiffSeg), which decouples probabilistic denoising and segmentation processes. |
GUANGCHEN SHI et. al. | mm | 2024-10-30 |
| 573 | RefMask3D: Language-Guided Transformer for 3D Referring Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we propose RefMask3D to explore the comprehensive multi-modal feature interaction and understanding. |
Shuting He; Henghui Ding; | mm | 2024-10-30 |
| 574 | ESNet: An Efficient Real-time Semantic Segmentation Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Efficient image segmentation algorithms are critical in computer vision, as they maintain high processing speeds while handling large amounts of data and providing practical … |
Renping Xie; Cong He; Ming Tao; Kai Ding; | 2024 IEEE International Symposium on Parallel and … | 2024-10-30 |
| 575 | Crossmodal Few-shot 3D Point Cloud Semantic Segmentation Via View Synthesis Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, previous methods use single-view point cloud generation algorithms to bridge the gap between 2D images and 3D point clouds, leaving the incomplete geometry of an object or scene due to occlusions. To address this issue, we propose a novel view synthesis cross-modal few-shot point cloud semantic segmentation network. |
Ziyu Zhao; Pingping Cai; Canyu Zhang; Xiaoguang Li; Song Wang; | mm | 2024-10-30 |
| 576 | Semantic Segmentation of River Video for Smart River Monitoring System Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In this work, a high-efficiency semantic segmentation method is proposed for a smart river monitoring surveillance system. The proposed model is trained by an original river … |
Haruki Inoue; Takafumi Katayama; Tian Song; T. Shimamoto; | 2024 IEEE 13th Global Conference on Consumer Electronics … | 2024-10-29 |
| 577 | Text2Seg: Zero-shot Remote Sensing Image Semantic Segmentation Via Text-Guided Visual Foundation Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
JIELU ZHANG et. al. | GeoAI@SIGSPATIAL | 2024-10-29 |
| 578 | LDCNet: Long-Distance Context Modeling for Large-Scale 3D Point Cloud Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Shoutong Luo; Zhengxing Sun; Yi Wang; Yunhan Sun; Chendi Zhu; | ACM Multimedia | 2024-10-28 |
| 579 | Automatic Semantic Segmentation and Classification of Remote Sensing Image Data for Flood Detection Using Novel LSTM Neural Network Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Amruta Sonavale; Midhun Chakkaravarthy; Surampudi Srinivasa Rao; H. B. M. Salleh; Jagannath Jadhav; | SN Comput. Sci. | 2024-10-28 |
| 580 | Semantic-Enhanced Point-Box Joint Prompting for Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Quan Zhao; Siying Wu; Yueyi Zhang; Xiaoyan Sun; | International Conference on Information Photonics | 2024-10-27 |
| 581 | Every Component Counts: Rethinking The Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We present Connected-Component~(CC)-Metrics, a novel semantic segmentation evaluation protocol, targeted to align existing semantic segmentation metrics to a multi-instance detection scenario in which each connected component matters. |
ALEXANDER JAUS et. al. | arxiv-cs.CV | 2024-10-24 |
| 582 | Surgical Scene Segmentation By Transformer With Asymmetric Feature Enhancement Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Secondly, the specific characteristics of anatomy and instruments are not specifically modeled. To tackle the above challenges, we propose a novel Transformer-based framework with an Asymmetric Feature Enhancement module (TAFE), which enhances local information and then actively fuses the improved feature pyramid into the embeddings from transformer encoders by a multi-scale interaction attention strategy. |
Cheng Yuan; Yutong Ban; | arxiv-cs.CV | 2024-10-23 |
| 583 | Semantic Segmentation and Scene Reconstruction of RGB-D Image Frames: An End-to-End Modular Pipeline for Robotic Applications Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce a novel end-to-end modular pipeline that integrates state-of-the-art semantic segmentation, human tracking, point-cloud fusion, and scene reconstruction. |
ZHIWU ZHENG et. al. | arxiv-cs.CV | 2024-10-23 |
| 584 | Multi Kernel Estimation Based Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper presents a novel approach for multi-kernel estimation by enhancing the KernelGAN algorithm, which traditionally estimates a single kernel for the entire image. |
Haim Goldfisher; Asaf Yekutiel; | arxiv-cs.CV | 2024-10-22 |
| 585 | TICNet: Three-Branch Real-Time Semantic Segmentation Network with Intensive Compensation of Railway Track Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: With the rapid development of railway traffic system, real-time semantic segmentation plays a crucial role in railway track scene monitoring. However, most of the existing methods … |
Yiwen Bai; Lu Yang; Lei Zhang; Yajing Song; | 2024 5th International Conference on Machine Learning and … | 2024-10-18 |
| 586 | Stroke-Seg: A Deep Learning-based Framework for Chinese Stroke Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Chinese stroke segmentation is a crucial and challenging task for various downstream applications such as font generation, aesthetic evaluation etc. Conventional semantic … |
Xinyu Gong; Zeyang Bai; Haitao Nie; Bin Xie; | IET Image Process. | 2024-10-18 |
| 587 | Railway LiDAR Semantic Segmentation Based on Intelligent Semi-automated Data Annotation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Thus, we propose an approach for a point-wise 3D semantic segmentation based on the 2DPass network architecture using scans and images jointly. |
Florian Wulff; Bernd Schaeufele; Julian Pfeifer; Ilja Radusch; | arxiv-cs.CV | 2024-10-17 |
| 588 | SemSim: Revisiting Weak-to-Strong Consistency from A Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, two key limitations still persist, impeding its efficient adaptation: (1) the neglect of contextual dependencies results in inconsistent predictions for similar semantic features, leading to incomplete object segmentation; (2) the lack of exploitation of semantic similarity between labeled and unlabeled data induces considerable class-distribution discrepancy. To address these limitations, we propose a novel semi-supervised framework based on FixMatch, named SemSim, powered by two appealing designs from semantic similarity perspective: (1) rectifying pixel-wise prediction by reasoning about the intra-image pair-wise affinity map, thus integrating contextual dependencies explicitly into the final prediction; (2) bridging labeled and unlabeled data via a feature querying mechanism for compact class representation learning, which fully considers cross-image anatomical similarities. |
SHIAO XIE et. al. | arxiv-cs.CV | 2024-10-17 |
| 589 | Adaptive Prompt Learning with SAM for Few-shot Scanning Probe Microscope Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Code and dataset used in this study will be made available upon acceptance. |
YAO SHEN et. al. | arxiv-cs.CV | 2024-10-16 |
| 590 | RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Using our model and dataset, we propose RClicks benchmark for a comprehensive comparison of existing interactive segmentation methods on realistic clicks. |
ANTON ANTONOV et. al. | arxiv-cs.CV | 2024-10-15 |
| 591 | Real-Time Semantic Segmentation in Natural Environments with SAM-assisted Sim-to-Real Domain Transfer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation plays a pivotal role in many robotic applications requiring high-level scene understanding, such as smart farming, where the precise identification of trees … |
Han Wang; R. Mascaro; M. Chli; L. Teixeira; | 2024 IEEE/RSJ International Conference on Intelligent … | 2024-10-14 |
| 592 | Multi-View Graph Neural Network for Semantic Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic image segmentation is a fundamental task in computer vision, frequently addressed using deep learning techniques. Nevertheless, these methods often struggle to fully … |
Elie Karam; N. Jrad; Patty Coupeau; Jean-Baptiste Fasquel; Fahed Abdallah; | 2024 IEEE Thirteenth International Conference on Image … | 2024-10-14 |
| 593 | LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a remote-sensing image semantic segmentation network named LKASeg, which combines Large Kernel Attention(LSKA) and Full-Scale Skip Connections(FSC). |
XUEZHI XIANG et. al. | arxiv-cs.CV | 2024-10-14 |
| 594 | Weakly Scene Segmentation Using Efficient Transformer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Current methods for large-scale point cloud scene semantic segmentation rely on manually annotated dense point-wise labels, which are costly, labor-intensive, and prone to errors. … |
Hao Huang; Shuaihang Yuan; Congcong Wen; Yu Hao; Yi Fang; | 2024 IEEE/RSJ International Conference on Intelligent … | 2024-10-14 |
| 595 | An Object-Aware Network Embedding Deep Superpixel for Semantic Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation forms the foundation for understanding very high resolution (VHR) remote sensing images, with extensive demand and practical application value. The … |
ZIRAN YE et. al. | Remote. Sens. | 2024-10-13 |
| 596 | High-Precision Dichotomous Image Segmentation Via Probing Diffusion Capacity Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To this end, we propose DiffDIS, adiffusion-driven segmentation model that taps into the potential of thepre-trained U-Net within diffusion models, specifically designed forhigh-resolution, fine-grained object segmentation. |
QIAN YU et. al. | arxiv-cs.CV | 2024-10-13 |
| 597 | Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a method to distinguish in-distribution (ID) from OOD samples and quantify both epistemic and aleatoric uncertainties using the feature space of a single deterministic model. |
Hanieh Shojaei; Qianqian Zou; Max Mehltretter; | arxiv-cs.LG | 2024-10-11 |
| 598 | VideoSAM: Open-World Video Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we introduce VideoSAM, an end-to-end framework designed to address these challenges by improving object tracking and segmentation consistency in dynamic environments. |
PINXUE GUO et. al. | arxiv-cs.CV | 2024-10-11 |
| 599 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: These characteristics hinder the real-time semantic analysis, particularly on resource-constrained hardware architectures that constitute the main computational components of numerous robotic applications. Therefore, in this paper, we investigate various 3D semantic segmentation methodologies and analyze their performance and capabilities for resource-constrained inference on embedded NVIDIA Jetson platforms. |
Samir Abou Haidar; Alexandre Chariot; Mehdi Darouich; Cyril Joly; Jean-Emmanuel Deschaud; | arxiv-cs.RO | 2024-10-10 |
| 600 | Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We introduce a multi-stage approach using diffusion models to generate multi-class surgical datasets with annotations. |
Danush Kumar Venkatesh; Dominik Rivoir; Micha Pfeiffer; Fiona Kolbinger; Stefanie Speidel; | arxiv-cs.CV | 2024-10-10 |
| 601 | Shift and Matching Queries for Video Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a method to extend a query-based image segmentation model to video using feature shift and query matching. |
Tsubasa Mizuno; Toru Tamaki; | arxiv-cs.CV | 2024-10-10 |
| 602 | Evaluating The Impact of Point Cloud Colorization on Semantic Segmentation Accuracy Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a novel statistical approach to evaluate the impact of inaccurate RGB information on image-based point cloud segmentation. |
Qinfeng Zhu; Jiaze Cao; Yuanzhi Cai; Lei Fan; | arxiv-cs.CV | 2024-10-09 |
| 603 | Rethinking The Evaluation of Visible and Infrared Image Fusion Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper proposes a Segmentation-oriented Evaluation Approach (SEA) to assess VIF methods by incorporating the semantic segmentation task and leveraging segmentation labels available in latest VIF datasets. |
Dayan Guan; Yixuan Wu; Tianzhu Liu; Alex C. Kot; Yanfeng Gu; | arxiv-cs.CV | 2024-10-09 |
| 604 | Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, these models face challenges in dealing with intricate scenes, primarily due to the heterogeneity between RGB and thermal modalities. To address this gap, we present Open-RGBT, a novel open-vocabulary RGB-T semantic segmentation model. |
Meng Yu; Luojie Yang; Xunjie He; Yi Yang; Yufeng Yue; | arxiv-cs.CV | 2024-10-09 |
| 605 | Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we introduce *Scribbles for All*, a label and training data generation algorithm for semantic segmentation trained on scribble labels. |
Wolfgang Boettcher; Lukas Hoyer; Ozan Unal; Jan Eric Lenssen; Bernt Schiele; | nips | 2024-10-07 |
| 606 | Toward Real Ultra Image Segmentation: Leveraging Surrounding Context to Cultivate General Segmentation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing ultra image segmentation methods suffer from two major challenges, namely the generalization issue (i.e. they lack the stability and generality of standard segmentation models, as they are tailored to specific datasets), and the architectural issue (i.e. they are incompatible with real-world ultra image scenes, as they compromise between image size and computing resources). To tackle these issues, we revisit the classic sliding inference framework, upon which we propose a Surrounding Guided Segmentation framework (SGNet) for ultra image segmentation. |
Sai Wang; Yutian Lin; Yu Wu; Bo Du; | nips | 2024-10-07 |
| 607 | A Unified Framework for 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose UniSeg3D, a unified 3D segmentation framework that achieves panoptic, semantic, instance, interactive, referring, and open-vocabulary semantic segmentation tasks within a single model. |
WEI XU et. al. | nips | 2024-10-07 |
| 608 | Geometric Exploitation for Indoor Panoramic Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Unlike previous works, in this paper, we propose a novel approach for semantic segmentation of panoramic images. |
Duc Cao Dinh; Seok Kim; Kyusung Cho; | nips | 2024-10-07 |
| 609 | Zero-Shot Image Segmentation Via Recursive Normalized Cut on Diffusion Features Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we consider a diffusion UNet encoder as a foundation vision encoder and we introduce DiffCut, an unsupervised zero-shot segmentation method that solely harnesses the output features from the final self-attention block. |
Paul Couairon; Mustafa Shukor; Jean-Emmanuel HAUGEARD; Matthieu Cord; Nicolas THOME; | nips | 2024-10-07 |
| 610 | Unsupervised Hierarchy-Agnostic Segmentation: Parsing Semantic Image Structure Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce a novel algebraic methodology for unsupervised image segmentation. |
Simone Rossetti; fiora pirri; | nips | 2024-10-07 |
| 611 | Relationship Prompt Learning Is Enough for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Prompt learning offers a direct and parameter-efficient approach, yet it falls short in guiding VLM for pixel-level visual localization. Therefore, we propose relationship prompt module (RPM), which generates relationship prompt that directs VLM to extract pixel-level semantic embeddings suitable for OVSS. |
li Jiahao; Yanyun Qu; Yuan Xie; Yang Lu; | nips | 2024-10-07 |
| 612 | DeiSAM: Segment Anything with Deictic Prompting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, deep learning approaches cannot reliably interpret such deictic representations due to their lack of reasoning capabilities in complex scenarios. To remedy this issue, we propose DeiSAM — a combination of large pre-trained neural networks with differentiable logic reasoners — for deictic promptable segmentation. |
HIKARU SHINDO et. al. | nips | 2024-10-07 |
| 613 | Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose a novel method, MCLIP, to adapt the CLIP image encoder for pixel-level understanding by guiding the model on where, which is achieved using unlabeled images and masks generated from vision foundation models such as SAM and DINO. |
HEESEONG SHIN et. al. | nips | 2024-10-07 |
| 614 | One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We introduce VideoLISA, a video-based multimodal large language model designed to tackle the problem of language-instructed reasoning segmentation in videos. |
ZECHEN BAI et. al. | nips | 2024-10-07 |
| 615 | AdaptDiff: Cross-Modality Domain Adaptation Via Weak Conditional Semantic Diffusion for Retinal Vessel Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, despite its promise, deep learning has many challenges in practice due to its inability to effectively transition to unseen domains, caused by the inherent data distribution shift and the lack of manual annotations to guide domain adaptation. To tackle this problem, we present an unsupervised domain adaptation (UDA) method named AdaptDiff that enables a retinal vessel segmentation network trained on fundus photography (FP) to produce satisfactory results on unseen modalities (e.g., OCT-A) without any manual labels. |
DEWEI HU et. al. | arxiv-cs.CV | 2024-10-06 |
| 616 | Unleashing The Potential of The Diffusion Model in Few-shot Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Our initial focus lies in understanding how to facilitate interaction between the query image and the support image, resulting in the proposal of a KV fusion method within the self-attention framework. |
MUZHI ZHU et. al. | arxiv-cs.CV | 2024-10-03 |
| 617 | Annotated Dataset for Training Cloud Segmentation Neural Networks Using High-Resolution Satellite Remote Sensing Imagery Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The integration of satellite data with deep learning has revolutionized various tasks in remote sensing, including classification, object detection, and semantic segmentation. … |
Mingyuan He; Jie Zhang; Yang He; Xinjie Zuo; Zebin Gao; | Remote. Sens. | 2024-10-02 |
| 618 | MFH‐Net: A Hybrid CNN‐Transformer Network Based Multi‐Scale Fusion for Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In recent years, U‐Net and its variants have gained widespread use in medical image segmentation. One key aspect of U‐Net’s design is the skip connection, facilitating the … |
Ying Wang; Meng Zhang; Jian’an Liang; Meiyan Liang; | International Journal of Imaging Systems and Technology | 2024-10-02 |
| 619 | Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images Using SegFormer Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper evaluates the effectiveness and efficiency of SegFormer, a semantic segmentation framework, for the semantic segmentation of UAV images. |
Vlatko Spasev; Ivica Dimitrovski; Ivan Chorbev; Ivan Kitanovski; | arxiv-cs.CV | 2024-10-01 |
| 620 | Beyond Low-dimensional Features: Enhancing Semi-supervised Medical Image Semantic Segmentation with Advanced Consistency Learning Techniques Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Yujie Lu; Wenting Li; Zhongwei Cui; Yongjun Zhang; | Expert Syst. Appl. | 2024-10-01 |
| 621 | Multi-Bottleneck Progressive Propulsion Network for Medical Image Semantic Segmentation with Integrated Macro-micro Dual-stage Feature Enhancement and Refinement IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
YUEFEI WANG et. al. | Expert Syst. Appl. | 2024-10-01 |
| 622 | Superpixel-Guided Multi-Type Rail Segmentation Via Contextual Information Aggregation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Vision-based anomaly inspection plays a crucial role in the efficient maintenance of millions of kilometers of railway, with rail segmentation, a key step in such anomaly … |
XUEFENG NI et. al. | IEEE Transactions on Intelligent Transportation Systems | 2024-10-01 |
| 623 | CAFA: Cross-Modal Attentive Feature Alignment for Cross-Domain Urban Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Autonomous driving systems rely heavily on semantic segmentation models for accurate and safe decision-making. High segmentation performance in real-world urban scenes is crucial … |
Peng Liu; Yanqi Ge; Lixin Duan; Wen Li; Fengmao Lv; | IEEE Transactions on Industrial Informatics | 2024-10-01 |
| 624 | Deep Multimodal Fusion for Semantic Segmentation of Remote Sensing Earth Observation Data Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes a late fusion deep learning model (LF-DLM) for semantic segmentation that leverages the complementary strengths of both VHR aerial imagery and SITS. |
Ivica Dimitrovski; Vlatko Spasev; Ivan Kitanovski; | arxiv-cs.CV | 2024-10-01 |
| 625 | I-MedSAM: Implicit Medical Image Segmentation with Segment Anything IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose I-MedSAM, which leverages the benefits of both continuous representations and SAM, to obtain better cross-domain ability and accurate boundary delineation. |
XIAOBAO WEI et. al. | eccv | 2024-09-30 |
| 626 | Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Though adversarial erasing has prevailed in weakly supervised semantic segmentation to help activate integral object regions, existing approaches still suffer from the dilemma of under-activation and over-expansion due to the difficulty in determining when to stop erasing. In this paper, we propose a Knowledge Transfer with Simulated Inter-Image Erasing (KTSE) approach for weakly supervised semantic segmentation to alleviate the above problem. |
TAO CHEN et. al. | eccv | 2024-09-30 |
| 627 | Explore The Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Our study delves into the impact of CLIP’s [CLS] token on patch feature correlations, revealing a dominance of ”global” patches that hinders local feature discrimination. To overcome this, we propose CLIPtrase, a novel training-free semantic segmentation strategy that enhances local feature awareness through recalibrated self-correlation among patches. |
Tong Shao; Zhuotao Tian; Hang Zhao; Jingyong Su; | eccv | 2024-09-30 |
| 628 | Beyond Pixels: Semi-Supervised Semantic Segmentation with A Multi-scale Patch-based Multi-Label Classifier Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we show that an effective way to incorporate contextual information is through a patch-based classifier. |
Prantik Howlader; Srijan Das; Hieu Le; Dimitris Samaras; | eccv | 2024-09-30 |
| 629 | SegPoint: Segment Any Point Cloud Via Large Language Model IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose a model, called , that leverages the reasoning capabilities of a multi-modal Large Language Model (LLM) to produce point-wise segmentation masks across a diverse range of tasks: 1) 3D instruction segmentation, 2) 3D referring segmentation, 3) 3D semantic segmentation, and 4) 3D open-vocabulary semantic segmentation.To advance 3D instruction research, we introduce a new benchmark, , designed to evaluate segmentation performance from complex and implicit instructional texts, featuring point cloud-instruction pairs. |
Shuting He; Henghui Ding; Xudong Jiang; Bihan Wen; | eccv | 2024-09-30 |
| 630 | Dataset Enhancement with Instance-Level Augmentations IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We present a method for expanding a dataset by incorporating knowledge from the wide distribution of pre-trained latent diffusion models. |
Orest Kupyn; Christian Rupprecht; | eccv | 2024-09-30 |
| 631 | SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To adapt the VLM from global to local reasoning, we introduce a spatial fine-tuning strategy for label-efficient learning. |
Lukas Hoyer; David Joseph Tan; Muhammad Ferjad Naeem; Luc Van Gool; Federico Tombari; | eccv | 2024-09-30 |
| 632 | VISA: Reasoning Video Object Segmentation Via Large Language Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce a new task, Reasoning Video Object Segmentation (ReasonVOS). |
CILIN YAN et. al. | eccv | 2024-09-30 |
| 633 | Enriching Information and Preserving Semantic Congruence in Expanding Curvilinear Object Segmentation Datasets Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Curvilinear object segmentation plays a crucial role across various applications, yet datasets in this domain often suffer from small scale due to the high costs associated with data acquisition and annotation. To address these challenges, this paper introduces a novel approach for expanding curvilinear object segmentation datasets, focusing on enhancing the informativeness of generated data and the consistency between semantic maps and generated images. |
Qin Lei; Jiang Zhong; Qizhu Dai; | eccv | 2024-09-30 |
| 634 | Open-Vocabulary Camouflaged Object Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To fill in the gaps, we introduce a new task, open-vocabulary camouflaged object segmentation (OVCOS), and construct a large-scale complex scene dataset (OVCamo) containing 11,483 hand-selected images with fine annotations and corresponding object classes. |
Youwei Pang; Xiaoqi Zhao; JiaMing Zuo; Lihe Zhang; Huchuan Lu; | eccv | 2024-09-30 |
| 635 | From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global Aggregation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce a hierarchical transformer-based model designed for sophisticated image segmentation tasks, effectively bridging the granularity of part segmentation with the comprehensive scope of object segmentation. |
Yunfei Xie; Cihang Xie; Alan Yuille; Jieru Mei; | eccv | 2024-09-30 |
| 636 | View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we address the challenging task of lifting multi-granular and view-inconsistent image segmentations into a hierarchical and 3D-consistent representation. |
Haodi He; Colton Stearns; Adam Harley; Leonidas Guibas; | eccv | 2024-09-30 |
| 637 | Boosting Gaze Object Prediction Via Pixel-level Supervision from Vision Foundation Model Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper presents a more challenging gaze object segmentation (GOS) task, which involves inferring the pixel-level mask corresponding to the object captured by human gaze behavior. |
Yang Jin; Lei Zhang; Shi Yan; Bin Fan; Binglu Wang; | eccv | 2024-09-30 |
| 638 | Open Panoramic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To further enhance the distortion-aware modeling ability from the pinhole source domain, we propose a novel data augmentation method called Random Equirectangular Projection (RERP) which is specifically designed to address object deformations in advance. |
JUNWEI ZHENG et. al. | eccv | 2024-09-30 |
| 639 | Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation Without Manual Labels IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In contrast, recent 2D foundation models have demonstrated strong generalization and impressive zero-shot abilities, inspiring us to incorporate these characteristics from 2D models into 3D models. Therefore, we explore the use of image segmentation foundation models to automatically generate high-quality training labels for 3D segmentation models. |
RUI HUANG et. al. | eccv | 2024-09-30 |
| 640 | Open-Vocabulary RGB-Thermal Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Second, when fusing RGB and thermal images, they often need to design complex fusion network structures, which usually results in low network training efficiency. We present OpenRSS, the Open-vocabulary RGB-T Semantic Segmentation method, to solve these two disadvantages. |
GUOQIANG ZHAO et. al. | eccv | 2024-09-30 |
| 641 | 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose 3DSS-VLG, a weakly supervised approach for 3D Semantic Segmentation with 2D Vision-Language Guidance, an alternative approach that a 3D model predicts dense-embedding for each point which is co-embedded with both the aligned image and text spaces from the 2D vision-language model. |
XIAOXU XU et. al. | eccv | 2024-09-30 |
| 642 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To effectively embed high-dimensional features, we propose a double-nested autoencoder structure with a novel class-aware embedding objective to encode high-dimensional features into manageable voxel-wise embeddings. |
Li Li; Hubert P. H. Shum; Toby P Breckon; | eccv | 2024-09-30 |
| 643 | Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a Class-Agnostic Visio-Temporal Network (CAVT) for scene sketch semantic segmentation. |
Aleyna Kütük; Tevfik Metin Sezgin; | arxiv-cs.CV | 2024-09-30 |
| 644 | Occlusion-Aware Seamless Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Panoramic images can broaden the Field of View (FoV), occlusion-aware prediction can deepen the understanding of the scene, and domain adaptation can transfer across viewing domains. In this work, we introduce a novel task, Occlusion-Aware Seamless Segmentation (OASS), which simultaneously tackles all these three challenges. |
YIHONG CAO et. al. | eccv | 2024-09-30 |
| 645 | OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To address the task, we propose a plug-and-play approach termed OLAF. |
Pranav Gupta; Rishubh Singh; Pradeep Shenoy; Ravi Kiran Sarvadevabhatla; | eccv | 2024-09-30 |
| 646 | O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: For the purpose of preserving consistency in 3D object properties across different viewpoints, we propose a spatial adaptive voxel adjustment mechanism and a multi-view weight selection method. |
MUER TIE et. al. | eccv | 2024-09-30 |
| 647 | Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose several problem-specific novel attacks minimizing different metrics in accuracy and mIoU. |
Francesco Croce; Naman D. Singh; Matthias Hein; | eccv | 2024-09-30 |
| 648 | Placing Objects in Context Via Inpainting for Out-of-distribution Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose the Placing Objects in Context (POC) pipeline to realistically add any object into any image via diffusion models. |
Pau de Jorge Aranda; Riccardo Volpi; Puneet Dokania; Philip Torr; Gregory Rogez; | eccv | 2024-09-30 |
| 649 | Betrayed By Attention: A Simple Yet Effective Approach for Self-supervised Video Object Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose a simple yet effective approach for self-supervised video object segmentation (VOS). |
Shuangrui Ding; Rui Qian; Haohang Xu; Dahua Lin; Hongkai Xiong; | eccv | 2024-09-30 |
| 650 | Can Textual Semantics Mitigate Sounding Object Segmentation Preference? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Motivated by the the fact that text modality is well explored and contains rich abstract semantics, we propose leveraging text cues from the visual scene to enhance audio guidance with the semantics inherent in text. |
Yaoting Wang; Peiwen Sun; Yuanchao Li; Honggang Zhang; Di Hu; | eccv | 2024-09-30 |
| 651 | In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We present Lazy Visual Grounding for open-vocabulary semantic segmentation, which decouples unsupervised object mask discovery from object grounding. |
Dahyun Kang; Minsu Cho; | eccv | 2024-09-30 |
| 652 | SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We present , a new data generation approach that pushes the performance boundaries of state-of-the-art image segmentation models. |
HANRONG YE et. al. | eccv | 2024-09-30 |
| 653 | Segment and Recognize Anything at Any Granularity IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we introduce , an augmented image segmentation foundation for segmenting and recognizing anything at desired granularities. |
FENG LI et. al. | eccv | 2024-09-30 |
| 654 | One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We introduce VideoLISA, a video-based multimodal large language model designed to tackle the problem of language-instructed reasoning segmentation in videos. |
ZECHEN BAI et. al. | arxiv-cs.CV | 2024-09-29 |
| 655 | Gaussian Heritage: 3D Digitization of Cultural Heritage with Integrated Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The creation of digital replicas of physical objects has valuable applications for the preservation and dissemination of tangible cultural heritage. However, existing methods are … |
Mahtab Dahaghin; Myrna Castillo; Kourosh Riahidehkordi; M. Toso; A. D. Bue; | ArXiv | 2024-09-27 |
| 656 | Get It For Free: Radar Segmentation Without Expert Labels and Its Application in Odometry and Localization Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents a novel weakly supervised semantic segmentation method for radar segmentation, where the existing LiDAR semantic segmentation models are employed to generate semantic labels, which then serve as supervision signals for training a radar semantic segmentation model. |
Siru Li; Ziyang Hong; Yushuai Chen; Liang Hu; Jiahu Qin; | arxiv-cs.RO | 2024-09-26 |
| 657 | Global-Local Medical SAM Adaptor Based on Full Adaption Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, Med-SA still can be improved, as it fine-tunes SAM in a partial adaption manner. To resolve this problem, we present a novel global medical SAM adaptor (GMed-SA) with full adaption, which can adapt SAM globally. |
MENG WANG et. al. | arxiv-cs.AI | 2024-09-25 |
| 658 | Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce Go-SLAM, a novel framework that utilizes 3D Gaussian Splatting SLAM to reconstruct dynamic environments while embedding object-level information within the scene representations. |
Phu Pham; Dipam Patel; Damon Conover; Aniket Bera; | arxiv-cs.RO | 2024-09-25 |
| 659 | Potential Field As Scene Affordance for Behavior Change-Based Visual Risk Object Identification Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we compute potential fields by assigning different energy levels according to the semantic labels obtained from BEV semantic segmentation. |
Pang-Yuan Pao; Shu-Wei Lu; Ze-Yan Lu; Yi-Ting Chen; | arxiv-cs.CV | 2024-09-24 |
| 660 | Potential Fields As Scene Affordance for Behavior Change-Based Visual Risk Object Identification Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: We study behavior change-based visual risk object identification (Visual-ROI), a critical framework designed to detect potential hazards for intelligent driving systems. Existing … |
Pang-Yuan Pao; Shu-Wei Lu; Ze-Yan Lu; Yi-Ting Chen; | 2025 IEEE International Conference on Robotics and … | 2024-09-24 |
| 661 | The BRAVO Semantic Segmentation Challenge Results in UNCV2024 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose the unified BRAVO challenge to benchmark the reliability of semantic segmentation models under realistic perturbations and unknown out-of-distribution (OOD) scenarios. |
TUAN-HUNG VU et. al. | arxiv-cs.CV | 2024-09-23 |
| 662 | Diffusion-based RGB-D Semantic Segmentation with Deformable Attention Transformer Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we introduce adiffusion-based framework to address the RGB-D semantic segmentation problem.Additionally, we demonstrate that utilizing a Deformable Attention Transformeras the encoder to extract features from depth images effectively captures thecharacteristics of invalid regions in depth measurements. |
Minh Bui; Kostas Alexis; | arxiv-cs.CV | 2024-09-23 |
| 663 | ZeroSCD: Zero-Shot Street Scene Change Detection Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Traditional change detection methods rely on training models that take these image pairs as input and estimate the changes, which requires large amounts of annotated data, a costly and time-consuming process. To overcome this, we propose ZeroSCD, a zero-shot scene change detection framework that eliminates the need for training. |
Shyam Sundar Kannan; Byung-Cheol Min; | arxiv-cs.RO | 2024-09-23 |
| 664 | Infield Disease Detection in Citrus Plants: Integrating Semantic Segmentation and Dynamic Deep Learning Object Detection Model for Enhanced Agricultural Yield Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
N. Rani; Arun Sri Krishna; M. Sunag; M. A. Sangamesha; B. R. Pushpa; | Neural Comput. Appl. | 2024-09-21 |
| 665 | MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose MOSE, a neural field semantic reconstruction approach to lift inferred image-level noisy priors to 3D, producing accurate semantics and geometry in both 3D and 2D space. |
Zhenhua Du; Binbin Xu; Haoyu Zhang; Kai Huo; Shuaifeng Zhi; | arxiv-cs.CV | 2024-09-21 |
| 666 | CUS3D :CLIP-based Unsupervised 3D Segmentation Via Object-level Denoise Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, unlike previous research that ignores the “noise” raised during feature projection from 2D to 3D, we propose a novel distillation learning framework named CUS3D. |
Fuyang Yu; Runze Tian; Zhen Wang; Xiaochuan Wang; Xiaohui Liang; | arxiv-cs.CV | 2024-09-20 |
| 667 | A Bottom-Up Approach to Class-Agnostic Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we present a novel bottom-up formulation for addressing the class-agnostic segmentation problem. |
Sebastian Dille; Ari Blondal; Sylvain Paris; Yağız Aksoy; | arxiv-cs.CV | 2024-09-20 |
| 668 | Learning Scene Semantics From Vehicle-Centric Data for City-Scale Digital Twins Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The creation of digital twins of cityscapes requires the understanding the semantics of relevant objects encountered in the scene, with classes possibly not well covered in … |
HERMANN FÜRNTRATT et. al. | 2024 International Conference on Content-Based Multimedia … | 2024-09-18 |
| 669 | HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Even though some datasets exist, there is no standard benchmark available to systematically measure progress on this task and evaluate the benefit of hyperspectral data. In this paper, we work towards closing this gap by providing the HyperSpectral Semantic Segmentation benchmark (HS3-Bench). |
Nick Theisen; Robin Bartsch; Dietrich Paulus; Peer Neubert; | arxiv-cs.CV | 2024-09-17 |
| 670 | Fuse4Seg: Image-Level Fusion Based Multi-Modality Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We argue the current feature-level fusion strategy is prone to semantic inconsistencies and misalignments across various imaging modalities because it merges features at intermediate layers in a neural network without evaluative control. To mitigate this, we introduce a novel image-level fusion based multi-modality medical image segmentation method, Fuse4Seg, which is a bi-level learning framework designed to model the intertwined dependencies between medical image segmentation and medical image fusion. |
Yuchen Guo; Weifeng Su; | arxiv-cs.CV | 2024-09-16 |
| 671 | Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing methods struggle with this setting, particularly when evaluated on label spaces mixed from the individual training sets. To overcome these issues, we introduce a simple yet effective multi-dataset training approach by integrating language-based embeddings of class names and label space-specific query embeddings. |
Qilong Zhangli; Di Liu; Abhishek Aich; Dimitris Metaxas; Samuel Schulter; | arxiv-cs.CV | 2024-09-15 |
| 672 | Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents a 2D lidar semantic segmentation dataset to enhance the semantic scene understanding for mobile robots in different indoor robotics applications. |
Zhanteng Xie; Philip Dames; | arxiv-cs.RO | 2024-09-15 |
| 673 | Weakly Supervised Point Cloud Semantic Segmentation Based on Scene Consistency Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Yingchun Niu; Jianqin Yin; Chao Qi; Liang Geng; | Appl. Intell. | 2024-09-14 |
| 674 | Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose a method for interpretable semantic segmentation that leverages multi-scale image representation for prototypical part learning. |
Hugo Porta; Emanuele Dalsasso; Diego Marcos; Devis Tuia; | arxiv-cs.CV | 2024-09-14 |
| 675 | A Edge-Guided Satellite Image Semantic Segmentation Method for Real Estate Appraisal Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This paper proposes an edge-guided satellite image semantic segmentation method for real estate appraisal. The digitization of real estate appraisal has been significantly … |
Yinuo Cui; Yilin He; Fangyuan Zhu; | 2024 3rd International Conference on Artificial … | 2024-09-13 |
| 676 | Lightweight Semantic Segmentation Network for Remote Sensing Urban Scenes Based on Selective Kernel Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In the context of remote sensing images, semantic segmentation networks must possess robust global information extraction capabilities due to the presence of indistinct object … |
Youwen Fan; | 2024 3rd International Conference on Artificial … | 2024-09-13 |
| 677 | AFFSegNet: Adaptive Feature Fusion Segmentation Network for Microtumors and Multi-Organ Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We introduce an augmented multi-layer perceptron within the encoder to explicitly model long-range dependencies during feature extraction. |
FUCHEN ZHENG et. al. | arxiv-cs.CV | 2024-09-12 |
| 678 | UNIT: Unsupervised Online Instance Segmentation Through Time Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To that end, we leverage an instance segmentation backbone and propose a new training recipe that enables the online tracking of objects. |
Corentin Sautier; Gilles Puy; Alexandre Boulch; Renaud Marlet; Vincent Lepetit; | arxiv-cs.CV | 2024-09-12 |
| 679 | Segmentation By Factorization: Unsupervised Semantic Segmentation for Pathology By Factorizing Foundation Model Features Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce Segmentation by Factorization (F-SEG), an unsupervised segmentation method for pathology that generates segmentation masks from pre-trained deep learning models. |
Jacob Gildenblat; Ofir Hadar; | arxiv-cs.CV | 2024-09-09 |
| 680 | SGSeg: Enabling Text-free Inference in Language-guided Segmentation of Chest X-rays Via Self-guidance Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this study, we propose a self-guided segmentation framework (SGSeg) that leverages language guidance for training (multi-modal) while enabling text-free inference (uni-modal), which is the first that enables text-free inference in language-guided segmentation. |
Shuchang Ye; Mingyuan Meng; Mingjian Li; Dagan Feng; Jinman Kim; | arxiv-cs.CV | 2024-09-07 |
| 681 | ISeg: An Iterative Refinement-based Framework for Training-free Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To fully utilize self-attention map, we present a deep experimental analysis on iteratively refining cross-attention map with self-attention map, and propose an effective iterative refinement framework for training-free segmentation, named iSeg. |
Lin Sun; Jiale Cao; Jin Xie; Fahad Shahbaz Khan; Yanwei Pang; | arxiv-cs.CV | 2024-09-04 |
| 682 | Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Segment Anything Model (SAM) has demonstrated powerful zero-shot segmentation performance in natural scenes. |
Jialun Pei; Zhangjun Zhou; Tiantian Zhang; | arxiv-cs.CV | 2024-09-04 |
| 683 | AllWeatherNet:Unified Image Enhancement for Autonomous Driving Under Adverse Weather and Lowlight-conditions Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Existing methods have limited effectiveness in improving essential computer vision tasks, such as semantic segmentation, and often focus on only one specific condition, such as removing rain or translating nighttime images into daytime ones. To address these limitations, we propose a method to improve the visual quality and clarity degraded by such adverse conditions. |
CHENGHAO QIAN et. al. | arxiv-cs.CV | 2024-09-03 |
| 684 | Segmenting Object Affordances: Reproducibility and Sensitivity to Scale Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, experimental setups are often not reproducible, thus leading to unfair and inconsistent comparisons. In this work, we benchmark these methods under a reproducible setup on two single objects scenarios, tabletop without occlusions and hand-held containers, to facilitate future comparisons. |
Tommaso Apicella; Alessio Xompero; Paolo Gastaldo; Andrea Cavallaro; | arxiv-cs.CV | 2024-09-03 |
| 685 | Fast Semantic Segmentation of Ultra-High-Resolution Remote Sensing Images Via Score Map and Fast Transformer-Based Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: For ultra-high-resolution (UHR) image semantic segmentation, striking a balance between computational efficiency and storage space is a crucial research direction. This paper … |
Yihao Sun; Mingrui Wang; Xiaoyi Huang; Chengshu Xin; Yinan Sun; | Remote. Sens. | 2024-09-02 |
| 686 | Multiresolution Refinement Network for Semantic Segmentation in Internet of Things Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: With the large-scale deployment of the Internet of Things (IoT), the demand for real-time perception and environment understanding in road scenarios is becoming increasingly … |
Dakai Wang; Xiangyang Jiang; Shilong Li; Jianxin Ma; Miaohui Zhang; | IEEE Internet of Things Journal | 2024-09-01 |
| 687 | Transferring Multi-Modal Domain Knowledge to Uni-Modal Domain for Urban Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Synthetic data (i.e., source domain) have been widely adopted to improve the semantic segmentation performance for real-world images (i.e., target domain), since obtaining … |
PENG LIU et. al. | IEEE Transactions on Intelligent Transportation Systems | 2024-09-01 |
| 688 | Multi-source Domain Adaptation for Panoramic Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, these methods struggle to understand the panoramic structure using only real pinhole images and lack real-world scene perception with only synthetic panoramic images. Therefore, in this paper, we propose a new task, Multi-source Domain Adaptation for Panoramic Semantic Segmentation (MSDA4PASS), which leverages both real pinhole and synthetic panoramic images to improve segmentation on unlabeled real panoramic images. |
JING JIANG et. al. | arxiv-cs.CV | 2024-08-29 |
| 689 | DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose decoupling things/stuff queries according to their intrinsic properties for individual decoding and disentangling classification/segmentation to mitigate ambiguity. |
YU YANG et. al. | arxiv-cs.CV | 2024-08-28 |
| 690 | SPNet: Dual-Branch Network with Spatial Supplementary Information for Building and Water Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is primarily employed to generate accurate prediction labels for each pixel of the input image, and then classify the images according to the generated … |
WENYU ZHAO et. al. | Remote. Sens. | 2024-08-27 |
| 691 | MROVSeg: Breaking The Resolution Curse of Vision-Language Models in Open-Vocabulary Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: A typical solution is to employ additional image backbones for high-resolution inputs, but it also introduce significant computation overhead. Therefore, we propose MROVSeg, a multi-resolution training framework for open-vocabulary image segmentation with a single pretrained CLIP backbone, that uses sliding windows to slice the high-resolution input into uniform patches, each matching the input size of the well-trained image encoder. |
YUANBING ZHU et. al. | arxiv-cs.CV | 2024-08-27 |
| 692 | CLIP-SP: Vision-language Model with Adaptive Prompting for Scene Parsing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: We present a novel framework, CLIP-SP, and a novel adaptive prompt method to leverage pre-trained knowledge from CLIP for scene parsing. Our approach addresses the limitations of … |
JIAAO LI et. al. | Comput. Vis. Media | 2024-08-27 |
| 693 | Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: While model development and validation are primarily conducted on idealistic scenes, geometric domain shifts, such as occlusions of the situs, are common in real-world open surgeries. To close this gap, we (1) present the first analysis of state-of-the-art (SOA) semantic segmentation models when faced with geometric out-of-distribution (OOD) data, and (2) propose an augmentation technique called Organ Transplantation, to enhance generalizability. |
SILVIA SEIDLITZ et. al. | arxiv-cs.CV | 2024-08-27 |
| 694 | Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: Robust semantic segmentation of intraoperative image data holds promise for enabling automatic surgical scene understanding and autonomous robotic surgery. While model development … |
Boqi Chen; Kevin Thandiackal; Pushpak Pati; O. Goksel; | ArXiv | 2024-08-27 |
| 695 | ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we leverage image complexity as a prior for refining segmentation features to achieve accurate real-time semantic segmentation. |
Xin Zhang; Teodor Boyadzhiev; Jinglei Shi; Jufeng Yang; | arxiv-cs.CV | 2024-08-25 |
| 696 | Accuracy Improvement of Cell Image Segmentation Using Feedback Former Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This tendency leads to a lack of detailed information for segmentation. Therefore, to supplement or reinforce the missing detailed information, we hypothesized that feedback processing in the human visual cortex should be effective. |
Hinako Mitsuoka; Kazuhiro Hotta; | arxiv-cs.CV | 2024-08-23 |
| 697 | Image Segmentation in Foundation Model Era: A Survey IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We investigate two basic lines of research — generic image segmentation (i.e., semantic segmentation, instance segmentation, panoptic segmentation), and promptable image segmentation (i.e., interactive segmentation, referring segmentation, few-shot segmentation) — by delineating their respective task settings, background concepts, and key challenges. |
TIANFEI ZHOU et. al. | arxiv-cs.CV | 2024-08-23 |
| 698 | SIn-NeRF2NeRF: Editing 3D Scenes with Instructions Through Segmentation and Inpainting Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: TL;DR Perform 3D object editing selectively by disentangling it from the background scene. Instruct-NeRF2NeRF (in2n) is a promising method that enables editing of 3D scenes … |
Jiseung Hong; Changmin Lee; Gyusang Yu; | ArXiv | 2024-08-23 |
| 699 | The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: During testing, while these models can effectively process information over short time steps, they struggle to maintain consistent perception over prolonged time sequences, leading to inconsistencies in the resulting semantic segmentation masks. To address this challenge, we take a step further in this work by leveraging the tracking capabilities of the newly introduced Segment Anything Model version 2 (SAM-v2) to enhance the temporal consistency of the referring object segmentation model. |
Tuyen Tran; | arxiv-cs.CV | 2024-08-22 |
| 700 | Improved Semi-Supervised Attention GAN for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is one of the cornerstone problems in computer vision that involves assigning each image pixel to a specific semantic class. Traditional supervised learning … |
Nusrat Jahan; Thangarajah Akilan; Thanh Minh Nguyen; | 2024 IEEE Pacific Rim Conference on Communications, … | 2024-08-21 |
| 701 | Rethinking Video Segmentation with Masked Video Consistency: Did The Model Learn As Intended? Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This leads to inconsistent segmentation results across frames. To address these issues, we propose a training strategy Masked Video Consistency, which enhances spatial and temporal feature aggregation. |
Chen Liang; Qiang Guo; Xiaochao Qu; Luoqi Liu; Ting Liu; | arxiv-cs.CV | 2024-08-20 |
| 702 | 3D-Aware Instance Segmentation and Tracking in Egocentric Videos Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Egocentric videos present unique challenges for 3D scene understanding due to rapid camera motion, frequent object occlusions, and limited object visibility. This paper introduces a novel approach to instance segmentation and tracking in first-person video that leverages 3D awareness to overcome these obstacles. |
YASH BHALGAT et. al. | arxiv-cs.CV | 2024-08-19 |
| 703 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce OVOSE, the first Open-Vocabulary Semantic Segmentation algorithm for Event cameras. |
Muhammad Rameez Ur Rahman; Jhony H. Giraldo; Indro Spinelli; Stéphane Lathuilière; Fabio Galasso; | arxiv-cs.CV | 2024-08-18 |
| 704 | Depth-guided Texture Diffusion for Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we introduce a Depth-guided Texture Diffusion approach that effectively tackles the outlined challenge. |
Wei Sun; Yuan Li; Qixiang Ye; Jianbin Jiao; Yanzhao Zhou; | arxiv-cs.CV | 2024-08-17 |
| 705 | Tuning A SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, a Multi-Cognitive SAM-Based Instance Segmentation Model (MC-SAM SEG) is introduced to employ SAM on remote sensing domain. |
Linghao Zheng; Xinyang Pu; Feng Xu; | arxiv-cs.CV | 2024-08-16 |
| 706 | InstSynth: Instance-wise Prompt-guided Style Masked Conditional Data Synthesis for Scene Understanding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Scene understanding at the instance level is an essential task in computer vision to support modern Advanced Driver Assistance Systems. Solutions have been proposed with abundant … |
THANH-DANH NGUYEN et. al. | 2024 International Conference on Multimedia Analysis and … | 2024-08-15 |
| 707 | HEFANet: Hierarchical Efficient Fusion and Aggregation Segmentation Network for Enhanced Rgb-thermal Urban Scene Parsing Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
ZHENGWEN SHEN et. al. | Appl. Intell. | 2024-08-14 |
| 708 | MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose a powerful semantic segmentation network, MetaSeg, which leverages the Metaformer architecture from the backbone to the decoder. |
Beoungwoo Kang; Seunghun Moon; Yubin Cho; Hyunwoo Yu; Suk-Ju Kang; | arxiv-cs.CV | 2024-08-14 |
| 709 | Enhancing Autonomous Vehicle Perception in Adverse Weather Through Image Augmentation During Semantic Segmentation Training Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We trained encoder-decoder UNet models to perform semantic segmentation. |
Ethan Kou; Noah Curran; | arxiv-cs.CV | 2024-08-13 |
| 710 | Domain-adapted Polyp Image Semantic Segmentation Utilizing A Generative Adversarial Network and U-Net Framework Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Medical image segmentation is a crucial area in medical image processing, aiming to help doctors accurately identify and segment different structures and tissues in images, … |
Shizhao Ma; Yunhai Gao; Shuquan Feng; Lin Li; Mengyuan Ma; | Proceedings of the 2024 5th International Symposium on … | 2024-08-13 |
| 711 | MacFormer: Semantic Segmentation with Fine Object Boundaries Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: While Vision Transformer-based models have made significant progress, current semantic segmentation methods often struggle with precise predictions in localized areas like object boundaries. To tackle this challenge, we introduce a new semantic segmentation architecture, “MacFormer”, which features two key components. |
GUOAN XU et. al. | arxiv-cs.CV | 2024-08-11 |
| 712 | TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose an integrated real-time framework that combines online tracking-based moving object segmentation with static map building. |
SEOYEON JANG et. al. | arxiv-cs.RO | 2024-08-10 |
| 713 | MEDANet: More Efficient Dual Attention Network for Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Ouyang Pan; Xiaoguo Yao; Zhijian Huang; | J. Circuits Syst. Comput. | 2024-08-09 |
| 714 | Embodied Uncertainty-Aware Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To deal with uncertainty in robot perception, we propose a method for generating a hypothesis distribution of object segmentation. |
Xiaolin Fang; Leslie Pack Kaelbling; Tomás Lozano-Pérez; | arxiv-cs.RO | 2024-08-08 |
| 715 | A Multiscale Feature Fusion‐guided Lightweight Semantic Segmentation Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation, a task of assigning class labels to each pixel in an image, has found applications in various real‐world scenarios, including autonomous driving and scene … |
Xin Ye; Junchen Pan; Jichen Chen; Jingbo Zhang; | Journal of Field Robotics | 2024-08-08 |
| 716 | SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, there are certain challenges that hinder the deployment of AI models in-the-wild scenarios, i.e., inefficient use of unlabeled data, lack of incorporation of human expertise, and lack of interpretation of the results. To mitigate these challenges, we propose a novel Explainable Active Learning (XAL) model, XAL-based semantic segmentation model SegXAL, that can (i) effectively utilize the unlabeled data, (ii) facilitate the Human-in-the-loop paradigm, and (iii) augment the model decisions in an interpretable way. |
Sriram Mandalika; Athira Nambiar; | arxiv-cs.CV | 2024-08-08 |
| 717 | Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussians Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This paper introduces a novel method for open-vocabulary 3D scene querying in autonomous driving by combining Language Embedded 3D Gaussians with Large Language Models (LLMs). We … |
Amirhosein Chahe; Lifeng Zhou; | 2025 IEEE/CVF Winter Conference on Applications of Computer … | 2024-08-07 |
| 718 | Improving Pavement Crack Segmentation Using Attention Mechanism and Self-gated Activation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Image segmentation is crucial in various applications, from autonomous driving, agriculture, and manufacturing to medical imaging and satellite imaging. It helps a computer … |
Nusrat Jahan; T. Akilan; Tharrengini Suresh; | 2024 IEEE Canadian Conference on Electrical and Computer … | 2024-08-06 |
| 719 | Biomedical SAM 2: Segment Anything in Biomedical Images and Videos IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To explore the performance of SAM-2 in biomedical applications, we designed three evaluation pipelines for single-frame 2D image segmentation, multi-frame 3D image segmentation and multi-frame video segmentation with varied prompt designs, revealing SAM-2’s limitations in medical contexts. |
ZHILING YAN et. al. | arxiv-cs.CV | 2024-08-06 |
| 720 | Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper introduces a novel method for open-vocabulary 3D scene querying in autonomous driving by combining Language Embedded 3D Gaussians with Large Language Models (LLMs). |
Amirhosein Chahe; Lifeng Zhou; | arxiv-cs.CV | 2024-08-06 |
| 721 | Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we argue that the distribution discrepancy between the discriminative and the non-discriminative parts of objects prevents the model from producing complete and precise pseudo masks as ground truths. |
Ye Du; Zehua Fu; Qingjie Liu; | arxiv-cs.CV | 2024-08-04 |
| 722 | PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Traditional segmentation algorithms falter as they cannot accurately mimic the complexity of UAV perspectives, and the cost of obtaining multi-perspective labeled datasets is prohibitive. To address these issues, we introduce the PPTFormer, a novel Pseudo Multi-Perspective Transformer network that revolutionizes UAV image segmentation. |
Deyi Ji; Wenwei Jin; Hongtao Lu; Feng Zhao; | ijcai | 2024-08-03 |
| 723 | Aggregation and Purification: Dual Enhancement Network for Point Cloud Few-shot Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we design a novel Dual Enhancement Network (DENet) to comprehensively tackle different kinds of scene discrepancies in a coherent and synergistic framework. |
GUOXIN XIONG et. al. | ijcai | 2024-08-03 |
| 724 | Remote Sensing Image Semantic Segmentation Based on Cascaded Transformer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: High-resolution (HR) remote sensing image semantic segmentation plays an important role in Earth’s surface. Despite remote sensing image semantic segmentation methods have … |
Falin Wang; Jian Ji; Yuan Wang; | IEEE Transactions on Artificial Intelligence | 2024-08-01 |
| 725 | Prompt Learning for Light Field Semantic Segmentation in The Consumer-Centric Internet of Intelligent Computing Things Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Light field semantic segmentation accurately identifies the semantic information of the scene, providing solutions for various intelligent computing tasks in consumer electronics … |
CHEN JIA et. al. | IEEE Transactions on Consumer Electronics | 2024-08-01 |
| 726 | Attention Mechanism and Out-of-Distribution Data on Cross Language Image Matching for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Chi-Chia Sun; Jing-Ming Guo; Chen-Hung Chung; Bo-Yu Chen; | IEEE Transactions on Cognitive and Developmental Systems | 2024-08-01 |
| 727 | Efficient Dual-Stream Fusion Network for Real-Time Railway Scene Understanding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Railway scene understanding is key to autonomous train operation and important in active train perception. However, most railway scene understanding methods focus on track … |
ZHIWEI CAO et. al. | IEEE Transactions on Intelligent Transportation Systems | 2024-08-01 |
| 728 | Multi-unit Stacked Architecture: An Urban Scene Segmentation Network Based on UNet and ShuffleNetv2 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Dian Liu; Jianchao Du; Chuhan Li; Chenglong Yu; Mingjin Zhang; | Appl. Soft Comput. | 2024-08-01 |
| 729 | Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2 IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce Medical SAM 2 (MedSAM-2), a generalized auto-tracking model for universal 2D and 3D medical image segmentation. |
Jiayuan Zhu; Abdullah Hamdi; Yunli Qi; Yueming Jin; Junde Wu; | arxiv-cs.CV | 2024-08-01 |
| 730 | Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Although recent vision foundational models, such as the medical segment anything model (MedSAM), have made significant advancements in bounding-box-prompted segmentation, it is not straightforward to utilize point annotation, and is prone to semantic ambiguity. In this preliminary study, we introduce an iterative framework to facilitate semantic-aware point-supervised MedSAM. |
Xiaofeng Liu; Jonghye Woo; Chao Ma; Jinsong Ouyang; Georges El Fakhri; | arxiv-cs.CV | 2024-08-01 |
| 731 | MaskUno: Switch-Split Block For Enhancing Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In all the proposed variations to date, the problem of competing kernels (each class aims to maximize its own accuracy) persists when models try to synchronously learn numerous classes. In this paper, we propose mitigating this problem by replacing mask prediction with a Switch-Split block that processes refined ROIs, classifies them, and assigns them to specialized mask predictors. |
Jawad Haidar; Marc Mouawad; Imad Elhajj; Daniel Asmar; | arxiv-cs.CV | 2024-07-31 |
| 732 | MVG-Net: LiDAR Point Cloud Semantic Segmentation Network Integrating Multi-View Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Deep learning techniques are increasingly applied to point cloud semantic segmentation, where single-modal point cloud often suffers from accuracy-limiting confusion phenomena. … |
Yongchang Liu; Yawen Liu; Yansong Duan; | Remote. Sens. | 2024-07-31 |
| 733 | Leveraging Adaptive Implicit Representation Mapping for Ultra High-Resolution Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Secondly, SIRMF is shared across all samples, which limits its ability to generalize and handle diverse inputs. To address these limitations, we propose a novel approach that leverages the newly proposed Adaptive Implicit Representation Mapping (AIRM) for ultra-high-resolution Image Segmentation. |
Ziyu Zhao; Xiaoguang Li; Pingping Cai; Canyu Zhang; Song Wang; | arxiv-cs.CV | 2024-07-30 |
| 734 | Fine-grained Metrics for Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Because of this, the majority of categories and large objects are favored in the existing evaluation metrics. This paper suggests fine-grained mIoU and mAcc for a more thorough assessment of point cloud segmentation algorithms in order to address these issues. |
Zhuheng Lu; Ting Wu; Yuewei Dai; Weiqing Li; Zhiyong Su; | arxiv-cs.CV | 2024-07-30 |
| 735 | Learning Ordinality in Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: While existing deep learning approaches achieve high accuracy, they often overlook the ordinal relationships between classes, which can provide critical domain knowledge (e.g., the pupil lies within the iris, and lane markings are part of the road). This paper introduces novel methods for spatial ordinal segmentation that explicitly incorporate these inter-class dependencies. |
Ricardo P. M. Cruz; Rafael Cristino; Jaime S. Cardoso; | arxiv-cs.CV | 2024-07-30 |
| 736 | ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: The recent Segment Anything Model (SAM) reveals the capability to segment objects following prompts, but the manual annotations for prompts are impractical during the surgery. To address these limitations in operating rooms, we propose an audio-driven surgical instrument segmentation framework, named ASI-Seg, to accurately segment the required surgical instruments by parsing the audio commands of surgeons. |
ZHEN CHEN et. al. | arxiv-cs.CV | 2024-07-28 |
| 737 | SMPISD-MTPNet: Scene Semantic Prior-Assisted Infrared Ship Detection Using Multi-Task Perception Networks Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: For the training process, we introduce the Soft Fine-tuning training strategy to suppress the distortion caused by data augmentation. |
CHEN HU et. al. | arxiv-cs.CV | 2024-07-25 |
| 738 | Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This study investigates the effectiveness of modern Deformable Convolutional Neural Networks (DCNNs) for semantic segmentation tasks, particularly in autonomous driving scenarios with fisheye images. |
ANAM MANZOOR et. al. | arxiv-cs.CV | 2024-07-23 |
| 739 | Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation Through Hybrid Vision Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper introduces a novel approach to 3D semantic segmentation, distinguished by incorporating a hybrid blend of 2D and 3D computer vision techniques, enabling a streamlined, efficient process. |
Aditya Krishnan; Jayneel Vora; Prasant Mohapatra; | arxiv-cs.CV | 2024-07-22 |
| 740 | Disentangling Spatio-temporal Knowledge for Weakly Supervised Object Detection and Segmentation in Surgical Video Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper introduces Video Spatio-Temporal Disentanglement Networks (VDST-Net), a framework to disentangle spatiotemporal information using semi-decoupled knowledge distillation to predict high-quality class activation maps (CAMs). |
Guiqiu Liao; Matjaz Jogan; Sai Koushik; Eric Eaton; Daniel A. Hashimoto; | arxiv-cs.CV | 2024-07-22 |
| 741 | GaussianBeV: 3D Gaussian Representation Meets Perception Models for BeV Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose GaussianBeV, a novel method for transforming image features to BeV by finely representing the scene using a set of 3D gaussians located and oriented in 3D space. |
Florian Chabot; Nicolas Granger; Guillaume Lapouge; | arxiv-cs.CV | 2024-07-19 |
| 742 | Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We aim to harness their capabilities for breast lesion segmentation in a panoptic setting, which encompasses both semantic and instance-level predictions. |
Kun Zhao; Jakub Prokop; Javier Montalt Tordera; Sadegh Mohammadi; | arxiv-cs.CV | 2024-07-19 |
| 743 | ViLLa: Video Reasoning Segmentation with Large Language Model IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To bridge the gap between image and video, in this work, we propose a new video segmentation task – video reasoning segmentation. |
RONGKUN ZHENG et. al. | arxiv-cs.CV | 2024-07-18 |
| 744 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-Eye-View Vehicle Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Bird’s-eye-view (BEV) semantic segmentation is becoming crucial in autonomous driving systems. It realizes ego-vehicle surrounding environment perception by projecting 2D … |
JIAN SUN et. al. | IEEE Transactions on Intelligent Transportation Systems | 2024-07-18 |
| 745 | MeshSegmenter: Zero-Shot Mesh Semantic Segmentation Via Texture Synthesis IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We present MeshSegmenter, a simple yet effective framework designed for zero-shot 3D semantic segmentation. |
ZIMING ZHONG et. al. | arxiv-cs.CV | 2024-07-18 |
| 746 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, there are still two issues: 1) a lack of effective understanding and enhancement of BEV space features, particularly in accurately capturing long-distance environmental features and 2) recognizing fine details of target objects. To address these issues, we propose OE-BevSeg, an end-to-end multimodal framework that enhances BEV segmentation performance through global environment-aware perception and local target object enhancement. |
JIAN SUN et. al. | arxiv-cs.CV | 2024-07-17 |
| 747 | FoodMem: Near Real-time and Precise Food Video Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We present FoodMem, a novel framework designed to segment food items from video sequences of 360-degree unbounded scenes. |
Ahmad AlMughrabi; Adrián Galán; Ricardo Marques; Petia Radeva; | arxiv-cs.CV | 2024-07-16 |
| 748 | Open-set Hierarchical Semantic Segmentation for 3D Scene Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The Segment-Anything Model (SAM) shows exceptional zero-shot capabilities for 2D images. Developing a similar model for 3D, however, is challenging due to limited datasets. In … |
DIWEN WAN et. al. | 2024 IEEE International Conference on Multimedia and Expo … | 2024-07-15 |
| 749 | TrafficScene: A Multi-modal Dataset Including Light Field for Semantic Segmentation of Traffic Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: High-quality annotated data is crucial in semantic segmentation. However, existing datasets either provide single view images or offer small baseline multi-angle light field … |
Jie Luo; Xin Jin; Mingyu Liu; Yihui Fan; | 2024 IEEE International Conference on Multimedia and Expo … | 2024-07-15 |
| 750 | VISA: Reasoning Video Object Segmentation Via Large Language Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce a new task, Reasoning Video Object Segmentation (ReasonVOS). |
CILIN YAN et. al. | arxiv-cs.CV | 2024-07-15 |
| 751 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To effectively embed high-dimensional RAPiD features, we propose a double-nested autoencoder structure with a novel class-aware embedding objective to encode high-dimensional features into manageable voxel-wise embeddings. |
Li Li; Hubert P. H. Shum; Toby P. Breckon; | arxiv-cs.CV | 2024-07-14 |
| 752 | FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Existing deep learning approaches leave out the semantic cues that are crucial in semantic segmentation present in complex scenarios including cluttered backgrounds and translucent objects, etc. To handle these challenges, we propose a feature amplification network (FANet) as a backbone network that incorporates semantic information using a novel feature enhancement module at multi-stages. |
Muhammad Ali; Mamoona Javaid; Mubashir Noman; Mustansar Fiaz; Salman Khan; | arxiv-cs.CV | 2024-07-12 |
| 753 | Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Curvilinear object segmentation plays a crucial role across various applications, yet datasets in this domain often suffer from small scale due to the high costs associated with data acquisition and annotation. To address these challenges, this paper introduces a novel approach for expanding curvilinear object segmentation datasets, focusing on enhancing the informativeness of generated data and the consistency between semantic maps and generated images. |
Qin Lei; Jiang Zhong; Qizhu Dai; | arxiv-cs.CV | 2024-07-11 |
| 754 | CycleSAM: Few-Shot Surgical Scene Segmentation with Cycle- and Scene-Consistent Feature Matching Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Recent approaches extend SAMto automatic segmentation by using a few labeled reference images to predictpoint prompts; however, they rely on feature matching pipelines that lackrobustness to out-of-domain data like surgical images. To tackle this problem,we introduce CycleSAM, an improved visual prompt learning approach that employsa data-efficient training phase and enforces a series of soft constraints toproduce high-quality feature similarity maps. |
ADITYA MURALI et. al. | arxiv-cs.CV | 2024-07-09 |
| 755 | An Uncertainty-aware Domain Adaptive Semantic Segmentation Framework Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is significant to realize the scene understanding of autonomous driving. Due to the lack of annotated real-world data, the technology of domain adaptation is … |
Huilin Yin; Pengyu Wang; Boyu Liu; Jun Yan; | Auton. Intell. Syst. | 2024-07-08 |
| 756 | LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset Based on Muti-sensor for Autonomous Exploration Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Most of the existing lunar datasets are targeted at a single task, lacking diverse scenes and high-precision ground truth labels. To address this issue, we propose a multi-task, multi-scene, and multi-label lunar benchmark dataset LuSNAR. |
JIAYI LIU et. al. | arxiv-cs.CV | 2024-07-08 |
| 757 | Submodular Video Object Proposal Selection for Semantic Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes to achieve semantic video object segmentation by learning a data-driven representation which captures the synergy of multiple instances from continuous frames. |
Tinghuai Wang; | arxiv-cs.CV | 2024-07-08 |
| 758 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To this end, we propose a complementarity-aware deep learning approach for RGB-D-based material classification built on top of an object-oriented pipeline. |
Siva Krishna Ravipati; Ehsan Latif; Ramviyas Parasuraman; Suchendra M. Bhandarkar; | arxiv-cs.RO | 2024-07-08 |
| 759 | RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose RHRSegNet, implementing a relighting model over a High-Resolution Network for semantic segmentation. |
Sarah Elmahdy; Rodaina Hebishy; Ali Hamdi; | arxiv-cs.CV | 2024-07-08 |
| 760 | Multi-Granularity Feature Fusion For Point Cloud Semantic Segmentation Under Urban Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Point cloud semantic segmentation plays a key role in scene understanding and digital twin cities tasks. This article proposed a multi-granularity feature fusion network (MGF-Net) … |
HUCHEN LI et. al. | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
| 761 | Wetland Segmentation Method for UAV Multispectral Remote Sensing Images Based on SegFormer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In this study, an end-to-end semantic segmentation method (ConvSegFormer) is proposed by utilizing the multispectral imaging capability of UAVs for images containing multispectral … |
Pakezhamu Nuradili; Ji Zhou; F. Melgani; | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
| 762 | High-Order Transformer Semantic Segmentation Network for High-Resolution Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of high-resolution remote sensing (HRRS) images is an important task in the field of remote sensing image analysis. However, the presence of a large number … |
YIJIE ZHANG et. al. | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
| 763 | Prototype-Guided Structural Learning from Visual Foundation Model for Few-Shot Aerial Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Few-shot aerial image semantic segmentation aims to segment query images with few annotated support samples. It is challenging due to intra-class variations and complex object … |
Qixiong Wang; Hongxiang Jiang; Jiaqi Feng; Guangyun Zhang; Jihao Yin; | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
| 764 | Self-supervised Learning Via Cluster Distance Prediction for Operating Room Context Awareness Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a new 3D self-supervised task for OR scene understanding utilizing OR scene images captured with ToF cameras. |
Idris Hamoud; Alexandros Karargyris; Aidean Sharghi; Omid Mohareri; Nicolas Padoy; | arxiv-cs.CV | 2024-07-07 |
| 765 | Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Aside from offering state-of-the-art performance in medical image generation, denoising diffusion probabilistic models (DPM) can also serve as a representation learner to capture … |
Chun-Mei Feng; | International Conference on Medical Image Computing and … | 2024-07-07 |
| 766 | A Semantic Segmentation Method for SAR Image with Assistance of Self-Supervised Scene Classification Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Unlike natural images, synthetic aperture radar (SAR) images exhibit a more scattered and uneven spatial distribution of objects, making semantic segmentation of SAR images a … |
Yang Cheng; Chen Li; Zenghui Zhang; Wenxian Yu; | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
| 767 | Sea Ice Semantic Segmentation with Sentinel-2 Data Based on Adaptive Sample Training on U-Net Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The rapid melting of Arctic sea ice presents significant opportunities and challenges for humanity. The formation of numerous channels between the ice offers potential for Arctic … |
Z. Yin; Yuqi Tang; Miao Yu; F. Bovolo; | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
| 768 | CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To make up for the shortcomings of existing methods, we propose a novel method called CaRe-Ego that achieves state-of-the-art performance by emphasizing the contact between hands and objects from two aspects. |
Yuejiao Su; Yi Wang; Lap-Pui Chau; | arxiv-cs.CV | 2024-07-07 |
| 769 | Knowledge-Enhancement Module for RGB-T Semantic Segmentation in Remote Sensing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In accomplishing the task of semantic segmentation of RGB-T remote sensing images, there is a great challenge due to severe occlusion, long-tailed data distribution, and … |
QINGWANG WANG et. al. | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
| 770 | LMSeg: A Deep Graph Message-passing Network for Efficient and Accurate Semantic Segmentation of Large-scale 3D Landscape Meshes Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper presents an end-to-end deep graph message-passing network, LMSeg, designed to efficiently and accurately perform semantic segmentation on large-scale 3D landscape meshes. |
Zexian Huang; Kourosh Khoshelham; Gunditj Mirring Traditional Owners Corporation; Martin Tomko; | arxiv-cs.CV | 2024-07-05 |
| 771 | Attention Normalization Impacts Cardinality Generalization in Slot Attention Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we demonstrate that design decisions on normalizing the aggregated values in the attention architecture have considerable impact on the capabilities of Slot Attention to generalize to a higher number of slots and objects as seen during training. |
Markus Krimmel; Jan Achterhold; Joerg Stueckler; | arxiv-cs.CV | 2024-07-04 |
| 772 | ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Currently the semantic segmentation task of multispectral remotely sensed imagery (MSRSI) faces the following problems: 1) Usually, only single domain feature (i.e., space domain … |
Chang Li; Pengfei Zhang; Yu Wang; | arxiv-cs.CV | 2024-07-03 |
| 773 | Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we aim to learn multi-grained representations, which can effectively describe the image on various granularity levels, thus improving generalization on extensive downstream tasks. |
Chengchao Shen; Jianzhong Chen; Jianxin Wang; | arxiv-cs.CV | 2024-07-02 |
| 774 | Multiple Resolutions Detail Enhancement Network for Real-Time Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Real-time image semantic segmentation (ISS) draws the attentions of more and more researchers as a basis of scene understanding, and it has been applied in many fields that need … |
JING GU et. al. | IEEE Transactions on Artificial Intelligence | 2024-07-01 |
| 775 | An FPGA-Based Lightweight Semantic Segmentation Neural Network With Optimized Ghost Module Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation neural network classifies each pixel of the input image with semantic labels and it is widely used in varying domains, such as remote sensing, autonomous … |
Yan Chen; Jie Jiang; Yan Ma; | IEEE Internet of Things Journal | 2024-07-01 |
| 776 | Image Semantic Segmentation of Indoor Scenes: A Survey IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Ronny Velastegui; Maxim Tatarchenko; Sezer Karaoglu; Theo Gevers; | Comput. Vis. Image Underst. | 2024-07-01 |
| 777 | Multi-Level Object-Aware Guidance Network for Biomedical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Most state-of-the-art models for biomedical image segmentation are developed based on U-shape architecture, which has two renowned, yet mutually affected, shortcomings: 1) … |
Huisi Wu; Baiming Zhang; Junquan Pan; Jing Qin; | IEEE Transactions on Automation Science and Engineering | 2024-07-01 |
| 778 | Multi-modal LiDAR Point Cloud Semantic Segmentation with Salience Refinement and Boundary Perception Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Point cloud segmentation is essential for scene understanding, which provides advanced information for many applications, such as autonomous driving, robots, and virtual reality. … |
YONG ZHOU et. al. | ACM Transactions on Multimedia Computing, Communications … | 2024-07-01 |
| 779 | Joint Optimization of Crack Segmentation With An Adaptive Dynamic Threshold Module IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Crack segmentation is a critical component in structural health monitoring. Conventional crack segmentation models usually focus on optimizing the cross-entropy-based objective … |
Qin Lei; Jiang Zhong; Chen Wang; | IEEE Transactions on Intelligent Transportation Systems | 2024-07-01 |
| 780 | PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a novel zero-shot panoptic reconstruction method from RGB-D images of scenes. |
XUAN YU et. al. | arxiv-cs.CV | 2024-07-01 |
| 781 | Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper presents MaskField, which enables efficient 3D open-vocabulary segmentation with neural fields from a novel perspective. |
ZIHAN GAO et. al. | arxiv-cs.CV | 2024-07-01 |
| 782 | Self-supported Prototype Rectification for Few-shot Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Few-shot semantic segmentation aims to quickly adapt to pixel-wise predictions for novel classes with only a few labeled images. Recent works rely on prototypical learning, where … |
Zhaoxu Li; Hailing Wang; Guitao Cao; | 2024 International Joint Conference on Neural Networks … | 2024-06-30 |
| 783 | SC-ViT: Semantic Contrast Vision Transformer for Scene Recognition Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Scene recognition remains a challenging task in image recognition. Despite the remarkable advances made by deep learning, especially with the emergence of Convolutional Neural … |
Jiahui Niu; Xin Ma; Rui Li; | 2024 International Joint Conference on Neural Networks … | 2024-06-30 |
| 784 | MSSP: A Multi-view Benchmark for Street Scene Perception in Assistive Navigation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Finding safe paths in autonomous or assistive navigation systems is a challenging task. In this paper, we introduce MSSP, a novel multi-perspective street scene perception … |
Yang Di; S. L. Phung; A. Bouzerdoum; | 2024 International Joint Conference on Neural Networks … | 2024-06-30 |
| 785 | Cross-Patch Relation Enhanced for Weakly Supervised Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Weakly Supervised Semantic Segmentation (WSSS) using only image-level labels relies on Class Activation Map (CAM) to produce pixel-level pseudo segmentation labels, but it … |
Huiqing Su; Wenqin Huang; Qingmin Liao; Zongqing Lu; | 2024 International Joint Conference on Neural Networks … | 2024-06-30 |
| 786 | Segment Anything Model for Automated Image Data Annotation: Empirical Studies Using Text Prompts from Grounding DINO Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we perform empirical studies on six publicly available datasets across different domains and reveal that these errors consistently follow a predictable pattern and can, thus, be mitigated by a simple strategy. |
Fuseini Mumuni; Alhassan Mumuni; | arxiv-cs.CV | 2024-06-27 |
| 787 | SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we present a novel framework, SimTxtSeg, that leverages simple text cues to generate high-quality pseudo-labels and study the cross-modal fusion in training segmentation models, simultaneously. |
Yuxin Xie; Tao Zhou; Yi Zhou; Geng Chen; | arxiv-cs.CV | 2024-06-27 |
| 788 | Artwork Segmentation in Eye-Tracking Experiments: Challenges and Future Directions Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Eye-tracking technology has gained prominence in cultural heritage studies, facilitating behavioral analysis and visitor engagement assessments. This paper explores the challenges … |
Alessio Ferrato; Carla Limongelli; M. Mezzini; Giuseppe Sansonetti; A. Micarelli; | Adjunct Proceedings of the 32nd ACM Conference on User … | 2024-06-27 |
| 789 | A Deep Learning Framework for Segmentation of Road Defects Using ResUNet-a Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: We present a deep learning framework leveraging the ResUNet-a framework for pixel-wise semantic segmentation of cracks and potholes. By integrating key components including a … |
IASON KATSAMENIS et. al. | Proceedings of the 17th International Conference on … | 2024-06-26 |
| 790 | A Lightweight Underwater Fish Image Semantic Segmentation Model Based on U-Net Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of underwater fish images is vital for monitoring fish stocks, assessing marine resources, and sustaining fisheries. To tackle challenges such as low … |
Zhenkai Zhang; Wanghua Li; Boon-Chong Seet; | IET Image Process. | 2024-06-25 |
| 791 | Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This technical report outlines our method for generating a synthetic dataset for semantic segmentation using a latent diffusion model. |
Felix Stillger; Frederik Hasecke; Tobias Meisen; | arxiv-cs.CV | 2024-06-25 |
| 792 | Exploring Image Fusion Techniques for Off-Road Semantic Segmentation in Harsh Lighting Conditions. A Multispectral Imagery Analysis Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In recent years, we have witnessed significant progress in the field of autonomous mobility. However, these advancements have been highly limited to urban environments. Autonomous … |
Pankaj Deoli; Shubham Abhay Deshpande; A. Vierling; Karsten Berns; | 2024 21st International Conference on Ubiquitous Robots (UR) | 2024-06-24 |
| 793 | Enhancing Road Semantic Segmentation Using Generative Adversarial Networks Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Experts in remote sensing have focused a lot of effort on the extraction of geographical objects from high-resolution aerial data using supervised semantic segmentation and deep … |
Vemireddy Anvitha; R. Elakkiya; G. Mohan; Gundala Pallavi; R. P. Kumar; | 2024 15th International Conference on Computing … | 2024-06-24 |
| 794 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: 4D LiDAR semantic segmentation, also referred to as multi-scan semantic segmentation, plays a crucial role in enhancing the environmental understanding capabilities of autonomous … |
NENG WANG et. al. | ArXiv | 2024-06-24 |
| 795 | Automated Segmentation of COVID-19 Infected Lungs Via Modified U-Net Model Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The COVID-19 pandemic has led to significant outbreaks in more than 220 countries worldwide, profoundly impacting the public health and lives. As of February 2024, over 774 … |
Sunil Kumar; Biswajit Bhowmik; | 2024 15th International Conference on Computing … | 2024-06-24 |
| 796 | SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point Cloud Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce SegNet4D, a novel real-time 4D semantic segmentation network offering both efficiency and strong semantic understanding. |
NENG WANG et. al. | arxiv-cs.CV | 2024-06-23 |
| 797 | Bidirectional Feature Fusion and Enhanced Alignment Based Multimodal Semantic Segmentation for Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Image–text multimodal deep semantic segmentation leverages the fusion and alignment of image and text information and provides more prior knowledge for segmentation tasks. It is … |
Qianqian Liu; Xili Wang; | Remote. Sens. | 2024-06-22 |
| 798 | Performance of XLSTM for Semantic Segmentation of Remotely Sensed Images IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Recent advancements in autoregressive networks with linear complexity have driven significant research progress, demonstrating exceptional performance in large language models. A … |
Qinfeng Zhu; Yuanzhi Cai; Lei Fan; | Proceedings of the 2024 7th International Conference on … | 2024-06-20 |
| 799 | Seg-LSTM: Performance of XLSTM for Semantic Segmentation of Remotely Sensed Images Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Our study found that Vision-LSTM’s performance in semantic segmentation was limited and generally inferior to Vision-Transformers-based and Vision-Mamba-based models in most comparative tests. |
Qinfeng Zhu; Yuanzhi Cai; Lei Fan; | arxiv-cs.CV | 2024-06-20 |
| 800 | UMeshSegNet: Semantic Segmentation of 3D Mesh Generated from UAV Photogrammetry * Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: 3D mesh generated from UAV photogrammetry can depicts the urban scene realistically. Most of the studies on semantic segmentation of 3D mesh based on deep learning convert mesh … |
Xinyi Liu; Zihang Liu; Yongjun Zhang; Zhi Gao; Yuhui Tan; | 2024 IEEE 18th International Conference on Control & … | 2024-06-18 |
| 801 | Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Although existing real-time semantic segmentation models achieve a commendable balance between accuracy and speed, their multi-path blocks still affect overall speed. To address this issue, this study proposes a Reparameterizable Dual-Resolution Network (RDRNet) dedicated to real-time semantic segmentation. |
Guoyu Yang; Yuan Wang; Daming Shi; | arxiv-cs.CV | 2024-06-18 |
| 802 | RailPC: A Large-scale Railway Point Cloud Semantic Segmentation Dataset Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation in the context of 3D point clouds for the railway environment holds a significant economic value, but its development is severely hindered by the lack of … |
TENGPING JIANG et. al. | CAAI Trans. Intell. Technol. | 2024-06-17 |
| 803 | Narrowing The Synthetic-to-Real Gap for Thermal Infrared Semantic Image Segmentation Using Diffusion-based Conditional Image Synthesis Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is the task of assigning a semantic class to each pixel in an image. Due to the high annotation efforts for fully supervised learning of Deep Neural Networks … |
Christian Mayr; Christian Kübler; Norbert Haala; Michael Teutsch; | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-06-17 |
| 804 | FisheyeBEVSeg: Surround View Fisheye Cameras Based Bird’s-Eye View Segmentation for Autonomous Driving Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is an effective way to perform scene understanding. Recently, segmentation in 3D Bird’s Eye View (BEV) space has become popular as its directly used by drive … |
S. Yogamani; David Unger; Venkatraman Narayanan; Varun Ravi Kumar; | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-06-17 |
| 805 | Generalized Foggy-Scene Semantic Segmentation By Frequency Decoupling IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Foggy-scene semantic segmentation (FSSS) is highly challenging due to the diverse effects of fog on scene properties and the limited training data. Existing research has mainly … |
Qi Bi; Shaodi You; Theo Gevers; | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-06-17 |
| 806 | Point-Supervised Semantic Segmentation of Natural Scenes Via Hyperspectral Imaging Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Natural scene semantic segmentation is an important task in computer vision. While training accurate models for semantic segmentation relies heavily on detailed and accurate … |
Tianqi Ren; Qiu Shen; Ying Fu; Shaodi You; | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-06-17 |
| 807 | OoDIS: Anomaly Instance Segmentation and Detection Benchmark Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We provide a competition and benchmark website under https://vision.rwth-aachen.de/oodis |
ALEXEY NEKRASOV et. al. | arxiv-cs.CV | 2024-06-17 |
| 808 | SS-ADA: A Semi-Supervised Active Domain Adaptation Framework for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: Semantic segmentation plays an important role in intelligent vehicles, providing pixel-level semantic information about the environment. However, the labeling budget is expensive … |
WEIHAO YAN et. al. | ArXiv | 2024-06-17 |
| 809 | GSAM+Cutie: Text-Promptable Tool Mask Annotation for Endoscopic Video Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Machine learning approaches for multi-view geometric scene understanding in endoscopic surgery often assume temporal consistency across the frames to limit challenges that … |
ROGER D. SOBERANIS-MUKUL et. al. | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-06-17 |
| 810 | Noisy Annotations in Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This study sheds light on the quality of segmentation masks produced by various models and challenges the efficacy of popular methods designed to address learning with label noise. |
Moshe Kimhi; Omer Kerem; Eden Grad; Ehud Rivlin; Chaim Baskin; | arxiv-cs.CV | 2024-06-16 |
| 811 | PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, the actual multi-scale feature fusion often comes with the semantic redundancy issue due to homogeneous semantic contents in pyramid features. To handle this issue, we propose a novel Mamba-based segmentation network, namely PyramidMamba. |
LIBO WANG et. al. | arxiv-cs.CV | 2024-06-16 |
| 812 | Bias-Compensation Augmentation Learning for Semantic Segmentation in UAV Networks IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In the realm of emergency disaster relief, it is paramount to attain a thorough comprehension of the semantic information associated with the local disaster scene for strategic … |
TIANKUO YU et. al. | IEEE Internet of Things Journal | 2024-06-15 |
| 813 | Unlocking The Potential of Pre-trained Vision Transformers for Few-Shot Semantic Segmentation Through Relationship Descriptors IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: The recent advent of pre-trained vision transformers has unveiled a promising property: their inherent capability to group semantically related visual concepts. In this paper we explore to harnesses this emergent feature to tackle few-shot semantic segmentation a task focused on classifying pixels in a test image with a few example data. |
Ziqin Zhou; Hai-Ming Xu; Yangyang Shu; Lingqiao Liu; | cvpr | 2024-06-13 |
| 814 | Building A Strong Pre-Training Baseline for Universal 3D Large-Scale Perception Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Such inconsiderate consistency greatly hampers the promising path of reaching an universal pre-training framework: (1) The cross-scene semantic self-conflict \textit i.e. the intense collision between primitive segments of the same semantics from different scenes; (2) Lacking a globally unified bond that pushes the cross-scene semantic consistency into 3D representation learning. To address above challenges we propose a CSC framework that puts a scene-level semantic consistency in the heart bridging the connection of the similar semantic segments across various scenes. |
HAOMING CHEN et. al. | cvpr | 2024-06-13 |
| 815 | SAI3D: Segment Any Instance in 3D Scenes IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper we introduce SAI3D a novel zero-shot 3D instance segmentation approach that synergistically leverages geometric priors and semantic cues derived from Segment Anything Model (SAM). |
YINGDA YIN et. al. | cvpr | 2024-06-13 |
| 816 | Segment Every Out-of-Distribution Object IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper introduces a method to convert anomaly Score To segmentation Mask called S2M a simple and effective framework for OoD detection in semantic segmentation. |
Wenjie Zhao; Jia Li; Xin Dong; Yu Xiang; Yunhui Guo; | cvpr | 2024-06-13 |
| 817 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper we ask the question of whether any 2D vision model can be lifted to make 3D consistent predictions. |
MUKUND VARMA T et. al. | cvpr | 2024-06-13 |
| 818 | PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This divide-and-conquer strategy simplifies the algorithm development process but comes at the cost of losing an end-to-end unified solution to the problem. In this work we address this limitation by studying camera-based 3D panoptic segmentation aiming to achieve a unified occupancy representation for camera-only 3D scene understanding. |
Yuqi Wang; Yuntao Chen; Xingyu Liao; Lue Fan; Zhaoxiang Zhang; | cvpr | 2024-06-13 |
| 819 | Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We notice that there is a discrepancy between text alignment and semantic segmentation: A text often consists of multiple semantic concepts whereas semantic segmentation strives to create semantically homogeneous segments. To address this issue we propose a novel framework Image-Text Co-Decomposition (CoDe) where the paired image and text are jointly decomposed into a set of image regions and a set of word segments respectively and contrastive learning is developed to enforce region-word alignment. |
JI-JIA WU et. al. | cvpr | 2024-06-13 |
| 820 | SANeRF-HQ: Segment Anything for NeRF in High Quality IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper we introduce the Segment Anything for NeRF in High Quality (SANeRF-HQ) to achieve high-quality 3D segmentation of any target object in a given scene. |
Yichen Liu; Benran Hu; Chi-Keung Tang; Yu-Wing Tai; | cvpr | 2024-06-13 |
| 821 | USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The main challenge in open-vocabulary image segmentation now lies in accurately classifying these segments into text-defined categories. In this paper we introduce the Universal Segment Embedding (USE) framework to address this challenge. |
XIAOQI WANG et. al. | cvpr | 2024-06-13 |
| 822 | Unsupervised Universal Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose an Unsupervised Universal Segmentation model (U2Seg) adept at performing various image segmentation tasks—instance semantic and panoptic—using a novel unified framework. |
DANTONG NIU et. al. | cvpr | 2024-06-13 |
| 823 | Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However existing point-to-point contrastive learning techniques in literature are generally sensitive to outliers resulting in insufficient modeling of the point-wise representations. To address this problem we propose a method named DDSemi for semi-supervised 3D semantic segmentation where a density-guided contrastive learning technique is explored. |
Jianan Li; Qiulei Dong; | cvpr | 2024-06-13 |
| 824 | Hierarchical Intra-modal Correlation Learning for Label-free 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However these methods usually suffer from inconsistent and noisy pseudo-labels provided by the vision language models. To address this issue we present a hierarchical intra-modal correlation learning framework that captures visual and geometric correlations in 3D scenes at three levels: intra-set intra-scene and inter-scene to help learn more compact 3D representations. |
Xin Kang; Lei Chu; Jiahao Li; Xuejin Chen; Yan Lu; | cvpr | 2024-06-13 |
| 825 | Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper to solve the mentioned challenge we analyze the gap between the capability of the CLIP model and the requirement of the zero-shot semantic segmentation task. |
Yi Zhang; Meng-Hao Guo; Miao Wang; Shi-Min Hu; | cvpr | 2024-06-13 |
| 826 | APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end we propose to leverage the cutting-edge foundation model the Segment Anything Model (SAM) for generalization enhancement. |
WEIZHAO HE et. al. | cvpr | 2024-06-13 |
| 827 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work we introduce a Generalizable Semantic Neural Radiance Field (GSNeRF) which uniquely takes image semantics into the synthesis process so that both novel view images and the associated semantic maps can be produced for unseen scenes. |
Zi-Ting Chou; Sheng-Yu Huang; I-Jieh Liu; Yu-Chiang Frank Wang; | cvpr | 2024-06-13 |
| 828 | Flattening The Parent Bias: Hierarchical Semantic Segmentation in The Poincare Ball Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We find that on the new testing domains a flat (non-hierarchical) segmentation network in which the parents are inferred from the children has superior segmentation accuracy to the hierarchical approach across the board. Complementing these findings and inspired by the intrinsic properties of hyperbolic spaces we study a more principled approach to hierarchical segmentation using the Poincare ball model. |
Simon Weber; Bar?? Zöngür; Nikita Araslanov; Daniel Cremers; | cvpr | 2024-06-13 |
| 829 | Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Nevertheless we observe that simply integrating SAM yields limited benefits and can even lead to performance regression due to the inevitable noise issues and challenges in excessive focus on object parts. In this paper we present an innovative framework Point PrompTing (PPT) incorporated with the proposed multi-source curriculum learning strategy to address these challenges. |
Qiyuan Dai; Sibei Yang; | cvpr | 2024-06-13 |
| 830 | PEM: Prototype-based Efficient MaskFormer for Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To achieve such impressive performance these architectures employ intensive operations and require substantial computational resources which are often not available especially on edge devices. To fill this gap we propose Prototype-based Efficient MaskFormer (PEM) an efficient transformer-based architecture that can operate in multiple segmentation tasks. |
NICCOLÒ CAVAGNERO et. al. | cvpr | 2024-06-13 |
| 831 | GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However by rendering semantic/instance labels per pixel without considering the contextual information of the rendered image these methods usually suffer from unclear boundary segmentation and abnormal segmentation of pixels within an object. To solve this problem we propose Generalized Perception NeRF (GP-NeRF) a novel pipeline that makes the widely used segmentation model and NeRF work compatibly under a unified framework for facilitating context-aware 3D scene perception. |
HAO LI et. al. | cvpr | 2024-06-13 |
| 832 | CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work we introduce a novel cost-based approach to adapt vision-language foundation models notably CLIP for the intricate task of semantic segmentation. |
SEOKJU CHO et. al. | cvpr | 2024-06-13 |
| 833 | Open-World Semantic Segmentation Including Class Similarity IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose a novel approach that performs accurate closed-world semantic segmentation and at the same time can identify new categories without requiring any additional training data. |
Matteo Sodano; Federico Magistri; Lucas Nunes; Jens Behley; Cyrill Stachniss; | cvpr | 2024-06-13 |
| 834 | Benchmarking Segmentation Models with Mask-Preserved Attribute Editing Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Different from the previous evaluation paradigms only in consideration of global attribute variations (e.g. adverse weather) we investigate both local and global attribute variations for robustness evaluation. To achieve this we construct a mask-preserved attribute editing pipeline to edit visual attributes of real images with precise control of structural information. |
Zijin Yin; Kongming Liang; Bing Li; Zhanyu Ma; Jun Guo; | cvpr | 2024-06-13 |
| 835 | Traffic Scene Parsing Through The TSP6K Dataset Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However little effort has been put into improving the traffic monitoring scene understanding mainly due to the lack of specific datasets. To fill this gap we introduce a specialized traffic monitoring dataset termed TSP6K containing images from the traffic monitoring scenario with high-quality pixel-level and instance-level annotations. |
PENG-TAO JIANG et. al. | cvpr | 2024-06-13 |
| 836 | EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This technical limitation often leads to inadequate segmentation of complex objects with diverse structures. To address this gap we present a novel approach EAGLE which emphasizes object-centric representation learning for unsupervised semantic segmentation. |
Chanyoung Kim; Woojung Han; Dayun Ju; Seong Jae Hwang; | cvpr | 2024-06-13 |
| 837 | MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However the large domain-specific inconsistencies between simulated and real-world data pose a significant generalization challenge in semantic segmentation. In this work to alleviate this problem we propose a novel Multi-Resolution Feature Perturbation (MRFP) technique to randomize domain-specific fine-grained features and perturb style of coarse features. |
Sumanth Udupa; Prajwal Gurunath; Aniruddh Sikdar; Suresh Sundaram; | cvpr | 2024-06-13 |
| 838 | Style Blind Domain Generalized Semantic Segmentation Via Covariance Alignment and Semantic Consistence Contrastive Learning IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However these approaches struggle with the entanglement of style and content which may lead to the unintentional removal of crucial content information causing performance degradation. This study addresses this limitation by proposing BlindNet a novel DGSS approach that blinds the style without external modules or datasets. |
Woo-Jin Ahn; Geun-Yeong Yang; Hyun-Duck Choi; Myo-Taeg Lim; | cvpr | 2024-06-13 |
| 839 | Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To achieve this semantic knowledge is distilled by learning to correlate randomly sampled features from images across an entire dataset. In this work we build upon these advances by incorporating information about the structure of the scene into the training process through the use of depth information. |
Leon Sick; Dominik Engel; Pedro Hermosilla; Timo Ropinski; | cvpr | 2024-06-13 |
| 840 | SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper we propose a simple encoder-decoder named SED for open-vocabulary semantic segmentation which comprises a hierarchical encoder-based cost map generation and a gradual fusion decoder with category early rejection. |
Bin Xie; Jiale Cao; Jin Xie; Fahad Shahbaz Khan; Yanwei Pang; | cvpr | 2024-06-13 |
| 841 | ToNNO: Tomographic Reconstruction of A Neural Network’s Output for Weakly Supervised Segmentation of 3D Medical Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a novel approach ToNNO which is based on the Tomographic reconstruction of a Neural Network’s Output. |
Marius Schmidt-Mengin; Alexis Benichoux; Shibeshih Belachew; Nikos Komodakis; Nikos Paragios; | cvpr | 2024-06-13 |
| 842 | MRFS: Mutually Reinforcing Image Fusion and Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper proposes a coupled learning framework to break the performance bottleneck of infrared-visible image fusion and segmentation called MRFS. |
Hao Zhang; Xuhui Zuo; Jie Jiang; Chunchao Guo; Jiayi Ma; | cvpr | 2024-06-13 |
| 843 | SatSynth: Augmenting Image-Mask Pairs Through Diffusion Models for Aerial Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work we explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks. |
Aysim Toker; Marvin Eisenberger; Daniel Cremers; Laura Leal-Taixé; | cvpr | 2024-06-13 |
| 844 | Scribble-Supervised Semantic Segmentation with Prototype-based Feature Augmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing methods often ignore the features of classified pixels during feature propagation. To address these limitations, this paper proposes a prototype-based feature augmentation method that leverages feature prototypes to augment scribble supervision. |
Guiyang Chan; Pengcheng Zhang; Hai Dong; Shunhui Ji; Bainian Chen; | icml | 2024-06-12 |
| 845 | BLO-SAM: Bi-level Optimization Based Finetuning of The Segment Anything Model for Overfitting-Preventing Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Current solutions to these problems, which involve finetuning SAM, often lead to overfitting, a notable issue in scenarios with very limited data, like in medical imaging. To overcome these limitations, we introduce BLO-SAM, which finetunes SAM based on bi-level optimization (BLO). |
Li Zhang; Youwei Liang; Ruiyi Zhang; Amirhosein Javadi; Pengtao Xie; | icml | 2024-06-12 |
| 846 | SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Inspired by the non-contrastive SSL approach (SimSiam), we introduce a novel framework SIMSAM to compute the Semantic Affinity Matrix, which is significant for unsupervised image segmentation. |
Chanda Grover Kamra; Indra Deep Mastan; Nitin Kumar; Debayan Gupta; | arxiv-cs.CV | 2024-06-12 |
| 847 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we investigate panoptic segmentation on 3D voxel scenarios and propose an instance-aware occupancy network, PanoSSC. |
YINING SHI et. al. | arxiv-cs.CV | 2024-06-11 |
| 848 | Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Existing CLIP-based open-vocabulary methods successfully perform 3D object grounding with simple (bare) queries, but cannot cope with ambiguous descriptions that demand an understanding of object relations. To tackle this problem, we propose a modular approach called BBQ (Beyond Bare Queries), which constructs 3D scene graph representation with metric and semantic spatial edges and utilizes a large language model as a human-to-agent interface through our deductive scene reasoning algorithm. |
SERGEY LINOK et. al. | arxiv-cs.CV | 2024-06-11 |
| 849 | U-Net Ensemble for Enhanced Semantic Segmentation in Remote Sensing Imagery IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of remote sensing imagery stands as a fundamental task within the domains of both remote sensing and computer vision. Its objective is to generate a … |
I. Dimitrovski; Vlatko Spasev; S. Loskovska; Ivan Kitanovski; | Remote. Sens. | 2024-06-08 |
| 850 | 1st Place Winner of The 2024 Pixel-level Video Understanding in The Wild (CVPR’24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper details our research work that achieved the 1st place winner in the PVUW’24 VPS challenge, establishing state of art results in all metrics, including the Video Panoptic Quality (VPQ) and Segmentation and Tracking Quality (STQ). |
Qingfeng Liu; Mostafa El-Khamy; Kee-Bong Song; | arxiv-cs.CV | 2024-06-08 |
| 851 | USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The main challenge in open-vocabulary image segmentation now lies in accurately classifying these segments into text-defined categories. In this paper, we introduce the Universal Segment Embedding (USE) framework to address this challenge. |
XIAOQI WANG et. al. | arxiv-cs.CV | 2024-06-07 |
| 852 | 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The motivation behind the MOSE dataset is how to clearly recognize and distinguish objects in complex scenes. In this challenge, we propose a semantic embedding video object segmentation model and use the salient features of objects as query representations. |
Deshui Miao; Xin Li; Zhenyu He; Yaowei Wang; Ming-Hsuan Yang; | arxiv-cs.CV | 2024-06-06 |
| 853 | Frequency-based Matcher for Long-tailed Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Although the long-tailed phenomenon has been investigated in many fields, e.g., classification and object detection, it has not received enough attention in semantic segmentation and has become a non-negligible obstacle to applying semantic segmentation technology in autonomous driving and virtual reality. Therefore, in this work, we focus on a relatively under-explored task setting, long-tailed semantic segmentation (LTSS). |
Shan Li; Lu Yang; Pu Cao; Liulei Li; Huadong Ma; | arxiv-cs.CV | 2024-06-06 |
| 854 | Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we present an effective methodology for training a semantic traversability estimator using egocentric videos and an automated annotation process. |
YUNHO KIM et. al. | arxiv-cs.RO | 2024-06-05 |
| 855 | DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we use a diffusion UNet encoder as afoundation vision encoder and introduce DiffCut, an unsupervised zero-shotsegmentation method that solely harnesses the output features from the finalself-attention block. |
Paul Couairon; Mustafa Shukor; Jean-Emmanuel Haugeard; Matthieu Cord; Nicolas Thome; | arxiv-cs.CV | 2024-06-04 |
| 856 | EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: At present, there are limited studies analyzing cross-view learning. To address this problem, we introduce a novel Unsupervised Cross-view Adaptation Learning approach to modeling the geometric structural change across views in Semantic Scene Understanding. |
THANH-DAT TRUONG et. al. | arxiv-cs.CV | 2024-06-03 |
| 857 | PGGNet: Pyramid Gradual-guidance Network for RGB-D Indoor Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
WUJIE ZHOU et. al. | Signal Process. Image Commun. | 2024-06-01 |
| 858 | 2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In order to deal with the task of video panoptic segmentation in the wild, we propose a robust integrated video panoptic segmentation solution. |
BIAO WU et. al. | arxiv-cs.CV | 2024-06-01 |
| 859 | Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we adopt semi-supervised video semantic segmentation method based on unreliable pseudo labels. |
BIAO WU et. al. | arxiv-cs.CV | 2024-06-01 |
| 860 | Token-word Mixer Meets Object-aware Transformer for Referring Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Zhenliang Zhang; Zhu Teng; Jack Fan; Baopeng Zhang; Jianping Fan; | Pattern Recognit. | 2024-06-01 |
| 861 | Nighttime Image Semantic Segmentation with Retinex Theory Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Zhichao Sun; Huachao Zhu; Xin Xiao; Yuliang Gu; Yongchao Xu; | Image Vis. Comput. | 2024-06-01 |
| 862 | Attention-Based Multi-Kernelized and Boundary-Aware Network for Image Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Xuanchen Zhou; Gengshen Wu; Xin Sun; Pengpeng Hu; Yi Liu; | Neurocomputing | 2024-06-01 |
| 863 | Integration of Object Detection and Semantic Segmentation Based on Convolutional Neural Networks for Navigation and Monitoring of Cyanobacterial Blooms in Lentic Water Scenes Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Fredy Barrientos-Espillco; M. J. Gómez-Silva; Eva Besada-Portas; Gonzalo Pajares; | Appl. Soft Comput. | 2024-06-01 |
| 864 | To-Former: Semantic Segmentation of Transparent Object with Edge-enhanced Transformer Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Jiawei Chen; Wen Su; Mengjiao Ge; Ye He; Jun Yu; | Vis. Comput. | 2024-05-31 |
| 865 | MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation By Filtering with Self-Supervised Geometry and Motion Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we propose MCDS-VSS, a structured filter model that learns in a self-supervised manner to estimate scene geometry and ego-motion of the camera, while also estimating the motion of external objects. |
Angel Villar-Corrales; Moritz Austermann; Sven Behnke; | arxiv-cs.CV | 2024-05-30 |
| 866 | SemFlow: Binding Semantic Segmentation and Image Synthesis Via Rectified Flow IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: For image synthesis, we propose a finite perturbation approach to enhance the diversity of generated results without changing the semantic categories. |
CHAOYANG WANG et. al. | arxiv-cs.CV | 2024-05-30 |
| 867 | DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Methods: In this work, we propose a dense image-to-shape representation that enables the joint learning of landmarks and semantic segmentation by employing a fully convolutional architecture. |
RON KEUTH et. al. | arxiv-cs.CV | 2024-05-30 |
| 868 | View-Consistent Hierarchical 3D SegmentationUsing Ultrametric Feature Fields Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Large-scale vision foundation models such as Segment Anything (SAM) demonstrate impressive performance in zero-shot image segmentation at multiple levels of granularity. However, … |
Haodi He; Colton Stearns; Adam W. Harley; Leonidas J. Guibas; | ArXiv | 2024-05-30 |
| 869 | View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we address the challenging task of lifting multi-granular and view-inconsistent image segmentations into a hierarchical and 3D-consistent representation. |
Haodi He; Colton Stearns; Adam W. Harley; Leonidas J. Guibas; | arxiv-cs.CV | 2024-05-30 |
| 870 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation Via Large Vision-Language Models IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that … |
TIANRUN CHEN et. al. | ArXiv | 2024-05-29 |
| 871 | CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we propose an approach that integrates mask refinement and binary semantic segmentation, leveraging a novel collaborative training strategy that surpasses current widely-used refinement strategies. |
Ankush Gajanan Arudkar; Bernard J. E. Evans; | arxiv-cs.CV | 2024-05-29 |
| 872 | Reasoning3D — Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation Via Large Vision-Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that transcends limitations for previous category-specific 3D semantic segmentation, 3D instance segmentation, and open-vocabulary 3D segmentation. |
TIANRUN CHEN et. al. | arxiv-cs.CV | 2024-05-29 |
| 873 | RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we introduce RT-GS2, the first generalizable semantic segmentation method employing Gaussian Splatting. |
Mihnea-Bogdan Jurca; Remco Royen; Ion Giosan; Adrian Munteanu; | arxiv-cs.CV | 2024-05-28 |
| 874 | Zero-Shot Video Semantic Segmentation Based on Pre-Trained Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We introduce the first zero-shot approach for Video Semantic Segmentation (VSS) based on pre-trained diffusion models. |
QIAN WANG et. al. | arxiv-cs.CV | 2024-05-27 |
| 875 | Competing for Pixels: A Self-play Algorithm for Weakly-supervised Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Leveraging reinforcement learning (RL) self-play, we propose a novel WSS method that gamifies image segmentation of a ROI. |
SHAHEER U. SAEED et. al. | arxiv-cs.CV | 2024-05-26 |
| 876 | Multi-view Remote Sensing Image Segmentation With SAM Priors Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Subsequently, we introduce SAM features via a transformer into the INF of the scene, supplementing the semantic information. |
ZIPENG QI et. al. | arxiv-cs.CV | 2024-05-23 |
| 877 | BiomedParse: A Biomedical Foundation Model for Image Parsing of Everything Everywhere All at Once Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Here, we propose BiomedParse, a biomedical foundation model for imaging parsing that can jointly conduct segmentation, detection, and recognition for 82 object types across 9 imaging modalities. |
THEODORE ZHAO et. al. | arxiv-cs.CV | 2024-05-21 |
| 878 | Enhancing DeepLabV3+ for Aerial Image Semantic Segmentation Using Weighted Upsampling Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Segmentation of land cover in aerial images is a major challenge that has led to the proposal of various solutions. Among these, the DeepLabV3+ architecture appears to be one of … |
Anas Berka; Y. Es-saady; M. Hajji; R. Canals; Adel Hafiane; | 2024 IEEE 12th International Symposium on Signal, Image, … | 2024-05-21 |
| 879 | Research on Efficient Asymmetric Attention Module for Real-Time Semantic Segmentation Networks in Urban Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Currently, numerous high-precision models have been proposed for semantic segmentation, but the model parameters are large and the segmentation speed is slow. Real-time semantic … |
Xu Su; Lihong Li; Jiejie Xiao; Pengtao Wang; | J. Adv. Comput. Intell. Intell. Informatics | 2024-05-20 |
| 880 | CLFusion:3D Semantic Segmentation Based on Camera and Lidar Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In the field of autonomous driving, semantic segmentation is crucial for scene understanding. Currently, there are two main methods: camera-based and Lidar-based approaches. To … |
TIANYUE WANG et. al. | 2024 IEEE International Symposium on Circuits and Systems … | 2024-05-19 |
| 881 | Universal Organizer of SAM for Unsupervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Recently, a robust framework called the segment anything model (SAM) has been proven to deliver precise boundary object masks. Therefore, this paper proposes a universal organizer based on SAM, termed as UO-SAM, to enhance the mask quality of USS models. |
TINGTING LI et. al. | arxiv-cs.MM | 2024-05-19 |
| 882 | Hybrid Shunted Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Huacong Zhou; Xiangling Xiao; Huihui Li; Xiaoyong Liu; Peng Liang; | Neural Comput. Appl. | 2024-05-18 |
| 883 | CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose CM-UNet, comprising a CNN-based encoder for extracting local image features and a Mamba-based decoder for aggregating and integrating global information, facilitating efficient semantic segmentation of remote sensing images. |
MUSHUI LIU et. al. | arxiv-cs.CV | 2024-05-17 |
| 884 | Accurate Segmentation of Brain Tumors in Magnetic Resonance Images with Pyramid Stage Decomposition Network Approach Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This study explores the utilization of the Pyramid Scene Parsing Network (PSPNet) architecture to achieve accurate segmentation of brain tumors in magnetic resonance (MR) images. … |
Berna Gürler Arı; Hüseyin Üzen; Abdulkadir Şengür; | 2024 32nd Signal Processing and Communications Applications … | 2024-05-15 |
| 885 | Fourier Boundary Features Network with Wider Catchers for Glass Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We proposed the Fourier Boundary Features Network with Wider Catchers (FBWC), which might be the first attempt to utilize sufficiently wide horizontal shallow branches without vertical deepening for guiding the fine granularity segmentation boundary through primary glass semantic information. |
XIAOLIN QIN et. al. | arxiv-cs.CV | 2024-05-15 |
| 886 | Noisy Few-shot 3D Point Cloud Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: 3D scene semantic segmentation plays a crucial role in robotics by enabling robots to understand and interpret their environment in a detailed and context-aware manner, … |
Hao Huang; Shuaihang Yuan; Congcong Wen; Yu Hao; Yi Fang; | 2024 IEEE International Conference on Robotics and … | 2024-05-13 |
| 887 | Zero Shot Context-Based Object Segmentation Using SLIP (SAM+CLIP) Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: We present SLIP (SAM+CLIP), an enhanced architecture for zero-shot object segmentation. SLIP combines the Segment Anything Model (SAM) \cite{kirillov2023segment} with the … |
Saaketh Koundinya Gundavarapu; Arushi Arora; Shreya Agarwal; | ArXiv | 2024-05-12 |
| 888 | Weakly Supervised Semantic Segmentation Via Dual-Stream Contrastive Learning of Cross-Image Contextual Information Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Weakly supervised semantic segmentation (WSSS) aims at learning a semantic segmentation model with only image-level tags. Despite intensive research on deep learning approaches … |
Qi Lai; C. Vong; Chuangquan Chen; | IEEE Transactions on Industrial Informatics | 2024-05-08 |
| 889 | A Novel Approach to Optimizing Convolutional Neural Networks for Improved Digital Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: To divide a digital image into individual parts that share similar characteristics is known as digital image segmentation, and it is a vital research subject in the field of … |
Kongduo Xing; Junhua Ku; Jie Zhao; | Int. J. Intell. Syst. | 2024-05-08 |
| 890 | Weakly-supervised Semantic Segmentation Via Dual-stream Contrastive Learning of Cross-image Contextual Information Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Weakly supervised semantic segmentation (WSSS) aims at learning a semantic segmentation model with only image-level tags. |
Qi Lai; Chi-Man Vong; | arxiv-cs.CV | 2024-05-08 |
| 891 | Exploration of An Open Vocabulary Model on Semantic Segmentation for Street Scene Imagery Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This study investigates the efficacy of an open vocabulary, multi-modal, foundation model for the semantic segmentation of images from complex urban street scenes. Unlike … |
Zichao Zeng; Jan Boehm; | ISPRS Int. J. Geo Inf. | 2024-05-05 |
| 892 | Few-Shot Fruit Segmentation Via Transfer Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we develop a few-shot semantic segmentation framework for infield fruits using transfer learning. |
Jordan A. James; Heather K. Manching; Amanda M. Hulse-Kemp; William J. Beksi; | arxiv-cs.CV | 2024-05-04 |
| 893 | Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we present the first comprehensive survey on XAI in semantic image segmentation. |
Rokas Gipiškis; Chun-Wei Tsai; Olga Kurasova; | arxiv-cs.CV | 2024-05-02 |
| 894 | Domain Adaptive Remote Sensing Image Semantic Segmentation with Prototype Guidance Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
WANKANG ZENG et. al. | Neurocomputing | 2024-05-01 |
| 895 | Trimodal Navigable Region Segmentation Model: Grounding Navigation Instructions in Urban Areas Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In this study, we develop a model that enables mobilities to have more friendly interactions with users. Specifically, we focus on the referring navigable regions task in which a … |
NAOKI HOSOMI et. al. | IEEE Robotics and Automation Letters | 2024-05-01 |
| 896 | Recognizing Pawing Behavior of Prepartum Doe Using Semantic Segmentation and Motion History Image (MHI) Features Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
ZIKANG CHEN et. al. | Expert Syst. Appl. | 2024-05-01 |
| 897 | An Energy-Efficient, Unified CNN Accelerator for Real-Time Multi-Object Semantic Segmentation for Autonomous Vehicle IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: An energy-efficient, unified convolutional neural network (CNN) accelerator is proposed with a lightweight RGB-D network to achieve real-time, multi-object semantic segmentation … |
Jueun Jung; Seung-Ju Kim; Wuyoung Jang; Bokyoung Seo; K. Lee; | IEEE Transactions on Circuits and Systems I: Regular Papers | 2024-05-01 |
| 898 | On The Use of GNN-based Structural Information to Improve CNN-based Semantic Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Patty Coupeau; Jean-Baptiste Fasquel; M. Dinomais; | J. Vis. Commun. Image Represent. | 2024-05-01 |
| 899 | Break The Bias: Delving Semantic Transform Invariance for Few-Shot Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Few-shot semantic segmentation (FSS) aims to segment objects of unseen classes in query images with only a few annotated support images. Existing FSS algorithms typically focus on … |
Qinglong Cao; Yuntian Chen; Chao Ma; Xiaokang Yang; | IEEE Transactions on Circuits and Systems for Video … | 2024-05-01 |
| 900 | Remote Sensing Image Semantic Segmentation Via Class-guided Structural Interaction and Boundary Perception IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Xin He; Yong Zhou; Bing Liu; Jiaqi Zhao; Rui Yao; | Expert Syst. Appl. | 2024-05-01 |
| 901 | Self-Supervised Contrastive Learning for Camera-to-Radar Knowledge Distillation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The advancement of radar has enabled more accurate object detection and semantic segmentation by leveraging the measurements of the distance, direction, and velocity of an object, … |
Wenpeng Wang; Brad Campbell; Sirajum Munir; | 2024 20th International Conference on Distributed Computing … | 2024-04-29 |
| 902 | CLFT: Camera-LiDAR Fusion Transformer for Semantic Segmentation in Autonomous Driving IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Specifically, the vision transformer is the novel ground-breaker that successfully brought the multi-head-attention mechanism to computer vision applications. Therefore, we propose a vision-transformer-based network to carry out camera-LiDAR fusion for semantic segmentation applied to autonomous driving. |
Junyi Gu; Mauro Bellone; Tomáš Pivoňka; Raivo Sell; | arxiv-cs.CV | 2024-04-27 |
| 903 | Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we address the performance degradation of segmentation models in low-data regimes and propose a prompt-less segmentation method harnessing the ability of segmentation foundation models to segment abstract shapes. |
HEDDA COHEN INDELMAN et. al. | arxiv-cs.CV | 2024-04-25 |
| 904 | Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We present PriMaPs – Principal Mask Proposals – decomposing images into semantically meaningful masks based on their feature representation. |
Oliver Hahn; Nikita Araslanov; Simone Schaub-Meyer; Stefan Roth; | arxiv-cs.CV | 2024-04-25 |
| 905 | Semantic Segmentation of Remote Sensing Images Based on Dual-channel Attention Mechanism Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Due to the inadequate utilization of data correlation and complementarity in the feature extraction process of multimodal remote sensing images, the paper proposes a deep learning … |
Jionghui Jiang; Xi’an Feng; Hui Huang; | IET Image Process. | 2024-04-25 |
| 906 | Research on Cuttings Image Segmentation Method Based on Improved MultiRes-Unet++ with Attention Mechanism Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Cuttings logging is an important technology in petroleum exploration and production. It can be used to identify rock types, oil and gas properties, and reservoir features. … |
Fengcai Huo; Kaiming Liu; Hongli Dong; Weijian Ren; Shuai Dong; | Signal, Image and Video Processing | 2024-04-23 |
| 907 | Survey on Segmentation of Brain Abnormalities in MRI Scan Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: -Medical image segmentation plays an important role in disease monitoring, such as tumor growth, dosage control of medication, and radiation exposure in the human body. Image … |
Idrees Ibraheem Ahmed; Omar M. Hussien Al Okashi; | 2024 21st International Multi-Conference on Systems, … | 2024-04-22 |
| 908 | OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We introduce a self-supervised pretraining method, called OccFeat, for camera-only Bird’s-Eye-View (BEV) segmentation networks. |
SOPHIA SIRKO-GALOUCHENKO et. al. | arxiv-cs.CV | 2024-04-22 |
| 909 | Clio: Real-time Task-Driven Open-Set 3D Scene Graphs IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: While related work implicitly chooses a level of granularity by tuning thresholds for object detection, we argue that such a choice is intrinsically task-dependent. The first contribution of this paper is to propose a task-driven 3D scene understanding problem, where the robot is given a list of tasks in natural language and has to select the granularity and the subset of objects and scene structure to retain in its map that is sufficient to complete the tasks. |
DOMINIC MAGGIO et. al. | arxiv-cs.RO | 2024-04-21 |
| 910 | Show and Grasp: Few-shot Semantic Segmentation for Robot Grasping Through Zero-shot Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, this often comes at the cost of limited performance and fine-tuning is required to be effective in robot grasping scenarios. In this work, we propose to overcome all these limitations by combining the impressive generalization capability reached by foundation models with a high-performing few-shot classifier, working as a score function to select the segmentation that is closer to the support set. |
Leonardo Barcellona; Alberto Bacchin; Matteo Terreran; Emanuele Menegatti; Stefano Ghidoni; | arxiv-cs.RO | 2024-04-19 |
| 911 | Weakly Supervised LiDAR Semantic Segmentation Via Scatter Image Annotation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Specifically, we propose employing scatter images to annotate LiDAR point clouds, combining a pre-trained optical flow estimation network with a foundation image segmentation model to rapidly propagate manual annotations into dense labels for both images and point clouds. |
YILONG CHEN et. al. | arxiv-cs.CV | 2024-04-19 |
| 912 | BACS: Background Aware Continual Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This paper proposes a Backward Background Shift Detector (BACS) to detect previously observed classes based on their distance in the latent space from the foreground centroids of previous steps. |
Mostafa ElAraby; Ali Harakeh; Liam Paull; | arxiv-cs.CV | 2024-04-19 |
| 913 | Contrastive Gaussian Clustering: Weakly Supervised 3D Scene Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce Contrastive Gaussian Clustering, a novel approach capable of provide segmentation masks from any viewpoint and of enabling 3D segmentation of the scene. |
Myrna C. Silva; Mahtab Dahaghin; Matteo Toso; Alessio Del Bue; | arxiv-cs.CV | 2024-04-19 |
| 914 | Group-On: Boosting One-Shot Segmentation with Supportive Query Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose a novel and effective approach for ONE-shot semantic segmentation, called Group-On, which packs multiple query images in batches for the benefit of mutual knowledge support within the same category. |
Hanjing Zhou; Mingze Yin; Danny Chen; Jian Wu; JinTai Chen; | arxiv-cs.CV | 2024-04-17 |
| 915 | Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose a novel event-based motion segmentation algorithm using a Graph Transformer Neural Network, dubbed GTNN. |
Yusra Alkendi; Rana Azzam; Sajid Javed; Lakmal Seneviratne; Yahya Zweiri; | arxiv-cs.CV | 2024-04-16 |
| 916 | ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We introduce ECLAIR (Extended Classification of Lidar for AI Recognition), a new outdoor large-scale aerial LiDAR dataset designed specifically for advancing research in point cloud semantic segmentation. |
Iaroslav Melekhov; Anand Umashankar; Hyeong-Jin Kim; Vladislav Serkov; Dusty Argyle; | arxiv-cs.CV | 2024-04-16 |
| 917 | Conformal Semantic Image Segmentation: Post-hoc Quantification of Predictive Uncertainty IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: We propose a post-hoc, computationally lightweight method to quantify predictive uncertainty in semantic image segmentation. Our approach uses conformal prediction to generate … |
Luca Mossina; Joseba Dalmau; L’eo And’eol; | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-04-16 |
| 918 | Vocabulary-free Image Classification and Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: This assumption is impractical in scenarios with unknown or evolving semantic context. Here, we address this issue and introduce the Vocabulary-free Image Classification (VIC) task, which aims to assign a class from an unconstrained language-induced semantic space to an input image without needing a known vocabulary. |
ALESSANDRO CONTI et. al. | arxiv-cs.CV | 2024-04-16 |
| 919 | YOLO-Med : Multi-Task Interaction Network for Biomedical Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we propose an efficient end-to-end multi-task network capable of concurrently performing object detection and semantic segmentation called YOLO-Med. |
S. Huang; | icassp | 2024-04-15 |
| 920 | Cross-Image Distillation for Semi-Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Moreover, scarce annotated data usually exhibits a biased distribution against the desired one, hindering performance improvement. To address these challenging problems, we propose a novel cross-image distillation framework for semi-supervised semantic segmentation. |
N. ZHANG et. al. | icassp | 2024-04-15 |
| 921 | RD-NERF: Neural Robust Distilled Feature Fields for Sparse-View Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose Neural Robust Distilled Feature Fields (RD-NeRF) for achieving robust 3D semantic feature distillation and 3D consistent scene segmentation with sparse-view labels. |
Y. Ma; B. Dou; T. Zhang; Z. Yuan; | icassp | 2024-04-15 |
| 922 | Domain-Adaptive Semantic Segmentation Emerges From Vision-Language Supervised Domain-Debiased Self-Training Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Even worse, some classes exhibit the extreme domain gap, where the feature distributions undergo a complete shift between the two domains. To alleviate it, we propose a domain-debiased self-training strategy with CLIP to distill its domain-agnostic knowledge. |
H. WANG et. al. | icassp | 2024-04-15 |
| 923 | Language-Driven Open-Vocabulary 3D Semantic Segmentation with Knowledge Distillation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: 3D open-vocabulary semantic segmentation is a challenge in the task of 3D scene understanding, as most current models trained on closed-set datasets struggle to effectively identify categories that were not seen during training. To address this, we introduce a framework called LSWKD. |
Y. Wu; X. -F. Han; G. Xiao; | icassp | 2024-04-15 |
| 924 | Semantic Segmentation for Multi-Scene Remote Sensing Images with Noisy Labels Based on Uncertainty Perception Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, a semantic segmentation method for remote sensing images based on uncertainty perception with noisy labels is proposed. |
X. Lyu; L. Zhang; | icassp | 2024-04-15 |
| 925 | CALSeg: Improving Calibration of Medical Image Segmentation Via Variational Label Smoothing Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, medical image segmentation typically relies on hard labels (one-hot vectors), and when minimizing the cross-entropy loss, the model’s softmax predictions are compelled to align with hard labels, resulting in over-confident predictions. To alleviate above problems, this study proposes a novel framework on calibration of medical image segmentation, called CALSeg. |
X. Guo; Y. Yang; C. Ye; G. Cai; T. Ma; | icassp | 2024-04-15 |
| 926 | The Revenge of BiSeNet: Efficient Multi-Task Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, existing research has primarily concentrated on single-task settings, especially on semantic segmentation, leading to redundant efforts and specialized architectures for different tasks. To address this limitation, we propose a novel architecture for efficient multi-task image segmentation, capable of handling various segmentation tasks without sacrificing efficiency or accuracy. |
Gabriele Rosi; Claudia Cuttano; Niccolò Cavagnero; Giuseppe Averta; Fabio Cermelli; | arxiv-cs.CV | 2024-04-15 |
| 927 | SGT: Self-Guided Transformer for Few-Shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, they often overlook the fact that there is variability in different regions of the same object, and intra-image similarity is higher than inter-image similarity. To address these limitations, a Self-Guided Transformer (SGT) is proposed by leveraging intra-image similarity to improve intra-object inconsistencies in this paper. |
K. Ai; H. Hu; Q. Zhou; Q. Guan; | icassp | 2024-04-15 |
| 928 | Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To overcome such issues, gathering semantic information has been shown to be a promising source of information towards a more complete and discriminative feature representation of indoor scenes. Therefore, the work described in this paper uses both semantic information, obtained from object detection, and semantic segmentation techniques. |
Ricardo Pereira; Luís Garrote; Tiago Barros; Ana Lopes; Urbano J. Nunes; | arxiv-cs.CV | 2024-04-11 |
| 929 | O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: For the purpose of preserving consistency in 3D object properties across different viewpoints, we propose a spatial adaptive voxel adjustment mechanism and a multi-view weight selection method. |
MUER TIE et. al. | arxiv-cs.CV | 2024-04-10 |
| 930 | QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, such methods struggle to segment out topological regions like kitchen in the scene. In this work, we introduce a two-step pipeline to solve this problem. |
YASH MEHAN et. al. | arxiv-cs.CV | 2024-04-09 |
| 931 | DaF-BEVSeg: Distortion-aware Fisheye Camera Based Bird’s Eye View Segmentation with Occlusion Reasoning Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We extend the model with an occlusion reasoning module, which is critical for estimating in BEV space. |
Senthil Yogamani; David Unger; Venkatraman Narayanan; Varun Ravi Kumar; | arxiv-cs.CV | 2024-04-09 |
| 932 | GHOST: Grounded Human Motion Generation with Open Vocabulary Scene-and-Text Contexts Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: The connection between our 3D surroundings and the descriptive language that characterizes them would be well-suited for localizing and generating human motion in context but for … |
Z. Milacski; Koichiro Niinuma; Ryosuke Kawamura; Fernando De la Torre; László A. Jeni; | 2025 IEEE/CVF Winter Conference on Applications of Computer … | 2024-04-08 |
| 933 | Evaluating The Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In order to mitigate those issues, our study explores the effectiveness of a Cut-and-Paste augmentation technique for semantic segmentation in satellite images. We adapt this augmentation, which usually requires labeled instances, to the case of semantic segmentation. |
Ionut M. Motoi; Leonardo Saraceni; Daniele Nardi; Thomas A. Ciarfuglia; | arxiv-cs.CV | 2024-04-08 |
| 934 | D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Despite making some progress, there are still two main drawbacks: (1) the coupling of segmentation and defogging feature representations, resulting in a decrease in semantic representation capability, and (2) the failure to leverage real fog priors in unlabeled foggy data, leading to insufficient model generalization ability. To address these issues, we propose a novel training framework, Decouple Defogging and Semantic learning, called D2SL, aiming to alleviate the adverse impact of defogging tasks on the final segmentation task. |
Xuan Sun; Zhanfu An; Yuyu Liu; | arxiv-cs.CV | 2024-04-07 |
| 935 | HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Many established vision perception systems for autonomous driving scenarios ignore the influence of light conditions, one of the key elements for driving safety. To address this problem, we present HawkDrive, a novel perception system with hardware and software solutions. |
Ziang Guo; Stepan Perminov; Mikhail Konenkov; Dzmitry Tsetserukou; | arxiv-cs.CV | 2024-04-06 |
| 936 | Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose Panoptic Perception, a novel task and a new fine-grained dataset (FineGrip) to achieve a more thorough and universal interpretation for RSIs. |
DANPEI ZHAO et. al. | arxiv-cs.CV | 2024-04-06 |
| 937 | Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this work, we introduce Sigma, a Siamese Mamba network for multi-modal semantic segmentation, utilizing the Selective Structured State Space Model, Mamba. |
ZIFU WAN et. al. | arxiv-cs.CV | 2024-04-05 |
| 938 | Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: The proposed method successfully reduces background noise, leading to improved accuracy of pseudo labels. |
Izumi Fujimori; Masaki Oono; Masami Shishibori; | arxiv-cs.CV | 2024-04-04 |
| 939 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Indeed, point cloud and 3D meshes typically have a lower resolution than images and the reconstructed 3D scene geometry might not project well to the underlying 2D image sequences used to compute pixel-aligned CLIP features. To address these challenges, we propose OpenNeRF which naturally operates on posed images and directly encodes the VLM features within the NeRF. |
FRANCIS ENGELMANN et. al. | arxiv-cs.CV | 2024-04-04 |
| 940 | Flattening The Parent Bias: Hierarchical Semantic Segmentation in The Poincaré Ball IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We find that on the new testing domains, a flat (non-hierarchical) segmentation network, in which the parents are inferred from the children, has superior segmentation accuracy to the hierarchical approach across the board. Complementing these findings and inspired by the intrinsic properties of hyperbolic spaces, we study a more principled approach to hierarchical segmentation using the Poincar\’e ball model. |
Simon Weber; Barış Zöngür; Nikita Araslanov; Daniel Cremers; | arxiv-cs.CV | 2024-04-04 |
| 941 | Research on Efficient Feature Generation and Spatial Aggregation for Remote Sensing Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation algorithms leveraging deep convolutional neural networks often encounter challenges due to their extensive parameters, high computational complexity, and … |
RUOYANG LI et. al. | Algorithms | 2024-04-04 |
| 942 | A Weakly Supervised End-to-end Framework for Semantic Segmentation of Cancerous Area in Whole Slide Image Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Yanbo Feng; Adel Hafiane; Hélène Laurent; | Pattern Anal. Appl. | 2024-04-02 |
| 943 | AGWNet: Attention-guided Adaptive Shuffle Channel Gate Warped Feature Network for Indoor Scene RGB-D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
BING XIONG et. al. | Displays | 2024-04-01 |
| 944 | Smooth Fusion of Multi-spectral Images Via Total Variation Minimization for Traffic Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
YING LI et. al. | Eng. Appl. Artif. Intell. | 2024-04-01 |
| 945 | MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose a novel framework, called MedCLIP-SAM that combines CLIP and SAM models to generate segmentation of clinical scans using text prompts in both zero-shot and weakly supervised settings. |
Taha Koleilat; Hojat Asgariandehkordi; Hassan Rivaz; Yiming Xiao; | arxiv-cs.CV | 2024-03-29 |
| 946 | Elevating Semantic Segmentation: A Conditional Generative Adversarial Network (CGAN)-based Synthetic Scene Image Generation for Enhanced Precision Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Wasan M. Jwaid; | Service Oriented Computing and Applications | 2024-03-29 |
| 947 | Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, our investigation identifies three core deficiencies within the PAvPU framework and proposes robust solutions aimed at refining the metric. By addressing these issues, we aim to enhance the reliability and applicability of uncertainty quantification, especially in scenarios that demand high levels of safety and accuracy, thus contributing to the advancement of semantic segmentation methodologies in critical applications. |
Qitian Ma; Shyam Nanda Rai; Carlo Masone; Tatiana Tommasi; | arxiv-cs.AI | 2024-03-28 |
| 948 | I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper proposes a new knowledge distillation method tailored for image semantic segmentation, termed Intra- and Inter-Class Knowledge Distillation (I2CKD). |
Ayoub Karine; Thibault Napoléon; Maher Jridi; | arxiv-cs.CV | 2024-03-27 |
| 949 | Segment Anything Model (SAM) Meets Object Detected Box Prompts Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Segmenting images is an intricate and exceptionally demanding field within computer vision. Instance Segmentation is one of the subfields of image segmentation that segments … |
Erdal Akin; Héctor Caltenco; K. Adewole; Reza Malekian; Jan A. Persson; | 2024 IEEE International Conference on Industrial Technology … | 2024-03-25 |
| 950 | SatSynth: Augmenting Image-Mask Pairs Through Diffusion Models for Aerial Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks. |
Aysim Toker; Marvin Eisenberger; Daniel Cremers; Laura Leal-Taixé; | arxiv-cs.CV | 2024-03-25 |
| 951 | Learning Generalized Segmentation for Foggy-Scenes By Bi-directional Wavelet Guidance IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: Learning scene semantics that can be well generalized to foggy conditions is important for safety-crucial applications such as autonomous driving. Existing methods need both … |
Qi Bi; Shaodi You; Theo Gevers; | AAAI Conference on Artificial Intelligence | 2024-03-24 |
| 952 | SM2C: Boost The Semi-supervised Segmentation for Medical Image By Using Meta Pseudo Labels and Mixed Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To this end, we introduce a novel method called Scaling-up Mix with Multi-Class (SM2C). |
Yifei Wang; Chuhong Zhu; | arxiv-cs.CV | 2024-03-24 |
| 953 | WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose a method to infer semantic segmentation maps from images captured under adverse weather conditions. |
BLAKE GELLA et. al. | arxiv-cs.CV | 2024-03-21 |
| 954 | MTP: Advancing Remote Sensing Foundation Model Via Multitask Pretraining IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Foundation models have reshaped the landscape of remote sensing (RS) by enhancing various image interpretation tasks. Pretraining is an active research topic, encompassing … |
DI WANG et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2024-03-20 |
| 955 | MTP: Advancing Remote Sensing Foundation Model Via Multi-Task Pretraining Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, transferring the pretrained models to downstream tasks may encounter task discrepancy due to their formulation of pretraining as image classification or object discrimination tasks. In this study, we explore the Multi-Task Pretraining (MTP) paradigm for RS foundation models to address this issue. |
DI WANG et. al. | arxiv-cs.CV | 2024-03-20 |
| 956 | CUS3D: A New Comprehensive Urban-Scale Semantic-Segmentation 3D Benchmark Dataset Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: With the continuous advancement of the construction of smart cities, the availability of large-scale and semantically enriched datasets is essential for enhancing the machine’s … |
LIN GAO et. al. | Remote. Sens. | 2024-03-19 |
| 957 | Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Building upon our previous work, this paper explores the advantages of employing calibrated intensity (also referred to as reflectivity) within learning-based LiDAR semantic segmentation frameworks. |
Kasi Viswanath; Peng Jiang; Srikanth Saripalli; | arxiv-cs.CV | 2024-03-19 |
| 958 | TTT-KD: Test-Time Training for 3D Semantic Segmentation Through Knowledge Distillation from Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we propose the first TTT method for 3D semantic segmentation, TTT-KD, which models Knowledge Distillation (KD) from foundation models (e.g. DINOv2) as a self-supervised objective for adaptation to distribution shifts at test-time. |
Lisa Weijler; Muhammad Jehanzeb Mirza; Leon Sick; Can Ekkazan; Pedro Hermosilla; | arxiv-cs.CV | 2024-03-18 |
| 959 | BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Abstract: Semantic scene segmentation from a bird’s-eye-view (BEV) perspective plays a crucial role in facilitating planning and decision-making for mobile robots. Although recent … |
JONAS SCHRAMM et. al. | ArXiv | 2024-03-18 |
| 960 | Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: To allow our Segment Any Object Model (SAOM) to work in the everything mode, we propose the novel nearest neighbour assignment method, updating point embeddings for each ground-truth mask. |
MARIIA KHAN et. al. | arxiv-cs.CV | 2024-03-15 |
| 961 | TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We propose TransLandSeg, which is a transfer learning approach for landslide semantic segmentation based on a vision foundation model (VFM). |
CHANGHONG HOU et. al. | arxiv-cs.CV | 2024-03-15 |
| 962 | Annotation Free Semantic Segmentation with Vision Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this work, we generate free annotations for any semantic segmentation dataset using existing foundation models. |
Soroush Seifi; Daniel Olmeda Reino; Fabien Despinoy; Rahaf Aljundi; | arxiv-cs.CV | 2024-03-14 |
| 963 | ASPP+-LANet: A Multi-Scale Context Extraction Network for Semantic Segmentation of High-Resolution Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation of remote sensing (RS) images is a pivotal branch in the realm of RS image processing, which plays a significant role in urban planning, building extraction, … |
Lei Hu; Xun Zhou; Jiachen Ruan; Supeng Li; | Remote. Sens. | 2024-03-14 |
| 964 | When Semantic Segmentation Meets Frequency Aliasing IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Existing research only separates an image into easy and hard regions and empirically observes the latter are associated with object boundaries. In this paper, we conduct a comprehensive analysis of hard pixel errors, categorizing them into three types: false responses, merging mistakes, and displacements. |
Linwei Chen; Lin Gu; Ying Fu; | arxiv-cs.CV | 2024-03-13 |
| 965 | Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Concretely, we presents the first interactive framework for point cloud semantic segmentation, named InterPCSeg, which seamlessly integrates with off-the-shelf semantic segmentation networks without offline re-training, enabling it to run in an on-the-fly manner. |
Peng Zhang; Ting Wu; Jinsheng Sun; Weiqing Li; Zhiyong Su; | arxiv-cs.CV | 2024-03-10 |
| 966 | Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we introduce a Multi-Grained Cross-modal Alignment (MGCA) framework, which explicitly learns pixel-level alignment along with object- and region-level alignment to bridge the granularity gap without any dense annotations. |
Yajie Liu; Pu Ge; Qingjie Liu; Di Huang; | arxiv-cs.CV | 2024-03-06 |
| 967 | RISeg: Robot Interactive Object Segmentation Via Body Frame-Invariant Features Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In order to successfully perform manipulation tasks in new environments, such as grasping, robots must be proficient in segmenting unseen objects from the background and/or other … |
HOWARD H. QIAN et. al. | 2024 IEEE International Conference on Robotics and … | 2024-03-04 |
| 968 | Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This review aims to provide a first comprehensive and organized overview of the state-of-the-art research results on pseudo-label methods in the field of semi-supervised semantic segmentation, which we categorize from different perspectives and present specific methods for specific application areas. |
Lingyan Ran; Yali Li; Guoqiang Liang; Yanning Zhang; | arxiv-cs.CV | 2024-03-04 |
| 969 | Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Prior works have commonly used an off-line heuristic thresholding process that combines the CAM maps with off-the-shelf saliency maps produced by a general pre-trained saliency model to produce more accurate pseudo-segmentation labels. We propose AuxSegNet+, a weakly supervised auxiliary learning framework to explore the rich information from these saliency maps and the significant inter-task correlation between saliency detection and semantic segmentation. |
LIAN XU et. al. | arxiv-cs.CV | 2024-03-02 |
| 970 | Building Energy Efficient Semantic Segmentation in Intelligent Edge Computing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Semantic segmentation is a critical area in computer vision, which needs voluminous image data streaming from user devices. Usually, it is challenging to process semantic … |
Xingyu Yuan; He Li; K. Ota; M. Dong; | IEEE Transactions on Green Communications and Networking | 2024-03-01 |
| 971 | FGMNet: Feature Grouping Mechanism Network for RGB-D Indoor Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Yuming Zhang; Wujie Zhou; L. Ye; Lu Yu; Ting Luo; | Digit. Signal Process. | 2024-03-01 |
| 972 | Contrastive Learning-based Knowledge Distillation for RGB-thermal Urban Scene Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Xiaodong Guo; Wujie Zhou; Tong Liu; | Knowl. Based Syst. | 2024-03-01 |
| 973 | PEM: Prototype-based Efficient MaskFormer for Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To achieve such impressive performance, these architectures employ intensive operations and require substantial computational resources, which are often not available, especially on edge devices. To fill this gap, we propose Prototype-based Efficient MaskFormer (PEM), an efficient transformer-based architecture that can operate in multiple segmentation tasks. |
NICCOLÒ CAVAGNERO et. al. | arxiv-cs.CV | 2024-02-29 |
| 974 | YOLO-MED : Multi-Task Interaction Network for Biomedical Images Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this study, we propose an efficient end-to-end multi-task network capable of concurrently performing object detection and semantic segmentation called YOLO-Med. |
SUIZHI HUANG et. al. | arxiv-cs.CV | 2024-02-29 |
| 975 | FusionVision: A Comprehensive Approach of 3D Object Reconstruction and Segmentation from RGB-D Cameras Using YOLO and Fast Segment Anything IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In the realm of computer vision, the integration of advanced techniques into the processing of RGB-D camera inputs poses a significant challenge, given the inherent complexities arising from diverse environmental conditions and varying object appearances. Therefore, this paper introduces FusionVision, an exhaustive pipeline adapted for the robust 3D segmentation of objects in RGB-D imagery. |
Safouane El Ghazouali; Youssef Mhirit; Ali Oukhrid; Umberto Michelucci; Hichem Nouira; | arxiv-cs.CV | 2024-02-29 |
| 976 | An Automated Learning Method of Semantic Segmentation for Train Autonomous Driving Environment Understanding IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: This article proposes an automated machine learning method for semantic segmentation that can be used for automated training of models in fields such as autonomous driving. This … |
Yang Wang; Jin Zhang; Yihao Chen; Hao Yuan; Cheng Wu; | IEEE Transactions on Industrial Informatics | 2024-02-29 |
| 977 | Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Unlike other popular annotation tools that requires about 40 seconds to annotate an image for semantic segmentation in a typical navigation task, Spannotation achieves similar result in about 6.03 seconds. The tools utility was validated through the utilization of its generated masks to train a U-Net model which achieved a validation accuracy of 98.27% and mean Intersection Over Union (mIOU) of 96.66%. |
Samuel O. Folorunsho; William R. Norris; | arxiv-cs.CV | 2024-02-28 |
| 978 | DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We present DFormer, a novel RGB-D pretraining framework to learn transferable representations for RGB-D segmentation tasks. |
BOWEN YIN et. al. | iclr | 2024-02-26 |
| 979 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Indeed, point cloud and 3D meshes typically have a lower resolution than images and the reconstructed 3D scene geometry might not project well to the underlying 2D image sequences used to compute pixel-aligned CLIP features. To address these challenges, we propose OpenNeRF which naturally operates on posed images and directly encodes the VLM features within the NeRF. |
Francis Engelmann; Fabian Manhardt; Michael Niemeyer; Keisuke Tateno; Federico Tombari; | iclr | 2024-02-26 |
| 980 | BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Current solutions to these problems, which involve finetuning SAM, often lead to overfitting, a notable issue in scenarios with very limited data, like in medical imaging. To overcome these limitations, we introduce BLO-SAM, which finetunes SAM based on bi-level optimization (BLO). |
Li Zhang; Youwei Liang; Ruiyi Zhang; Amirhosein Javadi; Pengtao Xie; | arxiv-cs.CV | 2024-02-26 |
| 981 | Rainy Day Image Semantic Segmentation Based on Two-stage Progressive Network Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Heng Zhang; Dongli Jia; Hui Ma; | Vis. Comput. | 2024-02-26 |
| 982 | P2Seg: Pointly-supervised Segmentation Via Mutual Distillation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we design a Mutual Distillation Module (MDM) to leverage the complementary strengths of both instance position and semantic information and achieve accurate instance-level object perception. |
ZIPENG WANG et. al. | iclr | 2024-02-26 |
| 983 | EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, generating fine-grained segmentation masks with diffusion models often requires additional training on annotated datasets, leaving it unclear to what extent pre-trained diffusion models alone understand the semantic relations of their generated images. To address this question, we leverage the semantic knowledge extracted from Stable Diffusion (SD) and aim to develop an image segmentor capable of generating fine-grained segmentation maps without any additional training. |
Koichi Namekata; Amirmojtaba Sabour; Sanja Fidler; Seung Wook Kim; | iclr | 2024-02-26 |
| 984 | ConSept: Continual Semantic Segmentation Via Adapter-based Vision Transformer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: In this paper, we delve into the realm of vision transformers for continual semantic segmentation, a problem that has not been sufficiently explored in previous literature. … |
Bowen Dong; Guanglei Yang; W. Zuo; Lei Zhang; | ArXiv | 2024-02-26 |
| 985 | Placing Objects in Context Via Inpainting for Out-of-distribution Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose the Placing Objects in Context (POC) pipeline to realistically add any object into any image via diffusion models. |
Pau de Jorge; Riccardo Volpi; Puneet K. Dokania; Philip H. S. Torr; Gregory Rogez; | arxiv-cs.CV | 2024-02-26 |
| 986 | Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: While it exhibits remarkable zero-shot generalization in typical scenarios, its advantage diminishes when applied to specialized domains like medical imagery and remote sensing. To address this limitation, this paper introduces Conv-LoRA, a simple yet effective parameter-efficient fine-tuning approach. |
Zihan Zhong; Zhiqiang Tang; Tong He; Haoyang Fang; Chun Yuan; | iclr | 2024-02-26 |
| 987 | Cross-CBAM: A Lightweight Network for Real-time Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Zhengbin Zhang; Zhenhao Xu; Xingsheng Gu; Juan Xiong; | J. Real Time Image Process. | 2024-02-24 |
| 988 | A New CNN-based Semantic Object Segmentation for Autonomous Vehicles in Urban Traffic Scenes Related Papers Related Patents Related Grants Related Venues Related Experts View Save |
Gürkan Doğan; B. Ergen; | Int. J. Multim. Inf. Retr. | 2024-02-23 |
| 989 | Text-Vision Relationship Alignment for Referring Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Save Abstract: Referring image segmentation aims to segment object in an image based on a referring expression. Its difficulty lies in aligning expression semantics with visual instances. The … |
MINGXING PU et. al. | Neural Processing Letters | 2024-02-22 |
| 990 | QIS : Interactive Segmentation Via Quasi-Conformal Mappings Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: In this paper, we propose the quasi-conformal interactive segmentation (QIS) model, which incorporates user input in the form of positive and negative clicks. |
Han Zhang; Daoping Zhang; Lok Ming Lui; | arxiv-cs.CV | 2024-02-22 |
| 991 | DeiSAM: Segment Anything with Deictic Prompting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: However, deep learning approaches cannot reliably interpret such deictic representations due to their lack of reasoning capabilities in complex scenarios. To remedy this issue, we propose DeiSAM — a combination of large pre-trained neural networks with differentiable logic reasoners — for deictic promptable segmentation. |
HIKARU SHINDO et. al. | arxiv-cs.LG | 2024-02-21 |
| 992 | Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: To this end, this paper proposes a simple yet effective scene-level weakly supervised point cloud segmentation method with a newly introduced multi-modality point affinity inference module. |
XIAWEI LI et. al. | aaai | 2024-02-20 |
| 993 | W2P: Switching from Weak Supervision to Partial Supervision for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: This paper underscores the significant influence of noisy pseudo-labels on segmentation network performance, particularly in boundary region. To address above issues, we introduce a novel paradigm: Weak to Partial Supervision (W2P). |
Fangyuan Zhang; Tianxiang Pan; Jun-Hai Yong; Bin Wang; | aaai | 2024-02-20 |
| 994 | Weakly Supervised Semantic Segmentation for Driving Scenes IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: We propose solutions for each issue as follows. |
Dongseob Kim; Seungho Lee; Junsuk Choe; Hyunjung Shim; | aaai | 2024-02-20 |
| 995 | CGMGM: A Cross-Gaussian Mixture Generative Model for Few-Shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: Consequently, they result in incomplete segmentation of foreground objects and mis-segmentation of the complex background. To overcome this issue, we propose the Cross Gaussian Mixture Generative Model (CGMGM), a novel Gaussian Mixture Models~(GMMs)-based FSS method, which establishes the joint distribution of pixel and category in both the support and query images. |
JUNAO SHEN et. al. | aaai | 2024-02-20 |
| 996 | Less Is More: Label Recommendation for Weakly Supervised Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: However, unconstrained or heuristic weakly supervised annotation forms may lead to suboptimal label efficiency. To address this issue, we propose a novel label recommendation framework for weakly supervised point cloud semantic segmentation. |
Zhiyi Pan; Nan Zhang; Wei Gao; Shan Liu; Ge Li; | aaai | 2024-02-20 |
| 997 | X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos Through Cross-Modal Knowledge Transfer IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: Moreover, the irregularity of point cloud poses a difficulty in aligning temporal information within video sequences. To address these issues, we propose a novel cross-modal knowledge transfer framework, called X4D-SceneFormer. |
LINGLIN JING et. al. | aaai | 2024-02-20 |
| 998 | Progressive Feature Self-Reinforcement for Weakly Supervised Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: A typical manifestation is the diminished precision on object boundaries, leading to deteriorated accuracy of WSSS. To alleviate this issue, we propose to adaptively partition the image content into certain regions (e.g., confident foreground and background) and uncertain regions (e.g., object boundaries and misclassified categories) for separate processing. |
JINGXUAN HE et. al. | aaai | 2024-02-20 |
| 999 | Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Save Highlight: We introduce a novel method, Variance-Insensitive and Target-Preserving Mask Refinement to enhance segmentation quality with fewer user inputs. |
CHAOWEI FANG et. al. | aaai | 2024-02-20 |
| 1000 | Learning Content-Enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Save Highlight: In this paper, we propose a Content-enhanced Mask TransFormer (CMFormer) for domain-generalized USSS. |
Qi Bi; Shaodi You; Theo Gevers; | aaai | 2024-02-20 |