Paper Digest: Recent Papers on Semantic Segmentation
Paper Digest Team extracted all recent Semantic Segmentation related papers on our radar, and generated highlight sentences for them. The results are then sorted by relevance & date. In addition to this ‘static’ page, we also provide a real-time version of this article, which has more coverage and is updated in real time to include the most recent updates on this topic.
This curated list is created by the Paper Digest Team. Experience the cutting-edge capabilities of Paper Digest, an innovative AI-powered research platform that gets you the personalized and comprehensive updates on the latest research in your field. It also empowers you to read articles, write articles, get answers, conduct literature reviews and generate research reports.
Experience the full potential of our services today!
TABLE 1: Paper Digest: Recent Papers on Semantic Segmentation
Paper | Author(s) | Source | Date | |
---|---|---|---|---|
1 | Semantic Localization Guiding Segment Anything Model For Reference Remote Sensing Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Current RRSIS methods rely on multi-modal fusion backbones and semantic segmentation heads but face challenges like dense annotation requirements and complex scene interpretation. To address these issues, we propose a framework named \textit{prompt-generated semantic localization guiding Segment Anything Model}(PSLG-SAM), which decomposes the RRSIS task into two stages: coarse localization and fine segmentation. |
Shuyang Li; Shuang Wang; Zhuangzhuang Sun; Jing Xiao; | arxiv-cs.CV | 2025-06-12 |
2 | Symmetrical Flow Matching: Unified Image Generation, Segmentation, and Classification with Score-Based Generative Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work introduces Symmetrical Flow Matching (SymmFlow), a new formulation that unifies semantic segmentation, classification, and image generation within a single model. |
Francisco Caetano; Christiaan Viviers; Peter H. N. De With; Fons van der Sommen; | arxiv-cs.CV | 2025-06-12 |
3 | Urban1960SatSeg: Unsupervised Semantic Segmentation of Mid-20$^{th}$ Century Urban Landscapes with Satellite Imageries Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, severe quality degradation (e.g., distortion, misalignment, and spectral scarcity) and annotation absence have long hindered semantic segmentation on such historical RS imagery. To bridge this gap and enhance understanding of urban development, we introduce $\textbf{Urban1960SatBench}$, an annotated segmentation dataset based on historical satellite imagery with the earliest observation time among all existing segmentation datasets, along with a benchmark framework for unsupervised segmentation tasks, $\textbf{Urban1960SatUSM}$. |
TIANXIANG HAO et. al. | arxiv-cs.CV | 2025-06-11 |
4 | Leveraging Depth and Language for Open-Vocabulary Domain-Generalized Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce Vireo, a novel single-stage framework for OV-DGSS that unifies the strengths of OVSS and DGSS for the first time. |
SIYU CHEN et. al. | arxiv-cs.CV | 2025-06-11 |
5 | MSSDF: Modality-Shared Self-supervised Distillation for High-Resolution Multi-modal Remote Sensing Image Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, acquiring high-quality labeled data is often costly and time-consuming. To address this challenge, we proposes a multi-modal self-supervised learning framework that leverages high-resolution RGB images, multi-spectral data, and digital surface models (DSM) for pre-training. |
TONG WANG et. al. | arxiv-cs.CV | 2025-06-10 |
6 | Segment Any Architectural Facades (SAAF):An Automatic Segmentation Model for Building Facades, Walls and Windows Based on Multimodal Semantics Guidance Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study proposes an automatic segmentation model for building facade walls and windows based on multimodal semantic guidance, called Segment Any Architectural Facades (SAAF). |
PEILIN LI et. al. | arxiv-cs.CV | 2025-06-09 |
7 | PIG: Physically-based Multi-Material Interaction with 3D Gaussians Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, in a scene represented by 3D Gaussian primitives, interactions between objects suffer from inaccurate 3D segmentation, imprecise deformation among different materials, and severe rendering artifacts. To address these challenges, we introduce PIG: Physically-Based Multi-Material Interaction with 3D Gaussians, a novel approach that combines 3D object segmentation with the simulation of interacting objects in high precision. |
ZEYU XIAO et. al. | arxiv-cs.GR | 2025-06-09 |
8 | Efficient Decoupled Feature 3D Gaussian Splatting Via Hierarchical Compression Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing 3DGS-based methods embed both color and high-dimensional semantic features into a single field, leading to significant storage and computational overhead. To mitigate this, we propose Decoupled Feature 3D Gaussian Splatting (DF-3DGS), a novel method that decouples the color and semantic fields, thereby reducing the number of 3D Gaussians required for semantic representation. |
Zhenqi Dai; Ting Liu; Yanning Zhang; | cvpr | 2025-06-07 |
9 | Stepwise Decomposition and Dual-stream Focus: A Novel Approach for Training-free Camouflaged Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address the issues above, we propose \textbf{RDVP-MSD}, a novel training-free test-time adaptation framework that synergizes \textbf{R}egion-constrained \textbf{D}ual-stream \textbf{V}isual \textbf{P}rompting (RDVP) via \textbf{M}ultimodal \textbf{S}tepwise \textbf{D}ecomposition Chain of Thought (MSD-CoT). |
CHAO YIN et. al. | arxiv-cs.CV | 2025-06-07 |
10 | Exploring Scene Affinity for Semi-Supervised LiDAR Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper explores scene affinity (AIScene), namely intra-scene consistency and inter-scene correlation, for semi-supervised LiDAR semantic segmentation in driving scenes. |
CHUANDONG LIU et. al. | cvpr | 2025-06-07 |
11 | RelationField: Relate Anything in Radiance Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, current method primarily focus on object-centric representations, supporting object segmentation or detection, while understanding semantic relationships between objects remains largely unexplored. To address this gap, we propose RelationField, the first method to extract inter-object relationships directly from neural radiance fields. |
SEBASTIAN KOCH et. al. | cvpr | 2025-06-07 |
12 | SUM Parts: Benchmarking Part-Level Semantic Segmentation of Urban Meshes Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces SUM Parts, the first large-scale dataset for urban textured meshes with part-level semantic labels, covering about 2.5km^2 with 21 classes. |
Weixiao Gao; Liangliang Nan; Hugo Ledoux; | cvpr | 2025-06-07 |
13 | A Semantic Knowledge Complementarity Based Decoupling Framework for Semi-supervised Class-imbalanced Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose SKCDF, a semantic knowledge complementarity based decoupling framework for multi-organ segmentation in class-imbalanced medical images. |
ZHENG ZHANG et. al. | cvpr | 2025-06-07 |
14 | EntitySAM: Segment Everything in Video Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Specifically, we introduce an entity decoder to facilitate inter-object communication and an automatic prompt generator using learnable object queries. |
Mingqiao Ye; Seoung Wug Oh; Lei Ke; Joon-Young Lee; | cvpr | 2025-06-07 |
15 | High Temporal Consistency Through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a lightweight video semantic segmentation approach–suited to onboard real-time inference–achieving high temporal consistency on aerial data through Semantic Similarity Propagation across frames. |
Cédric Vincent; Taehyoung Kim; Henri Meeß; | cvpr | 2025-06-07 |
16 | DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To improve the FSS pipeline, we propose a novel framework that utilizes large language models (LLMs) to adapt general class semantic information to the query image. |
Amin Karimi; Charalambos Poullis; | cvpr | 2025-06-07 |
17 | MaSS13K: A Matting-level Semantic Segmentation Benchmark Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we build a large-scale, matting-level semantic segmentation dataset, named MaSS13K, which consists of 13,348 real-world images, all at 4K resolution. |
Chenxi Xie; Minghan Li; Hui Zeng; Jun Luo; Lei Zhang; | cvpr | 2025-06-07 |
18 | NightAdapter: Learning A Frequency Adapter for Generalizable Night-time Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Night-time scene segmentation is a critical yet challenging task in the real-world applications, primarily due to the complicated lighting conditions. However, existing methods … |
QI BI et. al. | cvpr | 2025-06-07 |
19 | Understanding Fine-tuning CLIP for Open-vocabulary Semantic Segmentation in Hyperbolic Space Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While freezing the text encoder preserves its powerful embeddings, recent studies show that fine-tuning both the text and image encoders jointly significantly enhances segmentation performance, especially for classes from open sets. In this work, we explain this phenomenon from the perspective of hierarchical alignment, since during fine-tuning, the hierarchy level of image embeddings shifts from image-level to pixel-level. |
ZELIN PENG et. al. | cvpr | 2025-06-07 |
20 | Masked Point-Entity Contrast for Open-Vocabulary 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces MPEC, a novel Masked Point-Entity Contrastive learning method for open-vocabulary 3D semantic segmentation that leverages both 3D entity-language alignment and point-entity consistency across different point cloud views to foster entity-specific feature representations. |
Yan Wang; Baoxiong Jia; Ziyu Zhu; Siyuan Huang; | cvpr | 2025-06-07 |
21 | An End-to-End Robust Point Cloud Semantic Segmentation Network with Single-Step Conditional Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose an end-to-end robust semantic Segmentation Network based on a Conditional-Noise Framework (CNF) of DDPMs, named CDSegNet. |
Wentao Qu; Jing Wang; YongShun Gong; Xiaoshui Huang; Liang Xiao; | cvpr | 2025-06-07 |
22 | Dr. Splat: Directly Referring 3D Gaussian Splatting Via Direct Language Embedding Registration Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce Dr. Splat, a novel approach for open-vocabulary 3D scene understanding leveraging 3D Gaussian Splatting. |
KIM JUN-SEONG et. al. | cvpr | 2025-06-07 |
23 | FALCON: Fairness Learning Via Contrastive Attention Approach to Continual Semantic Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work presents a novel Fairness Learning via Contrastive Attention Approach to continual learning in semantic scene understanding. |
Thanh-Dat Truong; Utsav Prabhu; Bhiksha Raj; Jackson Cothren; Khoa Luu; | cvpr | 2025-06-07 |
24 | BFANet: Revisiting 3D Semantic Segmentation with Boundary Feature Analysis Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we revisit 3D semantic segmentation through a more granular lens, shedding light on subtle complexities that are typically overshadowed by broader performance metrics. |
Weiguang Zhao; Rui Zhang; Qiufeng Wang; Guangliang Cheng; Kaizhu Huang; | cvpr | 2025-06-07 |
25 | VidSeg: Training-free Video Semantic Segmentation Based on Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce the first training-free approach for Video Semantic Segmentation (VSS) based on pre-trained diffusion models. |
QIAN WANG et. al. | cvpr | 2025-06-07 |
26 | DocSAM: Unified Document Image Segmentation Via Query Decomposition and Heterogeneous Mixed Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Document image segmentation is crucial in document analysis and recognition but remains challenging due to the heterogeneity of document formats and diverse segmentation tasks. … |
Xiao-Hui Li; Fei Yin; Cheng-Lin Liu; | cvpr | 2025-06-07 |
27 | Zero-Shot 4D Lidar Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, the primary challenge in advancing research and developing generalized, versatile methods for spatio-temporal scene understanding in Lidar lies in the scarcity of datasets that provide the necessary diversity and scale of annotations. To overcome these challenges, we propose SAL-4D (Segment Anything in Lidar–4D), a method that utilizes multi-modal robotic sensor setups as a bridge to distill recent developments in Video Object Segmentation (VOS) in conjunction with off-the-shelf Vision-Language foundation models to Lidar. |
Yushan Zhang; Aljoša Ošep; Laura Leal-Taixé; Tim Meinhardt; | cvpr | 2025-06-07 |
28 | FFR: Frequency Feature Rectification for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we identify that attenuated high-frequency features mislead the decoder of ViT-based WSSS models, resulting in over-smoothed false segmentation. To address this, we propose a Frequency Feature Rectification (FFR) framework to rectify the false segmentations caused by attenuated high-frequency features and enhance the learning of high-frequency features in the decoder. |
Ziqian Yang; Xinqiao Zhao; Xiaolei Wang; Quan Zhang; Jimin Xiao; | cvpr | 2025-06-07 |
29 | CALICO: Part-Focused Semantic Co-Segmentation with Large Vision-Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce the new task of part-focused semantic co-segmentation, which involves identifying and segmenting common objects and their constituent common and unique parts across images. |
Kiet A. Nguyen; Adheesh Juvekar; Tianjiao Yu; Muntasir Wahed; Ismini Lourentzou; | cvpr | 2025-06-07 |
30 | PSA-SSL: Pose and Size-aware Self-Supervised Learning on LiDAR Point Clouds Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose PSA-SSL, a novel extension to point cloud SSL that learns object pose and size-aware (PSA) features. |
Barza Nisar; Steven L. Waslander; | cvpr | 2025-06-07 |
31 | Segment Any Motion in Videos Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a novel approach for moving object segmentation that combines long-range trajectory motion cues with DINO-based semantic features and leverages SAM2 for pixel-level mask densification through an iterative prompting strategy. |
NAN HUANG et. al. | cvpr | 2025-06-07 |
32 | Convex Combination Star Shape Prior for Data-driven Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a convex combination star (CCS) shape, possessing multi-center star shape properties, and has the advantage of effectively controlling the shape of the region through a smooth field function. |
Xinyu Zhao; Jun Xie; Shengzhe Chen; Jun Liu; | cvpr | 2025-06-07 |
33 | CSC-PA: Cross-image Semantic Correlation Via Prototype Attentions for Single-network Semi-supervised Breast Tumor Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Most existing semi-supervised methods employ the mean-teacher architecture, which merely learns semantic information within a single image and heavily relies on the performance of the teacher model. Therefore, we present a novel cross-image semantic correlation semi-supervised framework, named CSC-PA, to improve the performance of BUS image segmentation. |
Zhenhui Ding; Guilian Chen; Qin Zhang; Huisi Wu; Jing Qin; | cvpr | 2025-06-07 |
34 | Benchmarking Large Vision-Language Models Via Directed Scene Graph for Comprehensive Image Captioning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a detailed caption benchmark, termed as CompreCap, to evaluate the visual context from a directed scene graph view. |
FAN LU et. al. | cvpr | 2025-06-07 |
35 | Mosaic3D: Foundation Dataset and Model for Open-Vocabulary 3D Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Applying this pipeline to multiple 3D scene datasets, we create Mosaic3D-5.6M, a dataset of more than 30K annotated scenes with 5.6M mask-text pairs – significantly larger than existing datasets. Building on these data, we propose Mosaic3D, a 3D visiual foundation model (3D-VFM) combining a 3D encoder trained with contrastive learning and a lightweight mask decoder for open-vocabulary 3D semantic and instance segmentation. |
JUNHA LEE et. al. | cvpr | 2025-06-07 |
36 | Scaling Up Image Segmentation Across Data and Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Traditional segmentation models, while effective in isolated tasks, often fail to generalize to more complex and open-ended segmentation problems, such as free-form, open-vocabulary, and in-the-wild scenarios. To bridge this gap, we propose to scale up image segmentation across diverse datasets and tasks such that the knowledge across different tasks and datasets can be integrated while improving the generalization ability. |
PEI WANG et. al. | cvpr | 2025-06-07 |
37 | Rethinking End-to-End 2D to 3D Scene Segmentation in Gaussian Splatting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we design Unified-Lift, a new end-to-end object-aware lifting approach that aims for high-quality 3D segmentation based on our object-aware 3D Gaussian representation. |
RUNSONG ZHU et. al. | cvpr | 2025-06-07 |
38 | A Dataset for Semantic Segmentation in The Presence of Unknowns Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing datasets allow evaluation of only either knowns or unknowns – but not both, which is required to establish "in the wild" suitability of deep neural network models. To bridge this gap, we propose a novel anomaly segmentation dataset, ISSU, featuring a diverse set of anomaly inputs from cluttered real-world environments. |
ZAKARIA LASKAR et. al. | cvpr | 2025-06-07 |
39 | Using Diffusion Priors for Video Amodal Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we propose to tackle video amodal segmentation by formulating it as a conditional generation task, thereby capitalizing on the foundational knowledge in video generative models. |
Kaihua Chen; Deva Ramanan; Tarasha Khurana; | cvpr | 2025-06-07 |
40 | COB-GS: Clear Object Boundaries in 3DGS Segmentation Based on Boundary-Adaptive Gaussian Splitting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Unlike existing approaches that remove ambiguous Gaussians and sacrifice visual quality, COB-GS, as a 3DGS refinement method, jointly optimizes semantic and visual information, allowing the two different levels to cooperate with each other effectively. Specifically, for the semantic guidance, we introduce a boundary-adaptive Gaussian splitting technique that leverages semantic gradient statistics to identify and split ambiguous Gaussians, aligning them closely with object boundaries. |
Jiaxin Zhang; Junjun Jiang; Youyu Chen; Kui Jiang; Xianming Liu; | cvpr | 2025-06-07 |
41 | Rethinking Semi-supervised Segmentation Beyond Accuracy: Reliability and Robustness Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: These qualities, which ensure consistent performance under diverse conditions (robustness) and well-calibrated model confidences as well as meaningful uncertainties (reliability), are essential for safety-critical applications like autonomous driving, where models must handle unpredictable environments and avoid sudden failures at all costs. To address this gap, we introduce the Reliable Segmentation Score (RSS), a novel metric that combines predictive accuracy, calibration, and uncertainty quality measures via a harmonic mean. |
Steven Landgraf; Markus Hillemann; Markus Ulrich; | arxiv-cs.CV | 2025-06-06 |
42 | U-NetMN and SegNetMN: Modified U-Net and SegNet Models for Bimodal SAR Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we evaluate the impact of mode normalization on two widely used semantic segmentation models, U-Net and SegNet. |
MARWANE KZADRI et. al. | arxiv-cs.CV | 2025-06-05 |
43 | SAM-aware Test-time Adaptation for Universal Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing adaptations, such as MedSAM, enhance SAM’s performance in medical imaging but at the cost of reduced generalization to unseen data. Therefore, in this paper, we propose SAM-aware Test-Time Adaptation (SAM-TTA), a fundamentally different pipeline that preserves the generalization of SAM while improving its segmentation performance in medical imaging via a test-time framework. |
JIANGHAO WU et. al. | arxiv-cs.CV | 2025-06-05 |
44 | A Large-Scale Referring Remote Sensing Image Segmentation Dataset and Benchmark Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing datasets for RRSIS suffer from critical limitations in resolution, scene diversity, and category coverage, which hinders the generalization and real-world applicability of refer segmentation models. To facilitate the development of this field, we introduce NWPU-Refer, the largest and most diverse RRSIS dataset to date, comprising 15,003 high-resolution images (1024-2048px) spanning 30+ countries with 49,745 annotated targets supporting single-object, multi-object, and non-object segmentation scenarios. |
ZHIGANG YANG et. al. | arxiv-cs.CV | 2025-06-04 |
45 | Talk2SAM: Text-Guided Semantic Enhancement for Complex-Shaped Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: These models often struggle with thin structures and fine boundaries, leading to poor segmentation quality. We propose Talk2SAM, a novel approach that integrates textual guidance to improve segmentation of such challenging objects. |
Luka Vetoshkin; Dmitry Yudin; | arxiv-cs.CV | 2025-06-03 |
46 | Scene Detection Policies and Keyframe Extraction Strategies for Large-Scale Video Analysis Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a unified, adaptive framework for automatic scene detection and keyframe selection that handles formats ranging from short-form media to long-form films, archival content, and surveillance footage. |
Vasilii Korolkov; | arxiv-cs.CV | 2025-05-31 |
47 | Federated Unsupervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Extending these ideas to federated settings requires feature representation and cluster centroid alignment across distributed clients — an inherently difficult task under heterogeneous data distributions in the absence of supervision. To address this, we propose FUSS Federated Unsupervised image Semantic Segmentation) which is, to our knowledge, the first framework to enable fully decentralized, label-free semantic segmentation training. |
Evangelos Charalampakis; Vasileios Mygdalis; Ioannis Pitas; | arxiv-cs.CV | 2025-05-29 |
48 | Bridging Classical and Modern Computer Vision: PerceptiveNet for Tree Crown Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Compared to the traditional methods, Deep Learning models improve accuracy by extracting informative and discriminative features, but often fall short in capturing the aforementioned complexities. To address these challenges, we propose PerceptiveNet, a novel model incorporating a Logarithmic Gabor-parameterised convolutional layer with trainable filter parameters, alongside a backbone that extracts salient features while capturing extensive context and spatial information through a wider receptive field. |
Georgios Voulgaris; | arxiv-cs.CV | 2025-05-29 |
49 | LiDAR Based Semantic Perception for Forklifts in Outdoor Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we present a novel LiDAR-based semantic segmentation framework tailored for autonomous forklifts operating in complex outdoor environments. |
Benjamin Serfling; Hannes Reichert; Lorenzo Bayerlein; Konrad Doll; Kati Radkhah-Lens; | arxiv-cs.RO | 2025-05-28 |
50 | Test-Time Adaptation of Vision-Language Models for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In response, we propose a novel TTA method tailored to adapting VLMs for segmentation during test time. |
MEHRDAD NOORI et. al. | arxiv-cs.CV | 2025-05-27 |
51 | What You Perceive Is What You Conceive: A Cognition-Inspired Framework for Open Vocabulary Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing paradigms typically perform class-agnostic region segmentation followed by category matching, which deviates from the human visual system’s process of recognizing objects based on semantic concepts, leading to poor alignment between region segmentation and target concepts. To bridge this gap, we propose a novel Cognition-Inspired Framework for open vocabulary image segmentation that emulates the human visual recognition process: first forming a conceptual understanding of an object, then perceiving its spatial extent. |
JIANGHANG LIN et. al. | arxiv-cs.CV | 2025-05-26 |
52 | The Missing Point in Vision Transformers for Universal Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce ViT-P, a novel two-stage segmentation framework that decouples mask generation from classification. |
SAJJAD SHAHABODINI et. al. | arxiv-cs.CV | 2025-05-26 |
53 | Zero-Shot Pseudo Labels Generation Using SAM and CLIP for Semi-Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this approach, the accuracy of the semantic segmentation model depends on the quality of the pseudo labels, and the quality of the pseudo labels depends on the performance of the model to be trained and the amount of data with annotated labels. In this paper, we generate pseudo labels using zero-shot annotation with the Segment Anything Model (SAM) and Contrastive Language-Image Pretraining (CLIP), improve the accuracy of the pseudo labels using the Unified Dual-Stream Perturbations Approach (UniMatch), and use them as enhanced labels to train a semantic segmentation model. |
Nagito Saito; Shintaro Ito; Koichi Ito; Takafumi Aoki; | arxiv-cs.CV | 2025-05-26 |
54 | ADD-SLAM: Adaptive Dynamic Dense SLAM with Gaussian Splatting Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing methods that employ semantic segmentation or object detection for dynamic identification and filtering typically rely on predefined categorical priors, while discarding dynamic scene information crucial for robotic applications such as dynamic obstacle avoidance and environmental interaction. To overcome these challenges, we propose ADD-SLAM: an Adaptive Dynamic Dense SLAM framework based on Gaussian splitting. |
WENHUA WU et. al. | arxiv-cs.CV | 2025-05-25 |
55 | ThinkVideo: High-Quality Reasoning Video Segmentation with Chain of Thoughts Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing works probe into the problem by finetuning Multimodal Large Language Models (MLLM) for segmentation-based output, while still falling short in difficult cases on videos given temporally-sensitive queries, primarily due to the failure to integrate temporal and spatial information. In this paper, we propose ThinkVideo, a novel framework which leverages the zero-shot Chain-of-Thought (CoT) capability of MLLM to address these challenges. |
Shiu-hong Kao; Yu-Wing Tai; Chi-Keung Tang; | arxiv-cs.CV | 2025-05-24 |
56 | Semantic Segmentation with Reward Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Sometimes, we need a semantic segmentation network, and even a visual encoder can have a high compatibility, and can be trained using various types of feedback beyond traditional labels, such as feedback that indicates the quality of the parsing results. To tackle this issue, we proposed RSS (Reward in Semantic Segmentation), the first practical application of reward-based reinforcement learning on pure semantic segmentation offered in two granular levels (pixel-level and image-level). |
Xie Ting; Ye Huang; Zhilin Liu; Lixin Duan; | arxiv-cs.CV | 2025-05-23 |
57 | EMRA-proxy: Enhancing Multi-Class Region Semantic Segmentation in Remote Sensing Images with Attention Proxy Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: High-resolution remote sensing (HRRS) image segmentation is challenging due to complex spatial layouts and diverse object appearances. While CNNs excel at capturing local … |
YICHUN YU et. al. | arxiv-cs.CV | 2025-05-23 |
58 | SemSegBench & DetecBench: Benchmarking Reliability and Generalization Beyond Classification Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To facilitate research towards robust model design in segmentation and detection, our primary objective is to provide benchmarking tools regarding robustness to distribution shifts and adversarial manipulations. |
SHASHANK AGNIHOTRI et. al. | arxiv-cs.CV | 2025-05-23 |
59 | OpenSeg-R: Improving Open-Vocabulary Segmentation Via Step-by-Step Visual Reasoning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This makes it challenging for OVS model to distinguish similar categories in open-world settings due to the lack of contextual understanding and discriminative visual cues. To address this limitation, we propose a step-by-step visual reasoning framework for open-vocabulary segmentation, named OpenSeg-R. |
ZONGYAN HAN et. al. | arxiv-cs.CV | 2025-05-22 |
60 | TextureSAM: Towards A Texture Aware Foundation Model for Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we investigate SAM’s bias toward semantics over textures and introduce a new texture-aware foundation model, TextureSAM, which performs superior segmentation in texture-dominant scenarios. |
Inbal Cohen; Boaz Meivar; Peihan Tu; Shai Avidan; Gal Oren; | arxiv-cs.CV | 2025-05-22 |
61 | From Pixels to Images: Deep Learning Advances in Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This review offers a holistic view of DL-based SS for RS, highlighting key advancements, comparative insights, and open challenges to guide future research. |
Quanwei Liu; Tao Huang; Yanni Dong; Jiaqi Yang; Wei Xiang; | arxiv-cs.CV | 2025-05-21 |
62 | Zero-Shot Gaze-based Volumetric Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we introduce eye gaze as a novel informational modality for interactive segmentation, marking the application of eye-tracking for 3D medical image segmentation. |
Tatyana Shmykova; Leila Khaertdinova; Ilya Pershin; | arxiv-cs.CV | 2025-05-21 |
63 | Scan, Materialize, Simulate: A Generalizable Framework for Physically Grounded Robot Planning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Scan, Materialize, Simulate (SMS), a unified framework that combines 3D Gaussian Splatting for accurate scene reconstruction, visual foundation models for semantic segmentation, vision-language models for material property inference, and physics simulation for reliable prediction of action outcomes. |
Amine Elhafsi; Daniel Morton; Marco Pavone; | arxiv-cs.RO | 2025-05-20 |
64 | Self-Supervised Learning for Image Segmentation: A Comprehensive Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This survey thoroughly investigates over 150 recent image segmentation articles, particularly focusing on SSL. |
Thangarajah Akilan; Nusrat Jahan; Wandong Zhang; | arxiv-cs.CV | 2025-05-19 |
65 | Enhancing Shape Perception and Segmentation Consistency for Industrial Image Inspection Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, a Shape-Aware Efficient Network (SPENet) is proposed, which focuses on the shapes of objects to achieve excellent segmentation consistency by separately supervising the extraction of boundary and body information from images. |
Guoxuan Mao; Ting Cao; Ziyang Li; Yuan Dong; | arxiv-cs.CV | 2025-05-19 |
66 | MetaUAS: Universal Anomaly Segmentation with One-Prompt Meta-Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we explore the potential of a pure visual foundation model as an alternative to widely used vision-language models for universal visual anomaly segmentation. |
Bin-Bin Gao; | arxiv-cs.CV | 2025-05-14 |
67 | FedSaaS: Class-Consistency Federated Semantic Segmentation Via Global Prototype Supervision and Local Adversarial Harmonization Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This oversight results in ambiguities between class representation. To overcome this challenge, we propose a novel federated segmentation framework that strikes class consistency, termed FedSaaS. |
XIAOYANG YU et. al. | arxiv-cs.CV | 2025-05-14 |
68 | MESSI: A Multi-Elevation Semantic Segmentation Image Dataset of An Urban Environment Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a Multi-Elevation Semantic Segmentation Image (MESSI) dataset comprising 2525 images taken by a drone flying over dense urban environments. |
Barak Pinkovich; Boaz Matalon; Ehud Rivlin; Hector Rotstein; | arxiv-cs.CV | 2025-05-13 |
69 | Boosting Cross-spectral Unsupervised Domain Adaptation for Thermal Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a comprehensive study on cross-spectral UDA for thermal image semantic segmentation. |
Seokjun Kwon; Jeongmin Shin; Namil Kim; Soonmin Hwang; Yukyung Choi; | arxiv-cs.CV | 2025-05-11 |
70 | Technical Report for ICRA 2025 GOOSE 2D Semantic Segmentation Challenge: Leveraging Color Shift Correction, RoPE-Swin Backbone, and Quantile-based Label Denoising Strategy for Robust Outdoor Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This report presents our semantic segmentation framework developed by team ACVLAB for the ICRA 2025 GOOSE 2D Semantic Segmentation Challenge, which focuses on parsing outdoor scenes into nine semantic categories under real-world conditions. |
CHIH-CHUNG HSU et. al. | arxiv-cs.CV | 2025-05-11 |
71 | MultiTaskVIF: Segmentation-oriented Visible and Infrared Image Fusion Via Multi-task Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, most existing segmentation-oriented VIF methods adopt a cascade structure comprising separate fusion and segmentation models, leading to increased network complexity and redundancy. This raises a critical question: can we design a more concise and efficient structure to integrate semantic information directly into the fusion model during training-Inspired by multi-task learning, we propose a concise and universal training framework, MultiTaskVIF, for segmentation-oriented VIF models. |
Zixian Zhao; Andrew Howes; Xingchen Zhang; | arxiv-cs.CV | 2025-05-10 |
72 | Brain Hematoma Marker Recognition Using Multitask Learning: SwinTransformer and Swin-Unet Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a method MTL-Swin-Unet which is multi-task learning using transformers for classification and semantic segmentation. |
Kodai Hirata; Tsuyoshi Okita; | arxiv-cs.LG | 2025-05-09 |
73 | RAFT: Robust Augmentation of FeaTures for Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To mitigate the aforementioned gap in image segmentation, we propose RAFT, a novel framework for adapting image segmentation models using minimal labeled real-world data through data and feature augmentations, as well as active learning. |
Edward Humes; Xiaomin Lin; Uttej Kallakuri; Tinoosh Mohsenin; | arxiv-cs.CV | 2025-05-07 |
74 | Segment Any RGB-Thermal Model with Language-aided Distillation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Given that RGB-T provides a robust solution for scene understanding in adverse weather and lighting conditions, such as low light and overexposure, we propose a novel framework, SARTM, which customizes the powerful SAM for RGB-T semantic segmentation. |
DONG XING et. al. | arxiv-cs.CV | 2025-05-03 |
75 | Mamba Based Feature Extraction And Adaptive Multilevel Feature Fusion For 3D Tumor Segmentation From Multi-modal Medical Image Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a Mamba based feature extraction and adaptive multilevel feature fusion for 3D tumor segmentation using multi-modal medical image. |
ZEXIN JI et. al. | arxiv-cs.CV | 2025-04-29 |
76 | Segmenting Objectiveness and Task-awareness Unknown Region for Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel framework termed Segmenting Objectiveness and Task-Awareness (SOTA) for autonomous driving scenes. |
MI ZHENG et. al. | arxiv-cs.CV | 2025-04-27 |
77 | A Data-Centric Approach to 3D Semantic Segmentation of Railway Scenes Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces two targeted data augmentation methods designed to improve segmentation performance on the railway-specific OSDaR23 dataset. |
NICOLAS MÜNGER et. al. | arxiv-cs.CV | 2025-04-25 |
78 | SAIP-Net: Enhancing Remote Sensing Image Segmentation Via Spectral Adaptive Information Propagation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address limitations arising from spatial domain feature fusion and insufficient receptive fields, this paper introduces SAIP-Net, a novel frequency-aware segmentation framework that leverages Spectral Adaptive Information Propagation. |
Zhongtao Wang; Xizhe Cao; Yisong Chen; Guoping Wang; | arxiv-cs.CV | 2025-04-23 |
79 | RGB-D Video Object Segmentation Via Enhanced Multi-store Feature Memory Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel RGB-D VOS method via multi-store feature memory for robust segmentation. |
Boyue Xu; Ruichao Hou; Tongwei Ren; Gangshan Wu; | arxiv-cs.CV | 2025-04-23 |
80 | Lightweight Road Environment Segmentation Using Vector Quantization Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: (3) Vector quantization encourages the latent space to form coarse clusters of continuous features, forcing the model to group similar features, making the learned representations more structured for the decoding process. In this work, we combined vector quantization with the lightweight image segmentation model MobileUNETR and used it as a baseline model for comparison to demonstrate its efficiency. |
Jiyong Kwag; Alper Yilmaz; Charles Toth; | arxiv-cs.CV | 2025-04-18 |
81 | Occlusion-Ordered Semantic Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose to solve the joint task of relative depth ordering and segmentation of instances based on occlusions. |
Soroosh Baselizadeh; Cheuk-To Yu; Olga Veksler; Yuri Boykov; | arxiv-cs.CV | 2025-04-18 |
82 | DC-SAM: In-Context Segment Anything in Images and Videos Via Dual Consistency Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose the Dual Consistency SAM (DC-SAM) method based on prompt-tuning to adapt SAM and SAM2 for in-context segmentation of both images and videos. |
MENGSHI QI et. al. | arxiv-cs.CV | 2025-04-16 |
83 | PraNet-V2: Dual-Supervised Reverse Attention for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, PraNet-V1 struggles with multi-class segmentation tasks. To address this limitation, we propose PraNet-V2, which, compared to PraNet-V1, effectively performs a broader range of tasks including multi-class segmentation. |
Bo-Cheng Hu; Ge-Peng Ji; Dian Shao; Deng-Ping Fan; | arxiv-cs.CV | 2025-04-15 |
84 | Text-Guided Few-Shot Semantic Segmentation with Training-Free Multimodal Feature Matching Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a training-free approach using multimodal feature matching that performs segmentation by identifying regions in a target image that match the features from both the image and text references. |
G. Buthmann; T. Sakai; H. Qiu; T. Katsuki; D. Kimura; | icassp | 2025-04-15 |
85 | UMSSS: A Visual Scene Semantic Segmentation Dataset for Underground Mines Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a challenging semantic segmentation dataset focusing on underground mines, named the underground mine scenes semantic segmentation (UMSSS) dataset, which contains 4200 high-quality annotated images and 18 annotated categories. |
J. Wang; | icassp | 2025-04-15 |
86 | FCoDT-Net: A Novel Framework for High-Precision Medical Image Segmentation Using Contextual Distillation Transformer Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The unused information leads to suboptimal segmentation results. In this paper, we propose the Feature Context Distillation Transformer Network (FCoDT-Net), a deep learning model designed to address these limitations by leveraging the rich contextual information within the skip connections. |
Q. YuTao; Y. SiZhe; H. Bang; R. Wei; | icassp | 2025-04-15 |
87 | Harnessing Light Field Angular Cues and Spatial Geometries for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a novel backbone network called the Light Field Extraction Interaction Network (LFEI-Net). |
C. Jia; F. Shi; X. Cheng; | icassp | 2025-04-15 |
88 | Dual-Path Consistency Unsupervised Domain Adaptation for Nighttime Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, it is often hindered by the lack of annotations due to interference caused by inadequate lighting or exposure. To overcome these difficulties, we propose a Dual-Path Consistency (DPC) unsupervised domain adaptation (UDA) approach. |
Y. Lu; J. Lang; M. Ding; | icassp | 2025-04-15 |
89 | Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite advancements in large universal vision models, these models often underperform in domain-specific tasks like WTB segmentation. To address this, we extend Intrinsic LoRA for image segmentation, and propose a novel dual-space augmentation strategy that integrates both image-level and latent-space augmentations. |
S. Singhal; R. Pérez-Gonzalo; A. Espersen; A. Agudo; | icassp | 2025-04-15 |
90 | PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in The Wild Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This report provides a comprehensive overview of the 4th Pixel-level Video Understanding in the Wild (PVUW) Challenge, held in conjunction with CVPR 2025. |
HENGHUI DING et. al. | arxiv-cs.CV | 2025-04-15 |
91 | A Weakly Supervised Semantic Segmentation Model with Enhanced CLIP Feature Extraction Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper addresses the limitations of the Contrastive Language-Image Pre-training (CLIP) model’s image encoder and proposes a segmentation model WSSS-ECFE with enhanced CLIP feature extraction, aiming to improve the performance of the Weakly Supervised Semantic Segmentation (WSSS) task. |
F. Kong; J. Lu; | icassp | 2025-04-15 |
92 | U-SAM: Upgrade Segment Anything Model With Semantic-Aware and Memory-Efficient Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: (2) SAM’s inefficient use of instance-independent visual features and tokens necessitates maintaining unique features and tokens for each instance, leading to excessive GPU memory consumption and diminished segmentation efficiency. To address these issues, we propose the Universal Segment Anything Model (U-SAM), a semantic-aware and memory-efficient segmentation model designed to perform both promptable and traditional segmentation tasks within a compact and unified framework. |
X. Jin; J. Hu; J. Lin; S. Zhang; L. Cao; | icassp | 2025-04-15 |
93 | Joint Semantic Segmentation of Optical and SAR Image in Hazy Environments Via Cross-modal Information Rectification and Cross-attention Fusion Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a joint semantic segmentation of optical and SAR in hazy environments network that incorporates channel fusion for feature enhancement and cross-attention for feature fusion, enabling efficient segmentation of hazy optical images. |
X. Fan; L. Zhang; | icassp | 2025-04-15 |
94 | SPT: Sequence Prompt Transformer for Interactive Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing methods typically process one image at a time, failing to consider the sequential nature of the images. To overcome this limitation, we propose a novel method called Sequence Prompt Transformer (SPT), the first to utilize sequential image information for interactive segmentation. |
S. Cheng; | icassp | 2025-04-15 |
95 | ES-NeRF: Enhancing Segmentation in NeRF with CLIP Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, they face the challenge of accurately and consistently segmenting objects in complex scenarios. To address this issue, we introduce the Enhancing Segmentation in NeRF with CLIP(ES-NeRF), which aims to improve the segmentation quality through feature fusion with the help of CLIP’s powerful semantic comprehension. |
C. ZHAO et. al. | icassp | 2025-04-15 |
96 | Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, these methods require massive parameter updates and computational effort during the feature extraction and fusion. To address this issue, we propose a novel multimodal fusion network (EFNet) based on an early fusion strategy and a simple but effective feature clustering for training efficient RGB-T semantic segmentation. |
Z. Shen; Y. Li; H. Zhang; Y. Weng; J. Wang; | icassp | 2025-04-15 |
97 | Hazy Remote Sensing Image Semantic Segmentation with Weak Annotations Via Pre-training Optimization and Co-training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite the numerous haze removal methods developed for remote sensing images, their efficacy in the subsequent task of semantic segmentation remains inadequate. To address these issues, this paper aims to enhance the robustness of the segmentation network against haze interference by proposing a weakly supervised semantic segmentation framework based on pre-training optimization and dual-network co-training. |
J. Xu; L. Zhang; | icassp | 2025-04-15 |
98 | MASSeg : 2nd Technical Report for 4th PVUW MOSE Track Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This report presents our solution, which ranked second in the MOSE track of CVPR 2025 PVUW Challenge. |
XUQIANG CAO et. al. | arxiv-cs.CV | 2025-04-14 |
99 | IGL-DT: Iterative Global-Local Feature Learning with Dual-Teacher Semantic Segmentation Framework Under Limited Annotation Scheme Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing methods struggle to balance global semantic representation with fine-grained local feature extraction. To address this challenge, we propose a novel tri-branch semi-supervised segmentation framework incorporating a dual-teacher strategy, named IGL-DT. |
DINH DAI QUAN TRAN et. al. | arxiv-cs.CV | 2025-04-13 |
100 | AerOSeg: Harnessing SAM for Open-Vocabulary Segmentation in Remote Sensing Images Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This necessitates the development of OVS approaches specifically tailored for remote sensing. In this context, we propose AerOSeg, a novel OVS approach for remote sensing data. |
Saikat Dutta; Akhil Vasim; Siddhant Gole; Hamid Rezatofighi; Biplab Banerjee; | arxiv-cs.CV | 2025-04-12 |
101 | ChildlikeSHAPES: Semantic Hierarchical Region Parsing for Animating Figure Drawings Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This semantic understanding is a crucial prerequisite for animation tools that seek to modify figures while preserving their unique style. To help achieve this, we propose a novel hierarchical segmentation model, built upon the architecture and pre-trained SAM, to quickly and accurately obtain these semantic labels. |
Astitva Srivastava; Harrison Jesse Smith; Thu Nguyen-Phuoc; Yuting Ye; | arxiv-cs.GR | 2025-04-10 |
102 | PathSegDiff: Pathology Segmentation Using Diffusion Model Representations Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose PathSegDiff, a novel approach for histopathology image segmentation that leverages Latent Diffusion Models (LDMs) as pre-trained featured extractors. |
Sachin Kumar Danisetty; Alexandros Graikos; Srikar Yellapragada; Dimitris Samaras; | arxiv-cs.CV | 2025-04-09 |
103 | MovSAM: A Single-image Moving Object Segmentation Framework Based on Deep Thinking Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, segmenting moving objects from a single image remains challenging for existing methods due to the absence of temporal cues. To address this gap, we propose MovSAM, the first framework for single-image moving object segmentation. |
CHANG NIE et. al. | arxiv-cs.CV | 2025-04-09 |
104 | InvNeRF-Seg: Fine-Tuning A Pre-Trained NeRF for 3D Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose Invariant NeRF for Segmentation (InvNeRFSeg), a two step, zero change fine tuning strategy for 3D segmentation. |
Jiangsan Zhao; Jakob Geipel; Krzysztof Kusnierek; Xuean Cui; | arxiv-cs.CV | 2025-04-08 |
105 | Semi-Supervised Biomedical Image Segmentation Via Diffusion Models and Teacher-Student Co-Training Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a novel semi-supervised teacher-student framework for biomedical image segmentation, inspired by the recent success of generative models. |
Luca Ciampi; Gabriele Lagani; Giuseppe Amato; Fabrizio Falchi; | arxiv-cs.CV | 2025-04-02 |
106 | Zero-Shot 4D Lidar Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, the primary challenge in advancing research and developing generalized, versatile methods for spatio-temporal scene understanding in Lidar lies in the scarcity of datasets that provide the necessary diversity and scale of annotations.To overcome these challenges, we propose SAL-4D (Segment Anything in Lidar–4D), a method that utilizes multi-modal robotic sensor setups as a bridge to distill recent developments in Video Object Segmentation (VOS) in conjunction with off-the-shelf Vision-Language foundation models to Lidar. We utilize VOS models to pseudo-label tracklets in short video sequences, annotate these tracklets with sequence-level CLIP tokens, and lift them to the 4D Lidar space using calibrated multi-modal sensory setups to distill them to our SAL-4D model. |
Yushan Zhang; Aljoša Ošep; Laura Leal-Taixé; Tim Meinhardt; | arxiv-cs.CV | 2025-04-01 |
107 | Improving Underwater Semantic Segmentation with Underwater Image Quality Attention and Muti-scale Aggregation Attention Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, the low illumination in underwater environments degrades the imaging quality, which in turn seriously deteriorates the performance of underwater semantic segmentation, particularly for outlining the object region boundaries. To tackle this issue, we present UnderWater SegFormer (UWSegFormer), a transformer-based framework for semantic segmentation of low-quality underwater images. |
Xin Zuo; Jiaran Jiang; Jifeng Shen; Wankou Yang; | arxiv-cs.CV | 2025-03-30 |
108 | Open-Vocabulary Semantic Segmentation with Uncertainty Alignment for Robotic Scene Understanding in Indoor Building Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Additionally, most existing methods ignore the uncertainty of the scene recognition problem, leading to low success rates, particularly in ambiguous and complex environments. To address these challenges, we propose an open-vocabulary scene semantic segmentation and detection pipeline leveraging Vision Language Models (VLMs) and Large Language Models (LLMs). |
Yifan Xu; Vineet Kamat; Carol Menassa; | arxiv-cs.CV | 2025-03-29 |
109 | Enhancing DeepLabV3+ to Fuse Aerial and Satellite Images for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we enhance the DeepLabV3+ architecture by introducing a new transposed conventional layers block for upsampling a second entry to fuse it with high level features. |
Anas Berka; Mohamed El Hajji; Raphael Canals; Youssef Es-saady; Adel Hafiane; | arxiv-cs.CV | 2025-03-28 |
110 | A Dataset for Semantic Segmentation in The Presence of Unknowns Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing datasets allow evaluation of only knowns or unknowns – but not both, which is required to establish in the wild suitability of deep neural network models. To bridge this gap, we propose a novel anomaly segmentation dataset, ISSU, that features a diverse set of anomaly inputs from cluttered real-world environments. |
ZAKARIA LASKAR et. al. | arxiv-cs.CV | 2025-03-28 |
111 | Concept-Aware LoRA for Domain-Aligned Segmentation Dataset Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, it often overfits and memorizes training data, limiting their ability to generate diverse and well-aligned samples. To overcome these issues, we propose Concept-Aware LoRA (CA-LoRA), a novel fine-tuning approach that selectively identifies and updates only the weights associated with necessary concepts (e.g., style or viewpoint) for domain alignment while preserving the pretrained knowledge of the T2I model to produce informative samples. |
MINHO PARK et. al. | arxiv-cs.CV | 2025-03-28 |
112 | Towards Generating Realistic 3D Semantic Training Data for Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a novel approach able to generate 3D semantic scene-scale data without relying on any projection or decoupled trained multi-resolution models, achieving more realistic semantic scene data generation compared to previous state-of-the-art methods. |
Lucas Nunes; Rodrigo Marcuzzi; Jens Behley; Cyrill Stachniss; | arxiv-cs.CV | 2025-03-27 |
113 | A Deep Learning Framework for Boundary-Aware Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, they still struggle with blurred target boundaries and insufficient recognition of small targets. To address these issues, this study proposes a Mask2Former-based semantic segmentation algorithm incorporating a boundary enhancement feature bridging module (BEFBM). |
TAI AN et. al. | arxiv-cs.CV | 2025-03-27 |
114 | Show or Tell? Effectively Prompting Vision-Language Models for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a scalable prompting scheme, few-shot prompted semantic segmentation, inspired by open-vocabulary segmentation and few-shot learning. |
NICCOLO AVOGARO et. al. | arxiv-cs.CV | 2025-03-25 |
115 | OpenLex3D: A New Evaluation Benchmark for Open-Vocabulary 3D Scene Representations Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work presents OpenLex3D, a dedicated benchmark to evaluate 3D open-vocabulary scene representations. |
CHRISTINA KASSAB et. al. | arxiv-cs.CV | 2025-03-25 |
116 | The Coralscapes Dataset: Semantic Scene Understanding in Coral Reefs Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We benchmark a wide range of semantic segmentation models, and find that transfer learning from Coralscapes to existing smaller datasets consistently leads to state-of-the-art performance. |
JONATHAN SAUDER et. al. | arxiv-cs.CV | 2025-03-25 |
117 | RoboEngine: Plug-and-Play Robot Data Augmentation with Semantic Robot Segmentation and Background Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: For the first time, users can effortlessly generate physics- and task-aware robot scenes with just a few lines of code. To achieve this, we present a novel robot scene segmentation dataset, a generalizable high-quality robot segmentation model, and a fine-tuned background generation model, which together form the core components of the out-of-the-box toolkit. |
CHENGBO YUAN et. al. | arxiv-cs.RO | 2025-03-24 |
118 | BIMII-Net: Brain-Inspired Multi-Iterative Interactive Network for RGB-T Road Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Nevertheless, existing RGB-T semantic segmentation models typically depend on simple addition or concatenation strategies or ignore the differences between information at different levels. To address these issues, we proposed a novel RGB-T road scene semantic segmentation network called Brain-Inspired Multi-Iteration Interaction Network (BIMII-Net). |
Hanshuo Qiu; Jie Jiang; Ruoli Yang; Lixin Zhan; Jizhao Liu; | arxiv-cs.CV | 2025-03-24 |
119 | PDDM: Pseudo Depth Diffusion Model for RGB-PD Semantic Segmentation Based in Complex Indoor Scenes Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In addition, the pre-trained diffusion model serves as a strong feature extractor for RGB segmentation tasks, but multi-modal diffusion-based segmentation methods remain unexplored. Therefore, we present a Pseudo Depth Diffusion Model (PDDM) that adopts a large-scale text-image diffusion model as a feature extractor and a simple yet effective fusion strategy to integrate pseudo depth. |
Xinhua Xu; Hong Liu; Jianbing Wu; Jinfu Liu; | arxiv-cs.CV | 2025-03-24 |
120 | Context-Aware Semantic Segmentation: Enhancing Pixel-Level Understanding with Large Language Models for Advanced Vision Applications Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Current models, such as CNN and Transformer-based architectures, excel at identifying pixel-level features but fail to distinguish semantically similar objects (e.g., doctor vs. nurse in a hospital scene) or understand complex contextual scenarios (e.g., differentiating a running child from a regular pedestrian in autonomous driving). To address these limitations, we proposed a novel Context-Aware Semantic Segmentation framework that integrates Large Language Models (LLMs) with state-of-the-art vision backbones. |
Ben Rahman; | arxiv-cs.CV | 2025-03-24 |
121 | Seg2Box: 3D Object Detection By Point-Wise Semantics Supervision Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, the challenge arises due to the incomplete geometry structure and boundary ambiguity of point-cloud instances, leading to inaccurate pseudo labels and poor detection results. To address these challenges, we propose a novel method, named Seg2Box. |
MAOJI ZHENG et. al. | arxiv-cs.CV | 2025-03-20 |
122 | Controllable Segmentation-Based Text-Guided Style Editing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a novel approach for controllable, region-specific style editing driven by textual prompts. |
Jingwen Li; Aravind Chandrasekar; Mariana Rocha; Chao Li; Yuqing Chen; | arxiv-cs.GR | 2025-03-20 |
123 | USAM-Net: A U-Net-based Network for Improved Stereo Correspondence and Scene Depth Estimation Using Features from A Pre-trained Image Segmentation Network Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The increasing demand for high-accuracy depth estimation in autonomous driving and augmented reality applications necessitates advanced neural architectures capable of effectively leveraging multiple data modalities. In this context, we introduce the Unified Segmentation Attention Mechanism Network (USAM-Net), a novel convolutional neural network that integrates stereo image inputs with semantic segmentation maps and attention to enhance depth estimation performance. |
Joseph Emmanuel DL Dayo; Prospero C. Naval Jr; | arxiv-cs.CV | 2025-03-19 |
124 | High Temporal Consistency Through Semantic Similarity Propagation in Semi-Supervised Video Semantic Segmentation for Autonomous Flight Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a lightweight video semantic segmentation approach-suited to onboard real-time inference-achieving high temporal consistency on aerial data through Semantic Similarity Propagation across frames. |
Cédric Vincent; Taehyoung Kim; Henri Meeß; | arxiv-cs.CV | 2025-03-19 |
125 | SPNeRF: Open Vocabulary 3D Neural Scene Segmentation with Superpoints Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Extending these capabilities to 3D segmentation introduces challenges, as CLIP’s image-based embeddings often lack the geometric detail necessary for 3D scene segmentation. Recent methods tend to address this by introducing additional segmentation models or replacing CLIP with variations trained on segmentation data, which lead to redundancy or loss on CLIP’s general language capabilities. |
WEIWEN HU et. al. | arxiv-cs.CV | 2025-03-19 |
126 | High-Precision Dichotomous Image Segmentation Via Probing Diffusion Capacity Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To this end, we propose DiffDIS, a diffusion-driven segmentation model that taps into the potential of the pre-trained U-Net within diffusion models, specifically designed for high-resolution, fine-grained object segmentation. |
QIAN YU et. al. | iclr | 2025-03-17 |
127 | 3D-AffordanceLLM: Harnessing Large Language Models for Open-Vocabulary Affordance Detection in 3D Worlds Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We accordingly propose the \textit{3D-AffordanceLLM} (3D-ADLLM), a framework designed for reasoning affordance detection in 3D open-scene. |
HENGSHUO CHU et. al. | iclr | 2025-03-17 |
128 | Adaptive Transformer Attention and Multi-Scale Fusion for Spine 3D Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study proposes a 3D semantic segmentation method for the spine based on the improved SwinUNETR to improve segmentation accuracy and robustness. |
YANLIN XIANG et. al. | arxiv-cs.CV | 2025-03-17 |
129 | Organ-aware Multi-scale Medical Image Segmentation Using Text Prompt Engineering Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While MedSAM has demonstrated strong performance across various medical segmentation tasks, it primarily relies on geometric prompts (e.g., points and bounding boxes) and lacks support for text-based prompts, which could help specify subtle or ambiguous anatomical structures. To overcome these limitations, we propose the Organ-aware Multi-scale Text-guided Medical Image Segmentation Model (OMT-SAM) for multi-organ segmentation. |
Wenjie Zhang; Ziyang Zhang; Mengnan He; Jiancheng Ye; | arxiv-cs.CV | 2025-03-17 |
130 | DynAlign: Unsupervised Dynamic Taxonomy Alignment for Cross-Domain Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, these models often struggle with domain-specific nuances and underrepresented fine-grained categories. To address these challenges, we introduce DynAlign, a two-stage framework that integrates UDA with foundation models to bridge both the image-level and label-level domain gaps. |
Han Sun; Rui Gong; Ismail Nejjar; Olga Fink; | iclr | 2025-03-17 |
131 | Clustering Is Back: Reaching State-of-the-art LiDAR Instance Segmentation Without Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we demonstrate that competitive panoptic segmentation can be achieved using only semantic labels, with instances predicted without any training or annotations. |
Corentin Sautier; Gilles Puy; Alexandre Boulch; Renaud Marlet; Vincent Lepetit; | arxiv-cs.CV | 2025-03-17 |
132 | Class Distribution-induced Attention Map for Open-vocabulary Semantic Segmentations Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we argue that CLIP-based prior works yield patch-wise noisy class predictions while having highly correlated class distributions for each object. |
Dong Un Kang; Hayeon Kim; Se Young Chun; | iclr | 2025-03-17 |
133 | HiMTok: Learning Hierarchical Mask Tokens for Image Segmentation with Large Multimodal Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose the Hierarchical Mask Tokenizer (HiMTok), which represents segmentation masks with up to 32 tokens and eliminates the need for the original image during mask de-tokenization. |
Tao Wang; Changxu Cheng; Lingfeng Wang; Senda Chen; Wuyue Zhao; | arxiv-cs.CV | 2025-03-17 |
134 | Text4Seg: Reimagining Image Segmentation As Text Generation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce Text4Seg, a novel text-as-mask paradigm that casts image segmentation as a text generation problem, eliminating the need for additional decoders and significantly simplifying the segmentation process. |
MENGCHENG LAN et. al. | iclr | 2025-03-17 |
135 | LangDA: Building Context-Awareness Via Language for Domain Adaptive Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Two key approaches in DASS are (1) vision-only approaches using masking or multi-resolution crops, and (2) language-based approaches that use generic class-wise prompts informed by target domain (e.g. a {snowy} photo of a {class}). |
CHANG LIU et. al. | arxiv-cs.CV | 2025-03-16 |
136 | Point Cloud Based Scene Segmentation: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To inspire future research, in this review paper, we provide a comprehensive overview of the current state-of-the-art methods in the field of Point Cloud Semantic Segmentation for autonomous driving. |
Dan Halperin; Niklas Eisl; | arxiv-cs.CV | 2025-03-16 |
137 | OSMa-Bench: Evaluating Open Semantic Mapping Under Varying Lighting Conditions Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces a dynamically configurable and highly automated LLM/LVLM-powered pipeline for evaluating OSM solutions called OSMa-Bench (Open Semantic Mapping Benchmark). |
Maxim Popov; Regina Kurkova; Mikhail Iumanov; Jaafar Mahmoud; Sergey Kolyubin; | arxiv-cs.CV | 2025-03-13 |
138 | MaskAttn-UNet: A Mask Attention-Driven Framework for Universal Low-Resolution Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Low-resolution image segmentation is crucial in real-world applications such as robotics, augmented reality, and large-scale scene understanding, where high-resolution data is often unavailable due to computational constraints. To address this challenge, we propose MaskAttn-UNet, a novel segmentation framework that enhances the traditional U-Net architecture via a mask attention mechanism. |
ANZHE CHENG et. al. | arxiv-cs.CV | 2025-03-11 |
139 | Aligning Instance-Semantic Sparse Representation Towards Unsupervised Object Segmentation and Shape Abstraction with Repeatable Primitives Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Driven by the tendency of high-dimensional semantically similar features to lie in or near low-dimensional subspaces, we introduce a one-stage, fully unsupervised framework towards semantic-aware shape representation. |
Jiaxin Li; Hongxing Wang; Jiawei Tan; Zhilong Ou; Junsong Yuan; | arxiv-cs.CV | 2025-03-10 |
140 | Approximate Size Targets Are Sufficient for Accurate Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our ideas are validated on PASCAL VOC using our new human annotations of approximate object sizes. |
Xingye Fan; Yuri Boykov; | arxiv-cs.CV | 2025-03-10 |
141 | MemorySAM: Memorize Modalities and Semantics with Segment Anything Model 2 for Multi-modal Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Inspired by cross-frame correlation in videos, we propose to treat multi-modal data as a sequence of frames representing the same scene. |
CHENFEI LIAO et. al. | arxiv-cs.CV | 2025-03-09 |
142 | Dynamically Evolving Segment Anything Model with Continuous Learning for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, in practical applications, the diversity of scenarios and tasks in medical image segmentation continues to expand, necessitating models that can dynamically evolve to meet the demands of various segmentation tasks. Here, we introduce EvoSAM, a dynamically evolving medical image segmentation model that continuously accumulates new knowledge from an ever-expanding array of scenarios and tasks, enhancing its segmentation capabilities. |
ZHAORI LIU et. al. | arxiv-cs.CV | 2025-03-08 |
143 | EvidMTL: Evidential Multi-Task Learning for Uncertainty-Aware Semantic Surface Mapping from Monocular RGB Images Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing mapping methods often suffer from overconfident semantic predictions, and sparse and noisy depth sensing, leading to inconsistent map representations. In this paper, we therefore introduce EvidMTL, a multi-task learning framework that uses evidential heads for depth estimation and semantic segmentation, enabling uncertainty-aware inference from monocular RGB images. |
Rohit Menon; Nils Dengler; Sicong Pan; Gokul Krishna Chenchani; Maren Bennewitz; | arxiv-cs.RO | 2025-03-06 |
144 | BEVMOSNet: Multimodal Fusion for BEV Moving Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Conversely, LiDAR and radar sensors remain almost unaffected in these scenarios, and radar provides key velocity information of the objects. Therefore, we introduce BEVMOSNet, to our knowledge, the first end-to-end multimodal fusion leveraging cameras, LiDAR, and radar to precisely predict the moving objects in BEV. |
HIEP TRUONG CONG et. al. | arxiv-cs.CV | 2025-03-05 |
145 | GaussianGraph: 3D Gaussian-based Scene Graph Generation for Open-world Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing methods primarily focus on embedding compressed CLIP features to 3D Gaussians, suffering from low object segmentation accuracy and lack spatial reasoning capabilities. To address these limitations, we propose GaussianGraph, a novel framework that enhances 3DGS-based scene understanding by integrating adaptive semantic clustering and scene graph generation. |
XIHAN WANG et. al. | arxiv-cs.CV | 2025-03-05 |
146 | SurgiSAM2: Fine-tuning A Foundational Model for Surgical Video Anatomy Segmentation and Detection Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Methods: We utilized five public datasets to evaluate and fine-tune SAM 2 for segmenting anatomical tissues in surgical videos/images. |
DEVANISH N. KAMTAM et. al. | arxiv-cs.CV | 2025-03-05 |
147 | UFO: A Unified Approach to Fine-grained Visual Perception Via Open-ended Language Interface Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This is primarily because these tasks often rely heavily on task-specific designs and architectures that can complicate the modeling process. To address this challenge, we present \ours, a framework that \textbf{U}nifies \textbf{F}ine-grained visual perception tasks through an \textbf{O}pen-ended language interface. |
HAO TANG et. al. | arxiv-cs.CV | 2025-03-03 |
148 | Enhanced Neuromorphic Semantic Segmentation Latency Through Stream Event Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Traditional frame-based methods often struggle to balance latency, accuracy, and energy efficiency. To address these challenges, we leverage event streams from event-based cameras-bio-inspired sensors that trigger events in response to changes in the scene. |
D. Hareb; J. Martinet; B. Miramond; | arxiv-cs.CV | 2025-02-26 |
149 | Multi-Granularity Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we aim to generate multi-granularity video segmentation dataset that is annotated for both salient and non-salient masks. |
SANGBEOM LIM et. al. | aaai | 2025-02-25 |
150 | Structural Pruning Via Spatial-aware Information Redundancy for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Within this framework, we introduce a spatial-aware redundancy metric based on feature maps, thus endowing the pruning process with location sensitivity to better adapt to pruning segmentation networks. |
Dongyue Wu; Zilin Guo; Li Yu; Nong Sang; Changxin Gao; | aaai | 2025-02-25 |
151 | Every Component Counts: Rethinking The Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Connected-Component (CC)-Metrics, a novel semantic segmentation evaluation protocol, targeted to align existing semantic segmentation metrics to a multi-instance detection scenario in which each connected component matters. |
ALEXANDER JAUS et. al. | aaai | 2025-02-25 |
152 | InvSeg: Test-Time Prompt Inversion for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This discrepancy hinders diffusion models from capturing accurate visual-textual correlations. To solve this, we propose InvSeg, a test-time prompt inversion method that tackles open-vocabulary semantic segmentation by inverting image-specific visual context into text prompt embedding space, leveraging structure information derived from the diffusion model’s reconstruction process to enrich text prompts so as to associate each class with a structure-consistent mask. |
Jiayi Lin; Jiabo Huang; Jian Hu; Shaogang Gong; | aaai | 2025-02-25 |
153 | Exploring Semantic Consistency and Style Diversity for Domain Generalized Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Domain randomization-based methods frequently incorporate domain-irrelevant noise due to the uncontrollability of style transformations, resulting in segmentation ambiguity. To address these challenges, we introduce a novel framework, named SCSD for Semantic Consistency prediction and Style Diversity generalization. |
Hongwei Niu; Linhuang Xie; Jianghang Lin; Shengchuan Zhang; | aaai | 2025-02-25 |
154 | Weakly Supervised Gland Segmentation with Class Semantic Consistency and Purified Labels Filtration Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Specifically, for class consistency, we propose Consistency Correlation Attention (CCA) to encourage the network to focus on the contribution of class features to semantic dependencies. |
SIYANG FENG et. al. | aaai | 2025-02-25 |
155 | Efficient Event-Based Semantic Segmentation Via Exploiting Frame-Event Fusion: A Hybrid Neural Network Approach Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing event-based semantic segmentation methods often fail to fully exploit the complementary information provided by frames and events, resulting in complex training strategies and increased computational costs. To address these challenges, we propose an efficient hybrid framework for image semantic segmentation, comprising a Spiking Neural Network branch for events and an Artificial Neural Network branch for frames. |
HEBEI LI et. al. | aaai | 2025-02-25 |
156 | SeeDiff: Off-the-Shelf Seeded Mask Generation from Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we take a closer look at attention mechanisms of Stable Diffusion, from which we draw connections with classical seeded segmentation approaches. |
Joon Hyun Park; Kumju Jo; Sungyong Baik; | aaai | 2025-02-25 |
157 | SGTC: Semantic-Guided Triplet Co-training for Sparsely Annotated Semi-Supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Even worse, most of the existing approaches pay much attention to image-level information and ignore semantic features, resulting in the inability to perceive weak boundaries. To address these issues, we propose a novel Semantic-Guided Triplet Co-training (SGTC) framework, which achieves high-end medical image segmentation by only annotating three orthogonal slices of a few volumetric samples, significantly alleviating the burden of radiologists. |
Ke Yan; Qing Cai; Fan Zhang; Ziyan Cao; Zhi Liu; | aaai | 2025-02-25 |
158 | Holistic Correction with Object Prototype for Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we propose a Holistic Correction Network (HCNet) to adaptively acquire concise object prototypes for holistic correction at semantic, spatial and temporal aspects. |
Shengye Qiao; Changqun Xia; Yanjie Liang; Gongjin Lan; Jia Li; | aaai | 2025-02-25 |
159 | Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a unique neural model, leveraging advances from the state space and diffusion generative modeling to achieve remarkable 3D semantic scene completion performance with monocular image input. |
Li Liang; Naveed Akhtar; Jordan Vice; Xiangrui Kong; Ajmal Saeed Mian; | aaai | 2025-02-25 |
160 | S2S2: Semantic Stacking for Robust Semantic Segmentation in Medical Imaging Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In response, we introduce a novel, domain-agnostic, add-on, and data-driven strategy inspired by image stacking in image denoising. |
Yimu Pan; Sitao Zhang; Alison D. Gernand; Jeffery A. Goldstein; James Z. Wang; | aaai | 2025-02-25 |
161 | CLIMB-3D: Continual Learning for Imbalanced 3D Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Although some approaches have addressed class emergence, they often overlook class imbalance, resulting in suboptimal performance — particularly on rare categories. To tackle this challenge, we propose CLIMB-3D, a unified framework for \textbf{CL}ass-incremental \textbf{Imb}alance-aware \textbf{3D}IS. |
VISHAL THENGANE et. al. | arxiv-cs.CV | 2025-02-24 |
162 | Pointmap Association and Piecewise-Plane Constraint for Consistent and Compact 3D Gaussian Segmentation Field Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: As a result, optimization typically lacks awareness of semantic category information, which can result in floaters with ambiguous segmentation. To address these challenges, we introduce CCGS, a method designed to achieve both view consistent 2D segmentation and a compact 3D Gaussian segmentation field. |
WENHAO HU et. al. | arxiv-cs.CV | 2025-02-22 |
163 | RendBEV: Semantic Novel View Synthesis for Self-Supervised Bird’s Eye View Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present RendBEV, a new method for the self-supervised training of BEV semantic segmentation networks, leveraging differentiable volumetric rendering to receive supervision from semantic perspective views computed by a 2D semantic segmentation model. |
Henrique Piñeiro Monteagudo; Leonardo Taccari; Aurel Pjetri; Francesco Sambo; Samuele Salti; | arxiv-cs.CV | 2025-02-20 |
164 | When Segmentation Meets Hyperspectral Image: New Paradigm for Hyperspectral Image Classification Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, their application remains under-explored in this task due to (1) the prevailing notion that larger patch sizes degrade performance, (2) the extensive unlabeled regions in HSI groundtruth, and (3) the misalignment of input shapes between HSI data and segmentation models. Thus, in this study, we propose a novel paradigm and baseline, HSIseg, for HSI classification that leverages segmentation techniques combined with a novel Dynamic Shifted Regional Transformer (DSRT) to overcome these challenges. |
Weilian Zhou; Weixuan Xie; Sei-ichiro Kamata; Man Sing Wong; Haipeng Wang; | arxiv-cs.CV | 2025-02-18 |
165 | From Open-Vocabulary to Vocabulary-Free Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work proposes a Vocabulary-Free Semantic Segmentation pipeline, eliminating the need for predefined class vocabularies. |
KLARA REICHARD et. al. | arxiv-cs.CV | 2025-02-17 |
166 | Text-Promptable Propagation for Referring Medical Image Sequence Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing 2D and 3D segmentation models struggle to explicitly track objects of interest across medical image sequences, and lack support for nteractive, text-driven guidance. To address these limitations, we propose Text-Promptable Propagation (TPP), a model designed for referring medical image sequence segmentation. |
RUNTIAN YUAN et. al. | arxiv-cs.CV | 2025-02-16 |
167 | NPSim: Nighttime Photorealistic Simulation From Daytime Images With Monocular Inverse Rendering and Ray Tracing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this thesis, we introduce a novel approach named NPSim, which enables the simulation of realistic nighttime images from real daytime counterparts with monocular inverse rendering and ray tracing. |
Shutong Zhang; | arxiv-cs.CV | 2025-02-15 |
168 | Instance Segmentation of Scene Sketches Using Natural Image Priors Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce InkLayer, a method for instance segmentation of raster scene sketches. |
Mia Tang; Yael Vinker; Chuan Yan; Lvmin Zhang; Maneesh Agrawala; | arxiv-cs.CV | 2025-02-13 |
169 | SQ-GAN: Semantic Image Communications Using Masked Vector Quantization Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work introduces Semantically Masked VQ-GAN (SQ-GAN), a novel approach integrating generative models to optimize image compression for semantic/task-oriented communications. |
Francesco Pezone; Sergio Barbarossa; Giuseppe Caire; | arxiv-cs.CV | 2025-02-13 |
170 | Prototype Contrastive Consistency Learning for Semi-Supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, although previous contrastive learning methods can mine semantic information from partial pixels within images, they ignore the whole context information of unlabeled images, which is very important to precise segmentation. In order to solve this problem, we propose a novel prototype contrastive learning method called Prototype Contrastive Consistency Segmentation (PCCS) for semi-supervised medical image segmentation. |
Shihuan He; Zhihui Lai; Ruxin Wang; Heng Kong; | arxiv-cs.CV | 2025-02-10 |
171 | Convolutional Neural Network Segmentation for Satellite Imagery Data to Identify Landforms Using U-Net Architecture Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The study applies the U-Net model for effective feature extraction by using Convolutional Neural Network (CNN) segmentation techniques. |
Mitul Goswami; Sainath Dey; Aniruddha Mukherjee; Suneeta Mohanty; Prasant Kumar Pattnaik; | arxiv-cs.CV | 2025-02-08 |
172 | Deep Unfolding Multi-modal Image Fusion Network Via Attribution Analysis Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Although some approaches attempt to jointly optimize image fusion and downstream tasks, these efforts often lack direct guidance or interaction, serving only to assist with a predefined fusion loss. To address this, we propose an “Unfolding Attribution Analysis Fusion network” (UAAFusion), using attribution analysis to tailor fused images more effectively for semantic segmentation, enhancing the interaction between the fusion and segmentation. |
HAOWEN BAI et. al. | arxiv-cs.CV | 2025-02-03 |
173 | Lifting By Gaussians: A Simple, Fast and Flexible Method for 3D Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce Lifting By Gaussians (LBG), a novel approach for open-world instance segmentation of 3D Gaussian Splatted Radiance Fields (3DGS). |
Rohan Chacko; Nicolai Haeni; Eldar Khaliullin; Lin Sun; Douglas Lee; | arxiv-cs.CV | 2025-01-31 |
174 | SynthmanticLiDAR: A Synthetic Dataset for Semantic Segmentation on LiDAR Imaging Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we present a modified CARLA simulator designed with LiDAR semantic segmentation in mind, with new classes, more consistent object labeling with their counterparts from real datasets such as SemanticKITTI, and the possibility to adjust the object class distribution. |
Javier Montalvo; Pablo Carballeira; Álvaro García-Martín; | arxiv-cs.CV | 2025-01-31 |
175 | Let Human Sketches Help: Empowering Challenging Image Segmentation Task with Freehand Sketches Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our approach introduces an innovative sketch-guided interactive segmentation framework, allowing users to intuitively annotate objects with freehand sketches (drawing a rough contour of the object) instead of the traditional bounding boxes or points used in classic interactive segmentation models like SAM. |
YING ZANG et. al. | arxiv-cs.CV | 2025-01-31 |
176 | Freestyle Sketch-in-the-Loop Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we expand the domain of sketch research into the field of image segmentation, aiming to establish freehand sketches as a query modality for subjective image segmentation. |
SUBHADEEP KOLEY et. al. | arxiv-cs.CV | 2025-01-27 |
177 | Cross-Domain Semantic Segmentation with Large Language Model-Assisted Descriptor Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose LangSeg, a novel LLM-guided semantic segmentation method that leverages context-sensitive, fine-grained subclass descriptors generated by LLMs. |
Philip Hughes; Larry Burns; Luke Adams; | arxiv-cs.CV | 2025-01-27 |
178 | D-PLS: Decoupled Semantic Segmentation for 4D-Panoptic-LiDAR-Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces a novel approach to 4D Panoptic LiDAR Segmentation that decouples semantic and instance segmentation, leveraging single-scan semantic predictions as prior information for instance segmentation. |
Maik Steinhauser; Laurenz Reichardt; Nikolas Ebert; Oliver Wasenmüller; | arxiv-cs.CV | 2025-01-27 |
179 | Rethinking Early-Fusion Strategies for Improved Multimodal Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, these methods require massive parameter updates and computational effort during the feature extraction and fusion. To address this issue, we propose a novel multimodal fusion network (EFNet) based on an early fusion strategy and a simple but effective feature clustering for training efficient RGB-T semantic segmentation. |
Zhengwen Shen; Yulian Li; Han Zhang; Yuchen Weng; Jun Wang; | arxiv-cs.CV | 2025-01-19 |
180 | Semi-supervised Semantic Segmentation for Remote Sensing Images Via Multi-scale Uncertainty Consistency and Cross-Teacher-Student Attention Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, RS images pose unique challenges, including rich multi-scale features and high inter-class similarity. To address these problems, this paper proposes a novel semi-supervised Multi-Scale Uncertainty and Cross-Teacher-Student Attention (MUCA) model for RS image semantic segmentation tasks. |
Shanwen Wang; Xin Sun; Changrui Chen; Danfeng Hong; Jungong Han; | arxiv-cs.CV | 2025-01-18 |
181 | Surface-SOS: Self-Supervised Object Segmentation Via Neural Surface Representation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Under conditions of multi-camera inputs, the structural, textural and geometrical consistency among each view can be leveraged to achieve fine-grained object segmentation. To make better use of the above information, we propose Surface representation based Self-supervised Object Segmentation (Surface-SOS), a new framework to segment objects for each view by 3D surface representation from multi-view images of a scene. |
Xiaoyun Zheng; Liwei Liao; Jianbo Jiao; Feng Gao; Ronggang Wang; | arxiv-cs.CV | 2025-01-16 |
182 | Hierarchical Superpixel Segmentation Via Structural Information Theory Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: These approaches do not fully leverage the global information in the graph, leading to suboptimal segmentation quality. To address this limitation, we present SIT-HSS, a hierarchical superpixel segmentation method based on structural information theory. |
MINHUI XIE et. al. | arxiv-cs.CV | 2025-01-13 |
183 | Adaptive Noise-Tolerant Network for Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, instead of relying on clean segmentation labels, we study whether and how integrating imperfect or noisy segmentation results from off-the-shelf segmentation algorithms may help achieve better segmentation results through a new Adaptive Noise-Tolerant Network (ANTN) model. |
Weizhi Li; | arxiv-cs.CV | 2025-01-13 |
184 | RSRefSeg: Referring Remote Sensing Image Segmentation with Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, these approaches often struggle to establish robust alignments between fine-grained semantic concepts, leading to inconsistent representations across textual and visual information. To address these limitations, we introduce a referring remote sensing image segmentation foundational model, RSRefSeg. |
Keyan Chen; Jiafan Zhang; Chenyang Liu; Zhengxia Zou; Zhenwei Shi; | arxiv-cs.CV | 2025-01-12 |
185 | LarvSeg: Exploring Image Classification Data For Large Vocabulary Semantic Segmentation Via Category-wise Attentive Classifier Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a new large vocabulary semantic segmentation framework, called LarvSeg. |
HAOJUN YU et. al. | arxiv-cs.CV | 2025-01-12 |
186 | Static Segmentation By Tracking: A Frustratingly Label-Efficient Approach to Fine-Grained Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a label-efficient method named Static Segmentation by Tracking (SST). |
ZHENYANG FENG et. al. | arxiv-cs.CV | 2025-01-12 |
187 | BEN: Using Confidence-Guided Matting for Dichotomous Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: As improvements in image segmentation become increasingly challenging to achieve, combining image matting and grayscale segmentation techniques offers promising new directions for architectural innovation. Inspired by the possibility of aligning these two model tasks, we propose a new architectural approach for DIS called Confidence-Guided Matting (CGM). |
Maxwell Meyer; Jack Spruyt; | arxiv-cs.CV | 2025-01-07 |
188 | LM-Net: A Light-weight and Multi-scale Network for Medical Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This results in over-segmentation, under-segmentation, and blurred segmentation boundaries. To tackle these challenges, we explore multi-scale feature representations from different perspectives, proposing a novel, lightweight, and multi-scale architecture (LM-Net) that integrates advantages of both Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs) to enhance segmentation accuracy. |
Zhenkun Lu; Chaoyin She; Wei Wang; Qinghua Huang; | arxiv-cs.CV | 2025-01-07 |
189 | Image Segmentation: Inducing Graph-based Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We compare our proposed UNet-GNN model against established convolutional neural networks (CNNs) based segmentation models, including U-Net and U-Net++, as well as the transformer-based SwinUNet. |
Aryan Singh; Pepijn Van de Ven; Ciarán Eising; Patrick Denny; | arxiv-cs.CV | 2025-01-07 |
190 | Superpixel Boundary Correction for Weakly-Supervised Semantic Segmentation on Histopathology Images Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, Class Activation Map (CAM)-based methods still suffer from low spatial resolution and unclear boundaries. To address these issues, we propose a multi-level superpixel correction algorithm that refines CAM boundaries using superpixel clustering and floodfill. |
Hongyi Wu; Hong Zhang; | arxiv-cs.CV | 2025-01-07 |
191 | 4D-CS: Exploiting Cluster Prior for 4D Spatio-Temporal LiDAR Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, these methods often overlook the segmentation consistency in space and time, which may result in point clouds within the same object being predicted as different categories. To handle this issue, our core idea is to generate cluster labels across multiple frames that can reflect the complete spatial structure and temporal information of objects. |
Jiexi Zhong; Zhiheng Li; Yubo Cui; Zheng Fang; | arxiv-cs.CV | 2025-01-06 |
192 | The 2nd Place Solution from The 3D Semantic Segmentation Track in The 2024 Waymo Open Dataset Challenge Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this report, we introduce MixSeg3D, a sophisticated combination of the strong point cloud segmentation model with advanced 3D data mixing strategies. |
Qing Wu; | arxiv-cs.CV | 2025-01-06 |
193 | MedSegDiffNCA: Diffusion Models With Neural Cellular Automata for Skin Lesion Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work proposes three NCA-based improvements for diffusion-based medical image segmentation. |
Avni Mittal; John Kalkhof; Anirban Mukhopadhyay; Arnav Bhavsar; | arxiv-cs.CV | 2025-01-05 |
194 | IAM: Enhancing RGB-D Instance Segmentation with New Benchmarks Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: There is a relative scarcity of instance-level RGB-D segmentation datasets, which restricts current methods to broad category distinctions rather than fully capturing the fine-grained details required for recognizing individual objects. To bridge this gap, we introduce three RGB-D instance segmentation benchmarks, distinguished at the instance level. |
Aecheon Jung; Soyun Choi; Junhong Min; Sungeun Hong; | arxiv-cs.CV | 2025-01-03 |
195 | Tuning A SAM-Based Model With Multicognitive Visual Adapter to Remote Sensing Instance Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The segment anything model (SAM), a foundational model designed for promptable segmentation tasks, demonstrates exceptional generalization capabilities, making it highly promising … |
Linghao Zheng; Xinyang Pu; Su Zhang; Feng Xu; | IEEE Journal of Selected Topics in Applied Earth … | 2025-01-01 |
196 | Geographical Scenario Knowledge-Informed Graph Structure Attention for Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Deep learning methods, renowned for their ability to discern physical features from images, are frequently used in the semantic segmentation of remote sensing images. However, … |
HUILING ZHAO et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2025-01-01 |
197 | Tissue Segmentation for Traumatic Brain Injury Based on Multimodal MRI Image Fusion-semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
YAO XU et. al. | Biomed. Signal Process. Control. | 2025-01-01 |
198 | FGAseg: Fine-Grained Pixel-Text Alignment for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: As a result, information extracted directly from VLMs can’t meet the requirements of segmentation tasks. To address this limitation, we propose FGAseg, a model designed for fine-grained pixel-text alignment and category boundary supplementation. |
Bingyu Li; Da Zhang; Zhiyuan Zhao; Junyu Gao; Xuelong Li; | arxiv-cs.CV | 2025-01-01 |
199 | A Generalized Geodesic Voting Framework for Interactive Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this article, we introduce a new variational model for addressing the image segmentation problem of minimal user interaction. The proposed variational segmentation model, … |
SHUWANG ZHOU et. al. | IEEE Transactions on Instrumentation and Measurement | 2025-01-01 |
200 | PanoSLAM: Panoptic 3D Scene Reconstruction Via Gaussian SLAM Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce PanoSLAM, the first SLAM system to integrate geometric reconstruction, 3D semantic segmentation, and 3D instance segmentation within a unified framework. |
RUNNAN CHEN et. al. | arxiv-cs.CV | 2024-12-31 |
201 | OVGaussian: Generalizable 3D Gaussian Segmentation with Open Vocabularies Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose \textbf{OVGaussian}, a generalizable \textbf{O}pen-\textbf{V}ocabulary 3D semantic segmentation framework based on the 3D \textbf{Gaussian} representation. |
RUNNAN CHEN et. al. | arxiv-cs.CV | 2024-12-31 |
202 | LiDAR-Camera Fusion for Video Panoptic Segmentation Without Video Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work seeks to introduce a feature fusion module that enhances PS and VPS by fusing LiDAR and image data for autonomous vehicles. |
Fardin Ayar; Ehsan Javanmardi; Manabu Tsukada; Mahdi Javanmardi; Mohammad Rahmati; | arxiv-cs.CV | 2024-12-30 |
203 | Dual-Space Augmented Intrinsic-LoRA for Wind Turbine Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite advancements in large universal vision models, these models often underperform in domain-specific tasks like WTB segmentation. To address this, we extend Intrinsic LoRA for image segmentation, and propose a novel dual-space augmentation strategy that integrates both image-level and latent-space augmentations. |
Shubh Singhal; Raül Pérez-Gonzalo; Andreas Espersen; Antonio Agudo; | arxiv-cs.CV | 2024-12-30 |
204 | HisynSeg: Weakly-Supervised Histopathological Image Segmentation Via Image-Mixing Synthesis and Consistency Regularization Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, CAM-based methods are prone to suffer from under-activation and over-activation issues, leading to poor segmentation performance. To address this problem, we propose a novel weakly-supervised semantic segmentation framework for histopathological images based on image-mixing synthesis and consistency regularization, dubbed HisynSeg. |
Zijie Fang; Yifeng Wang; Peizhang Xie; Zhi Wang; Yongbing Zhang; | arxiv-cs.CV | 2024-12-30 |
205 | LangSurf: Language-Embedded Surface Gaussians for 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we propose a Language-Embedded Surface Field (LangSurf), which accurately aligns the 3D language fields with the surface of objects, facilitating precise 2D and 3D segmentation with text query, widely expanding the downstream tasks such as removal and editing. |
HAO LI et. al. | arxiv-cs.CV | 2024-12-23 |
206 | Multi-Scale Foreground-Background Confidence for Out-of-Distribution Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present a multi-scale OOD segmentation method that exploits the confidence information of a foreground-background segmentation model. |
Samuel Marschall; Kira Maag; | arxiv-cs.CV | 2024-12-22 |
207 | Imaging Segmentation of Brain Tumors Based on The Modified U-net Method Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Brain tumor segmentation in medical image analysis is a challenging task. Deep learning techniques have recently shown promise in resolving a variety of computer vision problems, … |
Yajie Zhang; Hea Choon Ngo; Yifan Zhang; Noor Fazilla Abd Yusof; Xiaohan Wang; | Inf. Technol. Control. | 2024-12-21 |
208 | Leveraging Contrastive Learning for Semantic Segmentation with Consistent Labels Across Varying Appearances Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces a novel synthetic dataset that captures urban scenes under a variety of weather conditions, providing pixel-perfect, ground-truth-aligned images to facilitate effective feature alignment across domains. |
JAVIER MONTALVO et. al. | arxiv-cs.CV | 2024-12-21 |
209 | VerSe: Integrating Multiple Queries As Prompts for Versatile Cardiac MRI Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, they are semi-automatic and inefficient, due to their reliance on click-based prompts, especially for 3D cardiac MRI volumes. To address these limitations, we propose VerSe, a Versatile Segmentation framework to unify automatic and interactive segmentation through mutiple queries. |
BANGWEI GUO et. al. | arxiv-cs.CV | 2024-12-20 |
210 | Spike2Former: Efficient Spiking Transformer for High-performance Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We set a new state-of-the-art for SNNs in various semantic segmentation datasets, with a significant improvement of +12.7% mIoU and 5.0 efficiency on ADE20K, +14.3% mIoU and 5.2 efficiency on VOC2012, and +9.1% mIoU and 6.6 efficiency on CityScapes. |
ZHENXIN LEI et. al. | arxiv-cs.CV | 2024-12-19 |
211 | Language-guided Medical Image Segmentation with Target-informed Multi-level Contrastive Alignments Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we propose a language-guided segmentation network with Target-informed Multi-level Contrastive Alignments (TMCA). |
MINGJIAN LI et. al. | arxiv-cs.CV | 2024-12-18 |
212 | Incorporating Feature Pyramid Tokenization and Open Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we treat segmentation as tokenizing pixels and study a united perceptual and semantic token compression for all granular understanding and consequently facilitate open vocabulary semantic segmentation. |
Jianyu Zhang; Li Zhang; Shijian Li; | arxiv-cs.CV | 2024-12-18 |
213 | SEG-SAM: Semantic-Guided SAM for Unified Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: SAM has shown promising binary segmentation performance in natural domains, however, transferring it to the medical domain remains challenging, as medical images often possess substantial inter-category overlaps. To address this, we propose the SEmantic-Guided SAM (SEG-SAM), a unified medical segmentation model that incorporates semantic medical knowledge to enhance medical segmentation performance. |
SHUANGPING HUANG et. al. | arxiv-cs.CV | 2024-12-17 |
214 | Open-World Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this article, we tackle the problem of open-world panoptic segmentation, i.e., the task of discovering new semantic categories and new object instances at test time, while enforcing consistency among the categories that we incrementally discover. |
Matteo Sodano; Federico Magistri; Jens Behley; Cyrill Stachniss; | arxiv-cs.CV | 2024-12-17 |
215 | Classification Drives Geographic Bias in Street Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We examined if instance segmentation models trained on European driving scenes (Eurocentric models) are geo-biased. |
Rahul Nair; Gabriel Tseng; Esther Rolf; Bhanu Tokas; Hannah Kerner; | arxiv-cs.CV | 2024-12-15 |
216 | DCSEG: Decoupled 3D Open-Set Segmentation Using Gaussian Splatting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present a decoupled 3D segmentation pipeline to ensure modularity and adaptability to novel 3D representations as well as semantic segmentation foundation models. |
Luis Wiedmann; Luca Wiehe; David Rozenberszki; | arxiv-cs.CV | 2024-12-14 |
217 | CFSSeg: Closed-Form Solution for Class-Incremental Semantic Segmentation of 2D Images and 3D Point Clouds Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, stochastic gradient descent-based approaches inevitably update the model’s weights for past knowledge, leading to catastrophic forgetting, a problem exacerbated by pixel/point-level granularity. To address these challenges, we propose CFSSeg, a novel exemplar-free approach that leverages a closed-form solution, offering a practical and theoretically grounded solution for continual semantic segmentation tasks. |
JIAXU LI et. al. | arxiv-cs.CV | 2024-12-14 |
218 | SuperGSeg: Open-Vocabulary 3D Segmentation with Structured Super-Gaussians Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, We introduce SuperGSeg, a novel approach that fosters cohesive, context-aware scene representation by disentangling segmentation and language field distillation. |
SIYUN LIANG et. al. | arxiv-cs.CV | 2024-12-13 |
219 | SPT: Sequence Prompt Transformer for Interactive Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing methods typically process one image at a time, failing to consider the sequential nature of the images. To overcome this limitation, we propose a novel method called Sequence Prompt Transformer (SPT), the first to utilize sequential image information for interactive segmentation. |
Senlin Cheng; Haopeng Sun; | arxiv-cs.CV | 2024-12-13 |
220 | FAMNet: Frequency-aware Matching Network for Cross-domain Few-shot Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Inspired by the characteristics of frequency domain similarity across different domains, we propose a Frequency-aware Matching Network (FAMNet), which includes two key components: a Frequency-aware Matching (FAM) module and a Multi-Spectral Fusion (MSF) module. |
Yuntian Bo; Yazhou Zhu; Lunbo Li; Haofeng Zhang; | arxiv-cs.CV | 2024-12-12 |
221 | A Deep Semantic Segmentation Network with Semantic and Contextual Refinements Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is a fundamental task in multimedia processing, which can be used for analyzing, understanding, editing contents of images and videos, among others. To … |
ZHIYAN WANG et. al. | ArXiv | 2024-12-11 |
222 | GCUNet: A GNN-Based Contextual Learning Network for Tertiary Lymphoid Structure Semantic Segmentation in Whole Slide Image Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, this prevents the model from accessing information outside of the patches, limiting the performance. To address this issue, we propose GCUNet, a GNN-based contextual learning network for TLS semantic segmentation. |
Lei Su; Yang Du; | arxiv-cs.CV | 2024-12-08 |
223 | Appearance Matching Adapter for Exemplar-based Semantic Image Synthesis In-the-Wild Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: During inference, we introduce an automated exemplar retrieval method for selecting exemplar image-segmentation pairs efficiently. |
SIYOON JIN et. al. | arxiv-cs.CV | 2024-12-04 |
224 | Point-GR: Graph Residual Point Cloud Network for 3D Object Classification and Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents Point-GR, a novel deep learning architecture designed explicitly to transform unordered raw point clouds into higher dimensions while preserving local geometric features. |
Md Meraz; Md Afzal Ansari; Mohammed Javed; Pavan Chakraborty; | arxiv-cs.CV | 2024-12-04 |
225 | SJTU:Spatial Judgments in Multimodal Models Towards Unified Segmentation Through Coordinate Detection Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper introduces SJTU: Spatial Judgments in multimodal models – Towards Unified segmentation through coordinate detection, a novel framework that leverages spatial coordinate understanding to bridge vision-language interaction and precise segmentation, enabling accurate target identification through natural language instructions. |
Joongwon Chae; Zhenyu Wang; Peiwu Qin; | arxiv-cs.CV | 2024-12-03 |
226 | CMCD-Net:Unsupervised Domain Adaptation with Contrastive Learning for Cross-modality and Cross-disease Brain Lesion Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Unsupervised domain adaptation (UDA), as a robust transfer learning strategy that utilizes source domain richly labeled data to solve the target domain unlabeled semantic … |
Xuexian Chen; Yanjun Peng; | 2024 IEEE International Conference on Bioinformatics and … | 2024-12-03 |
227 | Mamba-SAM: An Adaption Framework for Accurate Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The Segment Anything Model (SAM) shows strong performance in natural images but struggles with medical images due to a significant semantic gap and characteristics like … |
YIFENG WU et. al. | 2024 IEEE International Conference on Bioinformatics and … | 2024-12-03 |
228 | Semantic Segmentation Prior for Diffusion-Based Real-World Super-Resolution Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Additionally, the same region may have a strong response to more than one prompt and it will lead to semantic ambiguity for image super-resolution. To alleviate the above two issues, in this paper, we propose to consider semantic segmentation as an additional control condition into diffusion-based image super-resolution. |
JIAHUA XIAO et. al. | arxiv-cs.CV | 2024-12-03 |
229 | RailEINet:A Novel Scene Segmentation Network for Automatic Train Operation Based on Feature Alignment Related Papers Related Patents Related Grants Related Venues Related Experts View |
Tao Sun; Baoqing Guo; Tao Ruan; Xingfang Zhou; Dingyuan Bai; | Eng. Appl. Artif. Intell. | 2024-12-01 |
230 | Advancing Perturbation Space Expansion Based on Information Fusion for Semi-supervised Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Liang Zhou; Keyi Duan; Jinkun Dai; Yuanxin Ye; | Inf. Fusion | 2024-12-01 |
231 | Density-aware Global-Local Attention Network for Point Cloud Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The point cloud data collected in real scenes often contain small objects and categories with small sample sizes, which are difficult to handle by existing networks. In this regard, we propose a point cloud segmentation network that fuses local attention based on density perception with global attention. |
Chade Li; Pengju Zhang; Yihong Wu; | arxiv-cs.CV | 2024-11-30 |
232 | GradiSeg: Gradient-Guided Gaussian Segmentation with Enhanced 3D Boundary Precision Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While 3D Gaussian Splatting enables high-quality real-time rendering, existing Gaussian-based frameworks for 3D semantic segmentation still face significant challenges in boundary recognition accuracy. To address this, we propose a novel 3DGS-based framework named GradiSeg, incorporating Identity Encoding to construct a deeper semantic understanding of scenes. |
ZEHAO LI et. al. | arxiv-cs.CV | 2024-11-30 |
233 | LMSeg: Unleashing The Power of Large-Scale Models for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose to alleviate the above-mentioned issues by leveraging multiple large-scale models to enhance the alignment between fine-grained visual features and enriched linguistic features. |
HUADONG TANG et. al. | arxiv-cs.CV | 2024-11-30 |
234 | Bootstraping Clustering of Gaussians for View-consistent 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present FreeGS, an unsupervised semantic-embedded 3DGS framework that achieves view-consistent 3D scene understanding without the need for 2D labels. |
WENBO ZHANG et. al. | arxiv-cs.CV | 2024-11-29 |
235 | Efficient Track Anything Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The high computation complexity of multistage image encoder and memory module has limited its applications in real-world tasks, e.g., video object segmentation on mobile devices. To address this limitation, we propose EfficientTAMs, lightweight track anything models that produce high-quality results with low latency and model size. |
YUNYANG XIONG et. al. | arxiv-cs.CV | 2024-11-28 |
236 | Semantic Image Segmentation of Cell Volumes Using 3D U-Net Convolutional Neural Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Image segmentation is the process of assigning a label to every pixel in an image such that pixels with the same label share certain characteristics. Traditionally image … |
LAZAR DASIC et. al. | 2024 IEEE 24th International Conference on Bioinformatics … | 2024-11-27 |
237 | Box for Mask and Mask for Box: Weak Losses for Multi-task Partially Supervised Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose Box-for-Mask and Mask-for-Box strategies, and their combination BoMBo, to distil necessary information from one task annotations to train the other. |
Hoàng-Ân Lê; Paul Berg; Minh-Tan Pham; | arxiv-cs.CV | 2024-11-26 |
238 | A Performance Increment Strategy for Semantic Segmentation of Low-Resolution Images from Damaged Roads Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: A representative dataset for emerging countries consists of low-resolution images of poorly maintained roads and includes labels of damage classes; in this scenario, three challenges arise: objects with few pixels, objects with undefined shapes, and highly underrepresented classes. To tackle these challenges, this work proposes the Performance Increment Strategy for Semantic Segmentation (PISSS) as a methodology of 14 training experiments to boost performance. |
Rafael S. Toledo; Cristiano S. Oliveira; Vitor H. T. Oliveira; Eric A. Antonelo; Aldo von Wangenheim; | arxiv-cs.CV | 2024-11-25 |
239 | Effective SAM Combination for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose ESC-Net, a novel one-stage open-vocabulary segmentation model that leverages the SAM decoder blocks for class-agnostic segmentation within an efficient inference framework. |
MINHYEOK LEE et. al. | arxiv-cs.CV | 2024-11-21 |
240 | CLIPer: Hierarchically Improving Spatial Representation of CLIP for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a novel hierarchical framework, named CLIPer, that hierarchically improves spatial representation of CLIP. |
Lin Sun; Jiale Cao; Jin Xie; Xiaoheng Jiang; Yanwei Pang; | arxiv-cs.CV | 2024-11-20 |
241 | BelHouse3D: A Benchmark Dataset for Assessing Occlusion Robustness in 3D Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing 3D benchmarking datasets typically evaluate deep learning models under the assumption that training and test data are independently and identically distributed (IID), which affects the models’ usability for real-world point cloud segmentation. To address these challenges, we introduce the BelHouse3D dataset, a new synthetic point cloud dataset designed for 3D indoor scene semantic segmentation. |
Umamaheswaran Raman Kumar; Abdur Razzaq Fayjie; Jurgen Hannaert; Patrick Vandewalle; | arxiv-cs.CV | 2024-11-20 |
242 | SAM Carries The Burden: A Semi-Supervised Approach Refining Pseudo Labels for Medical Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The recently introduced Segment Anything Model (SAM) enables prompt-based segmentation and offers zero-shot generalization to unfamiliar objects. |
RON KEUTH et. al. | arxiv-cs.CV | 2024-11-19 |
243 | Calibrated and Efficient Sampling-Free Confidence Estimation for LiDAR Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce a sampling-free approach for estimating well-calibrated confidence values for classification tasks, achieving alignment with true classification accuracy and significantly reducing inference time compared to sampling-based methods. |
Hanieh Shojaei Miandashti; Qianqian Zou; Claus Brenner; | arxiv-cs.CV | 2024-11-18 |
244 | TP-UNet: Temporal Prompt Guided UNet for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To efficiently integrate temporal information, we propose TP-UNet that utilizes temporal prompts, encompassing organ-construction relationships, to guide the segmentation UNet model. |
Ranmin Wang; Limin Zhuang; Hongkun Chen; Boyan Xu; Ruichu Cai; | arxiv-cs.CV | 2024-11-18 |
245 | ITACLIP: Boosting Training-Free Semantic Segmentation with Image, Text, and Architectural Enhancements Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this study, we enhance the semantic segmentation performance of CLIP by introducing new modules and modifications: 1) architectural changes in the last layer of ViT and the incorporation of attention maps from the middle layers with the last layer, 2) Image Engineering: applying data augmentations to enrich input image representations, and 3) using Large Language Models (LLMs) to generate definitions and synonyms for each class name to leverage CLIP’s open-vocabulary capabilities. |
M. Arda Aydın; Efe Mert Çırpar; Elvin Abdinli; Gozde Unal; Yusuf H. Sahin; | arxiv-cs.CV | 2024-11-18 |
246 | CorrCLIP: Reconstructing Correlations in CLIP with Off-the-Shelf Foundation Models for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce CorrCLIP, a training-free approach for open-vocabulary semantic segmentation, which reconstructs significantly coherent inter-patch correlations utilizing foundation models. |
Dengke Zhang; Fagui Liu; Quan Tang; | arxiv-cs.CV | 2024-11-15 |
247 | ULTra: Unveiling Latent Token Interpretability in Transformer-Based Understanding and Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, their complexity makes latent token representations difficult to interpret. We introduce ULTra, a framework for interpreting Transformer embeddings and uncovering meaningful semantic patterns within them. |
Hesam Hosseini; Ghazal Hosseini Mighan; Amirabbas Afzali; Sajjad Amini; Amir Houmansadr; | arxiv-cs.CV | 2024-11-15 |
248 | Harnessing Vision Foundation Models for High-Performance, Training-Free Open Vocabulary Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Specifically, we introduce Trident, a training-free framework that first splices features extracted by CLIP and DINO from sub-images, then leverages SAM’s encoder to create a correlation matrix for global aggregation, enabling a broadened receptive field for effective segmentation. |
Yuheng Shi; Minjing Dong; Chang Xu; | arxiv-cs.CV | 2024-11-14 |
249 | Slender Object Scene Segmentation in Remote Sensing Image Based on Learnable Morphological Skeleton with Segment Anything Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a new approach that integrates learnable morphological skeleton prior into deep neural networks using the variational method. |
JUN XIE et. al. | arxiv-cs.CV | 2024-11-13 |
250 | Heuristical Comparison of Vision Transformers Against Convolutional Neural Networks for Semantic Segmentation on Remote Sensing Imagery Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Vision Transformers (ViT) have recently brought a new wave of research in the field of computer vision. These models have performed particularly well in image classification and segmentation. |
Ashim Dahal; Saydul Akbar Murad; Nick Rahimi; | arxiv-cs.CV | 2024-11-13 |
251 | Zero-shot Capability of SAM-family Models for Bone Segmentation in CT Scans Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The Segment Anything Model (SAM) and similar models build a family of promptable foundation models (FMs) for image and video segmentation. |
Caroline Magg; Hoel Kervadec; Clara I. Sánchez; | arxiv-cs.CV | 2024-11-13 |
252 | Superpixel Segmentation: A Long-Lasting Ill-Posed Problem Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Concurrently, recent deep learning-based superpixel methods mainly focus on the object segmentation task at the expense of regularity. |
Rémi Giraud; Michaël Clément; | arxiv-cs.CV | 2024-11-10 |
253 | ZAHA: Introducing The Level of Facade Generalization and The Large-Scale Point Cloud Facade Semantic Segmentation Benchmark Dataset Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In ZAHA, we introduce Level of Facade Generalization (LoFG), novel hierarchical facade classes designed based on international urban modeling standards, ensuring compatibility with real-world challenging classes and uniform methods’ comparison. |
OLAF WYSOCKI et. al. | arxiv-cs.CV | 2024-11-07 |
254 | OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address the task, we propose a plug-and-play approach termed OLAF. |
Pranav Gupta; Rishubh Singh; Pradeep Shenoy; Ravikiran Sarvadevabhatla; | arxiv-cs.CV | 2024-11-05 |
255 | Rethinking Decoders for Transformer-based Semantic Segmentation: A Compression Perspective Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we argue that there are fundamental connections between semantic segmentation and compression, especially between the Transformer decoders and Principal Component Analysis (PCA). |
Qishuai Wen; Chun-Guang Li; | arxiv-cs.CV | 2024-11-05 |
256 | PreCM: The Padding-based Rotation Equivariant Convolution Mode for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Subsequently, we mathematically design a padding-based rotation equivariant convolution mode (PreCM), which is not only applicable to multi-scale images and convolutional kernels but can also serve as a replacement component for various types of convolutions, such as dilated convolutions, transposed convolutions, and asymmetric convolution. |
Xinyu Xu; Huazhen Liu; Tao Zhang; Huilin Xiong; Wenxian Yu; | arxiv-cs.CV | 2024-11-03 |
257 | Enhanced Scene Understanding and Situation Awareness for Autonomous Vehicles Based on Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Accurate visual perception and comprehensive scene understanding are critical for the safety and reliability of autonomous vehicles (AVs). Nevertheless, the efficacy of visual … |
YIYUE ZHAO et. al. | IEEE Transactions on Systems, Man, and Cybernetics: Systems | 2024-11-01 |
258 | Panoramic Image Semantic Segmentation Using Channel Attention-based HarDNet and Distorted Boundary Learning Related Papers Related Patents Related Grants Related Venues Related Experts View |
Xun Jin; Chongyang Zhu; De Li; | Multim. Syst. | 2024-11-01 |
259 | Temporal Consistency for RGB-Thermal Data-Based Semantic Scene Understanding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic scene understanding is a fundamental capability for autonomous vehicles. Under challenging lighting conditions, such as nighttime and on-coming headlights, the semantic … |
Haotian Li; Henry K. Chu; Yuxiang Sun; | IEEE Robotics and Automation Letters | 2024-11-01 |
260 | Cityscape-Adverse: Benchmarking Robustness of Semantic Segmentation with Realistic Scene Modifications Via Diffusion-Based Image Editing Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce Cityscape-Adverse, a benchmark that employs diffusion-based image editing to simulate eight adverse conditions, including variations in weather, lighting, and seasons, while preserving the original semantic labels. |
NAUFAL SURYANTO et. al. | arxiv-cs.CV | 2024-11-01 |
261 | Cross-modal Semantic Segmentation for Indoor Environmental Perception Using Single-chip Millimeter-wave Radar Raw Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To efficiently obtain high-quality labels, an automatic label generation method utilizing LiDAR point clouds and occupancy grid maps is introduced. |
Hairuo Hu; Haiyong Cong; Zhuyu Shao; Yubo Bi; Jinghao Liu; | arxiv-cs.CV | 2024-11-01 |
262 | Image Synthesis with Class-Aware Semantic Diffusion Models for Surgical Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In response, we propose the Class-Aware Semantic Diffusion Model (CASDM), a novel approach which utilizes segmentation maps as conditions for image synthesis to tackle data scarcity and imbalance. |
Yihang Zhou; Rebecca Towning; Zaid Awad; Stamatia Giannarou; | arxiv-cs.CV | 2024-10-31 |
263 | S3PT: Scene Semantics and Structure Guided Clustering to Boost Self-Supervised Pre-Training for Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose S3PT a novel scene semantics and structure guided clustering to provide more scene-consistent objectives for self-supervised training. |
MACIEJ K. WOZNIAK et. al. | arxiv-cs.CV | 2024-10-30 |
264 | Text2Seg: Zero-shot Remote Sensing Image Semantic Segmentation Via Text-Guided Visual Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View |
JIELU ZHANG et. al. | GeoAI@SIGSPATIAL | 2024-10-29 |
265 | LDCNet: Long-Distance Context Modeling for Large-Scale 3D Point Cloud Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Shoutong Luo; Zhengxing Sun; Yi Wang; Yunhan Sun; Chendi Zhu; | ACM Multimedia | 2024-10-28 |
266 | Every Component Counts: Rethinking The Measure of Success for Medical Semantic Segmentation in Multi-Instance Segmentation Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present Connected-Component~(CC)-Metrics, a novel semantic segmentation evaluation protocol, targeted to align existing semantic segmentation metrics to a multi-instance detection scenario in which each connected component matters. |
ALEXANDER JAUS et. al. | arxiv-cs.CV | 2024-10-24 |
267 | Semantic Segmentation and Scene Reconstruction of RGB-D Image Frames: An End-to-End Modular Pipeline for Robotic Applications Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a novel end-to-end modular pipeline that integrates state-of-the-art semantic segmentation, human tracking, point-cloud fusion, and scene reconstruction. |
ZHIWU ZHENG et. al. | arxiv-cs.CV | 2024-10-23 |
268 | Surgical Scene Segmentation By Transformer With Asymmetric Feature Enhancement Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Secondly, the specific characteristics of anatomy and instruments are not specifically modeled. To tackle the above challenges, we propose a novel Transformer-based framework with an Asymmetric Feature Enhancement module (TAFE), which enhances local information and then actively fuses the improved feature pyramid into the embeddings from transformer encoders by a multi-scale interaction attention strategy. |
Cheng Yuan; Yutong Ban; | arxiv-cs.CV | 2024-10-23 |
269 | Multi Kernel Estimation Based Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents a novel approach for multi-kernel estimation by enhancing the KernelGAN algorithm, which traditionally estimates a single kernel for the entire image. |
Haim Goldfisher; Asaf Yekutiel; | arxiv-cs.CV | 2024-10-22 |
270 | TICNet: Three-Branch Real-Time Semantic Segmentation Network with Intensive Compensation of Railway Track Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the rapid development of railway traffic system, real-time semantic segmentation plays a crucial role in railway track scene monitoring. However, most of the existing methods … |
Yiwen Bai; Lu Yang; Lei Zhang; Yajing Song; | 2024 5th International Conference on Machine Learning and … | 2024-10-18 |
271 | Railway LiDAR Semantic Segmentation Based on Intelligent Semi-automated Data Annotation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Thus, we propose an approach for a point-wise 3D semantic segmentation based on the 2DPass network architecture using scans and images jointly. |
Florian Wulff; Bernd Schaeufele; Julian Pfeifer; Ilja Radusch; | arxiv-cs.CV | 2024-10-17 |
272 | SemSim: Revisiting Weak-to-Strong Consistency from A Semantic Similarity Perspective for Semi-supervised Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, two key limitations still persist, impeding its efficient adaptation: (1) the neglect of contextual dependencies results in inconsistent predictions for similar semantic features, leading to incomplete object segmentation; (2) the lack of exploitation of semantic similarity between labeled and unlabeled data induces considerable class-distribution discrepancy. To address these limitations, we propose a novel semi-supervised framework based on FixMatch, named SemSim, powered by two appealing designs from semantic similarity perspective: (1) rectifying pixel-wise prediction by reasoning about the intra-image pair-wise affinity map, thus integrating contextual dependencies explicitly into the final prediction; (2) bridging labeled and unlabeled data via a feature querying mechanism for compact class representation learning, which fully considers cross-image anatomical similarities. |
SHIAO XIE et. al. | arxiv-cs.CV | 2024-10-17 |
273 | Adaptive Prompt Learning with SAM for Few-shot Scanning Probe Microscope Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Code and dataset used in this study will be made available upon acceptance. |
YAO SHEN et. al. | arxiv-cs.CV | 2024-10-16 |
274 | RClicks: Realistic Click Simulation for Benchmarking Interactive Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Using our model and dataset, we propose RClicks benchmark for a comprehensive comparison of existing interactive segmentation methods on realistic clicks. |
ANTON ANTONOV et. al. | arxiv-cs.CV | 2024-10-15 |
275 | Real-Time Semantic Segmentation in Natural Environments with SAM-assisted Sim-to-Real Domain Transfer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation plays a pivotal role in many robotic applications requiring high-level scene understanding, such as smart farming, where the precise identification of trees … |
Han Wang; R. Mascaro; M. Chli; L. Teixeira; | 2024 IEEE/RSJ International Conference on Intelligent … | 2024-10-14 |
276 | LKASeg:Remote-Sensing Image Semantic Segmentation with Large Kernel Attention and Full-Scale Skip Connections Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a remote-sensing image semantic segmentation network named LKASeg, which combines Large Kernel Attention(LSKA) and Full-Scale Skip Connections(FSC). |
XUEZHI XIANG et. al. | arxiv-cs.CV | 2024-10-14 |
277 | Uncertainty Estimation and Out-of-Distribution Detection for LiDAR Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a method to distinguish in-distribution (ID) from OOD samples and quantify both epistemic and aleatoric uncertainties using the feature space of a single deterministic model. |
Hanieh Shojaei; Qianqian Zou; Max Mehltretter; | arxiv-cs.LG | 2024-10-11 |
278 | VideoSAM: Open-World Video Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we introduce VideoSAM, an end-to-end framework designed to address these challenges by improving object tracking and segmentation consistency in dynamic environments. |
PINXUE GUO et. al. | arxiv-cs.CV | 2024-10-11 |
279 | Are We Ready for Real-Time LiDAR Semantic Segmentation in Autonomous Driving? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: These characteristics hinder the real-time semantic analysis, particularly on resource-constrained hardware architectures that constitute the main computational components of numerous robotic applications. Therefore, in this paper, we investigate various 3D semantic segmentation methodologies and analyze their performance and capabilities for resource-constrained inference on embedded NVIDIA Jetson platforms. |
Samir Abou Haidar; Alexandre Chariot; Mehdi Darouich; Cyril Joly; Jean-Emmanuel Deschaud; | arxiv-cs.RO | 2024-10-10 |
280 | Data Augmentation for Surgical Scene Segmentation with Anatomy-Aware Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce a multi-stage approach using diffusion models to generate multi-class surgical datasets with annotations. |
Danush Kumar Venkatesh; Dominik Rivoir; Micha Pfeiffer; Fiona Kolbinger; Stefanie Speidel; | arxiv-cs.CV | 2024-10-10 |
281 | Shift and Matching Queries for Video Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a method to extend a query-based image segmentation model to video using feature shift and query matching. |
Tsubasa Mizuno; Toru Tamaki; | arxiv-cs.CV | 2024-10-10 |
282 | Evaluating The Impact of Point Cloud Colorization on Semantic Segmentation Accuracy Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel statistical approach to evaluate the impact of inaccurate RGB information on image-based point cloud segmentation. |
Qinfeng Zhu; Jiaze Cao; Yuanzhi Cai; Lei Fan; | arxiv-cs.CV | 2024-10-09 |
283 | Rethinking The Evaluation of Visible and Infrared Image Fusion Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper proposes a Segmentation-oriented Evaluation Approach (SEA) to assess VIF methods by incorporating the semantic segmentation task and leveraging segmentation labels available in latest VIF datasets. |
Dayan Guan; Yixuan Wu; Tianzhu Liu; Alex C. Kot; Yanfeng Gu; | arxiv-cs.CV | 2024-10-09 |
284 | Open-RGBT: Open-vocabulary RGB-T Zero-shot Semantic Segmentation in Open-world Environments Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, these models face challenges in dealing with intricate scenes, primarily due to the heterogeneity between RGB and thermal modalities. To address this gap, we present Open-RGBT, a novel open-vocabulary RGB-T semantic segmentation model. |
Meng Yu; Luojie Yang; Xunjie He; Yi Yang; Yufeng Yue; | arxiv-cs.CV | 2024-10-09 |
285 | Scribbles for All: Benchmarking Scribble Supervised Segmentation Across Datasets Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce *Scribbles for All*, a label and training data generation algorithm for semantic segmentation trained on scribble labels. |
Wolfgang Boettcher; Lukas Hoyer; Ozan Unal; Jan Eric Lenssen; Bernt Schiele; | nips | 2024-10-07 |
286 | Toward Real Ultra Image Segmentation: Leveraging Surrounding Context to Cultivate General Segmentation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing ultra image segmentation methods suffer from two major challenges, namely the generalization issue (i.e. they lack the stability and generality of standard segmentation models, as they are tailored to specific datasets), and the architectural issue (i.e. they are incompatible with real-world ultra image scenes, as they compromise between image size and computing resources). To tackle these issues, we revisit the classic sliding inference framework, upon which we propose a Surrounding Guided Segmentation framework (SGNet) for ultra image segmentation. |
Sai Wang; Yutian Lin; Yu Wu; Bo Du; | nips | 2024-10-07 |
287 | A Unified Framework for 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose UniSeg3D, a unified 3D segmentation framework that achieves panoptic, semantic, instance, interactive, referring, and open-vocabulary semantic segmentation tasks within a single model. |
WEI XU et. al. | nips | 2024-10-07 |
288 | Geometric Exploitation for Indoor Panoramic Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Unlike previous works, in this paper, we propose a novel approach for semantic segmentation of panoramic images. |
Duc Cao Dinh; Seok Kim; Kyusung Cho; | nips | 2024-10-07 |
289 | Zero-Shot Image Segmentation Via Recursive Normalized Cut on Diffusion Features Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we consider a diffusion UNet encoder as a foundation vision encoder and we introduce DiffCut, an unsupervised zero-shot segmentation method that solely harnesses the output features from the final self-attention block. |
Paul Couairon; Mustafa Shukor; Jean-Emmanuel HAUGEARD; Matthieu Cord; Nicolas THOME; | nips | 2024-10-07 |
290 | Unsupervised Hierarchy-Agnostic Segmentation: Parsing Semantic Image Structure Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a novel algebraic methodology for unsupervised image segmentation. |
Simone Rossetti; fiora pirri; | nips | 2024-10-07 |
291 | Relationship Prompt Learning Is Enough for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Prompt learning offers a direct and parameter-efficient approach, yet it falls short in guiding VLM for pixel-level visual localization. Therefore, we propose relationship prompt module (RPM), which generates relationship prompt that directs VLM to extract pixel-level semantic embeddings suitable for OVSS. |
li Jiahao; Yanyun Qu; Yuan Xie; Yang Lu; | nips | 2024-10-07 |
292 | Towards Open-Vocabulary Semantic Segmentation Without Semantic Labels Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a novel method, MCLIP, to adapt the CLIP image encoder for pixel-level understanding by guiding the model on where, which is achieved using unlabeled images and masks generated from vision foundation models such as SAM and DINO. |
HEESEONG SHIN et. al. | nips | 2024-10-07 |
293 | One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce VideoLISA, a video-based multimodal large language model designed to tackle the problem of language-instructed reasoning segmentation in videos. |
ZECHEN BAI et. al. | nips | 2024-10-07 |
294 | DeiSAM: Segment Anything with Deictic Prompting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, deep learning approaches cannot reliably interpret such deictic representations due to their lack of reasoning capabilities in complex scenarios. To remedy this issue, we propose DeiSAM — a combination of large pre-trained neural networks with differentiable logic reasoners — for deictic promptable segmentation. |
HIKARU SHINDO et. al. | nips | 2024-10-07 |
295 | AdaptDiff: Cross-Modality Domain Adaptation Via Weak Conditional Semantic Diffusion for Retinal Vessel Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, despite its promise, deep learning has many challenges in practice due to its inability to effectively transition to unseen domains, caused by the inherent data distribution shift and the lack of manual annotations to guide domain adaptation. To tackle this problem, we present an unsupervised domain adaptation (UDA) method named AdaptDiff that enables a retinal vessel segmentation network trained on fundus photography (FP) to produce satisfactory results on unseen modalities (e.g., OCT-A) without any manual labels. |
DEWEI HU et. al. | arxiv-cs.CV | 2024-10-06 |
296 | Unleashing The Potential of The Diffusion Model in Few-shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Our initial focus lies in understanding how to facilitate interaction between the query image and the support image, resulting in the proposal of a KV fusion method within the self-attention framework. |
MUZHI ZHU et. al. | arxiv-cs.CV | 2024-10-03 |
297 | Annotated Dataset for Training Cloud Segmentation Neural Networks Using High-Resolution Satellite Remote Sensing Imagery Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The integration of satellite data with deep learning has revolutionized various tasks in remote sensing, including classification, object detection, and semantic segmentation. … |
Mingyuan He; Jie Zhang; Yang He; Xinjie Zuo; Zebin Gao; | Remote. Sens. | 2024-10-02 |
298 | Semantic Segmentation of Unmanned Aerial Vehicle Remote Sensing Images Using SegFormer Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper evaluates the effectiveness and efficiency of SegFormer, a semantic segmentation framework, for the semantic segmentation of UAV images. |
Vlatko Spasev; Ivica Dimitrovski; Ivan Chorbev; Ivan Kitanovski; | arxiv-cs.CV | 2024-10-01 |
299 | Deep Multimodal Fusion for Semantic Segmentation of Remote Sensing Earth Observation Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a late fusion deep learning model (LF-DLM) for semantic segmentation that leverages the complementary strengths of both VHR aerial imagery and SITS. |
Ivica Dimitrovski; Vlatko Spasev; Ivan Kitanovski; | arxiv-cs.CV | 2024-10-01 |
300 | Multi-Bottleneck Progressive Propulsion Network for Medical Image Semantic Segmentation with Integrated Macro-micro Dual-stage Feature Enhancement and Refinement Related Papers Related Patents Related Grants Related Venues Related Experts View |
YUEFEI WANG et. al. | Expert Syst. Appl. | 2024-10-01 |
301 | I-MedSAM: Implicit Medical Image Segmentation with Segment Anything Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose I-MedSAM, which leverages the benefits of both continuous representations and SAM, to obtain better cross-domain ability and accurate boundary delineation. |
XIAOBAO WEI et. al. | eccv | 2024-09-30 |
302 | PSALM: Pixelwise Segmentation with Large Multi-modal Model IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To overcome the limitation of the LMM being limited to textual output, PSALM incorporates a mask decoder and a well-designed input schema to handle a variety of segmentation tasks. This schema includes images, task instructions, conditional prompts, and mask tokens, which enable the model to generate and classify segmentation masks effectively. |
Zheng Zhang; yeyao ma; Enming Zhang; Xiang Bai; | eccv | 2024-09-30 |
303 | In Defense of Lazy Visual Grounding for Open-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present Lazy Visual Grounding for open-vocabulary semantic segmentation, which decouples unsupervised object mask discovery from object grounding. |
Dahyun Kang; Minsu Cho; | eccv | 2024-09-30 |
304 | Knowledge Transfer with Simulated Inter-Image Erasing for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Though adversarial erasing has prevailed in weakly supervised semantic segmentation to help activate integral object regions, existing approaches still suffer from the dilemma of under-activation and over-expansion due to the difficulty in determining when to stop erasing. In this paper, we propose a Knowledge Transfer with Simulated Inter-Image Erasing (KTSE) approach for weakly supervised semantic segmentation to alleviate the above problem. |
TAO CHEN et. al. | eccv | 2024-09-30 |
305 | Explore The Potential of CLIP for Training-Free Open Vocabulary Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Our study delves into the impact of CLIP’s [CLS] token on patch feature correlations, revealing a dominance of ”global” patches that hinders local feature discrimination. To overcome this, we propose CLIPtrase, a novel training-free semantic segmentation strategy that enhances local feature awareness through recalibrated self-correlation among patches. |
Tong Shao; Zhuotao Tian; Hang Zhao; Jingyong Su; | eccv | 2024-09-30 |
306 | Beyond Pixels: Semi-Supervised Semantic Segmentation with A Multi-scale Patch-based Multi-Label Classifier Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we show that an effective way to incorporate contextual information is through a patch-based classifier. |
Prantik Howlader; Srijan Das; Hieu Le; Dimitris Samaras; | eccv | 2024-09-30 |
307 | SegPoint: Segment Any Point Cloud Via Large Language Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a model, called , that leverages the reasoning capabilities of a multi-modal Large Language Model (LLM) to produce point-wise segmentation masks across a diverse range of tasks: 1) 3D instruction segmentation, 2) 3D referring segmentation, 3) 3D semantic segmentation, and 4) 3D open-vocabulary semantic segmentation.To advance 3D instruction research, we introduce a new benchmark, , designed to evaluate segmentation performance from complex and implicit instructional texts, featuring point cloud-instruction pairs. |
Shuting He; Henghui Ding; Xudong Jiang; Bihan Wen; | eccv | 2024-09-30 |
308 | Dataset Enhancement with Instance-Level Augmentations Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present a method for expanding a dataset by incorporating knowledge from the wide distribution of pre-trained latent diffusion models. |
Orest Kupyn; Christian Rupprecht; | eccv | 2024-09-30 |
309 | SemiVL: Semi-Supervised Semantic Segmentation with Vision-Language Guidance IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To adapt the VLM from global to local reasoning, we introduce a spatial fine-tuning strategy for label-efficient learning. |
Lukas Hoyer; David Joseph Tan; Muhammad Ferjad Naeem; Luc Van Gool; Federico Tombari; | eccv | 2024-09-30 |
310 | Enriching Information and Preserving Semantic Congruence in Expanding Curvilinear Object Segmentation Datasets Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Curvilinear object segmentation plays a crucial role across various applications, yet datasets in this domain often suffer from small scale due to the high costs associated with data acquisition and annotation. To address these challenges, this paper introduces a novel approach for expanding curvilinear object segmentation datasets, focusing on enhancing the informativeness of generated data and the consistency between semantic maps and generated images. |
Qin Lei; Jiang Zhong; Qizhu Dai; | eccv | 2024-09-30 |
311 | Open-Vocabulary Camouflaged Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To fill in the gaps, we introduce a new task, open-vocabulary camouflaged object segmentation (OVCOS), and construct a large-scale complex scene dataset (OVCamo) containing 11,483 hand-selected images with fine annotations and corresponding object classes. |
Youwei Pang; Xiaoqi Zhao; JiaMing Zuo; Lihe Zhang; Huchuan Lu; | eccv | 2024-09-30 |
312 | From Pixels to Objects: A Hierarchical Approach for Part and Object Segmentation Using Local and Global Aggregation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a hierarchical transformer-based model designed for sophisticated image segmentation tasks, effectively bridging the granularity of part segmentation with the comprehensive scope of object segmentation. |
Yunfei Xie; Cihang Xie; Alan Yuille; Jieru Mei; | eccv | 2024-09-30 |
313 | Placing Objects in Context Via Inpainting for Out-of-distribution Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose the Placing Objects in Context (POC) pipeline to realistically add any object into any image via diffusion models. |
Pau de Jorge Aranda; Riccardo Volpi; Puneet Dokania; Philip Torr; Gregory Rogez; | eccv | 2024-09-30 |
314 | View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we address the challenging task of lifting multi-granular and view-inconsistent image segmentations into a hierarchical and 3D-consistent representation. |
Haodi He; Colton Stearns; Adam Harley; Leonidas Guibas; | eccv | 2024-09-30 |
315 | Open Panoramic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To further enhance the distortion-aware modeling ability from the pinhole source domain, we propose a novel data augmentation method called Random Equirectangular Projection (RERP) which is specifically designed to address object deformations in advance. |
JUNWEI ZHENG et. al. | eccv | 2024-09-30 |
316 | Boosting Gaze Object Prediction Via Pixel-level Supervision from Vision Foundation Model Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents a more challenging gaze object segmentation (GOS) task, which involves inferring the pixel-level mask corresponding to the object captured by human gaze behavior. |
Yang Jin; Lei Zhang; Shi Yan; Bin Fan; Binglu Wang; | eccv | 2024-09-30 |
317 | Open-Vocabulary RGB-Thermal Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Second, when fusing RGB and thermal images, they often need to design complex fusion network structures, which usually results in low network training efficiency. We present OpenRSS, the Open-vocabulary RGB-T Semantic Segmentation method, to solve these two disadvantages. |
GUOQIANG ZHAO et. al. | eccv | 2024-09-30 |
318 | Segment3D: Learning Fine-Grained Class-Agnostic 3D Segmentation Without Manual Labels IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In contrast, recent 2D foundation models have demonstrated strong generalization and impressive zero-shot abilities, inspiring us to incorporate these characteristics from 2D models into 3D models. Therefore, we explore the use of image segmentation foundation models to automatically generate high-quality training labels for 3D segmentation models. |
RUI HUANG et. al. | eccv | 2024-09-30 |
319 | 3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose 3DSS-VLG, a weakly supervised approach for 3D Semantic Segmentation with 2D Vision-Language Guidance, an alternative approach that a 3D model predicts dense-embedding for each point which is co-embedded with both the aligned image and text spaces from the 2D vision-language model. |
XIAOXU XU et. al. | eccv | 2024-09-30 |
320 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To effectively embed high-dimensional features, we propose a double-nested autoencoder structure with a novel class-aware embedding objective to encode high-dimensional features into manageable voxel-wise embeddings. |
Li Li; Hubert P. H. Shum; Toby P Breckon; | eccv | 2024-09-30 |
321 | Class-Agnostic Visio-Temporal Scene Sketch Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a Class-Agnostic Visio-Temporal Network (CAVT) for scene sketch semantic segmentation. |
Aleyna Kütük; Tevfik Metin Sezgin; | arxiv-cs.CV | 2024-09-30 |
322 | OLAF: A Plug-and-Play Framework for Enhanced Multi-object Multi-part Scene Parsing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To address the task, we propose a plug-and-play approach termed OLAF. |
Pranav Gupta; Rishubh Singh; Pradeep Shenoy; Ravi Kiran Sarvadevabhatla; | eccv | 2024-09-30 |
323 | VISA: Reasoning Video Object Segmentation Via Large Language Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a new task, Reasoning Video Object Segmentation (ReasonVOS). |
CILIN YAN et. al. | eccv | 2024-09-30 |
324 | Occlusion-Aware Seamless Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Panoramic images can broaden the Field of View (FoV), occlusion-aware prediction can deepen the understanding of the scene, and domain adaptation can transfer across viewing domains. In this work, we introduce a novel task, Occlusion-Aware Seamless Segmentation (OASS), which simultaneously tackles all these three challenges. |
YIHONG CAO et. al. | eccv | 2024-09-30 |
325 | Can Textual Semantics Mitigate Sounding Object Segmentation Preference? Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Motivated by the the fact that text modality is well explored and contains rich abstract semantics, we propose leveraging text cues from the visual scene to enhance audio guidance with the semantics inherent in text. |
Yaoting Wang; Peiwen Sun; Yuanchao Li; Honggang Zhang; Di Hu; | eccv | 2024-09-30 |
326 | Betrayed By Attention: A Simple Yet Effective Approach for Self-supervised Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a simple yet effective approach for self-supervised video object segmentation (VOS). |
Shuangrui Ding; Rui Qian; Haohang Xu; Dahua Lin; Hongkai Xiong; | eccv | 2024-09-30 |
327 | O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: For the purpose of preserving consistency in 3D object properties across different viewpoints, we propose a spatial adaptive voxel adjustment mechanism and a multi-view weight selection method. |
MUER TIE et. al. | eccv | 2024-09-30 |
328 | SegGen: Supercharging Segmentation Models with Text2Mask and Mask2Img Synthesis Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present , a new data generation approach that pushes the performance boundaries of state-of-the-art image segmentation models. |
HANRONG YE et. al. | eccv | 2024-09-30 |
329 | Towards Reliable Evaluation and Fast Training of Robust Semantic Segmentation Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose several problem-specific novel attacks minimizing different metrics in accuracy and mIoU. |
Francesco Croce; Naman D. Singh; Matthias Hein; | eccv | 2024-09-30 |
330 | Segment and Recognize Anything at Any Granularity Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce , an augmented image segmentation foundation for segmenting and recognizing anything at desired granularities. |
FENG LI et. al. | eccv | 2024-09-30 |
331 | One Token to Seg Them All: Language Instructed Reasoning Segmentation in Videos Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce VideoLISA, a video-based multimodal large language model designed to tackle the problem of language-instructed reasoning segmentation in videos. |
ZECHEN BAI et. al. | arxiv-cs.CV | 2024-09-29 |
332 | Get It For Free: Radar Segmentation Without Expert Labels and Its Application in Odometry and Localization Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a novel weakly supervised semantic segmentation method for radar segmentation, where the existing LiDAR semantic segmentation models are employed to generate semantic labels, which then serve as supervision signals for training a radar semantic segmentation model. |
Siru Li; Ziyang Hong; Yushuai Chen; Liang Hu; Jiahu Qin; | arxiv-cs.RO | 2024-09-26 |
333 | Global-Local Medical SAM Adaptor Based on Full Adaption Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, Med-SA still can be improved, as it fine-tunes SAM in a partial adaption manner. To resolve this problem, we present a novel global medical SAM adaptor (GMed-SA) with full adaption, which can adapt SAM globally. |
MENG WANG et. al. | arxiv-cs.AI | 2024-09-25 |
334 | Go-SLAM: Grounded Object Segmentation and Localization with Gaussian Splatting SLAM Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce Go-SLAM, a novel framework that utilizes 3D Gaussian Splatting SLAM to reconstruct dynamic environments while embedding object-level information within the scene representations. |
Phu Pham; Dipam Patel; Damon Conover; Aniket Bera; | arxiv-cs.RO | 2024-09-25 |
335 | Potential Field As Scene Affordance for Behavior Change-Based Visual Risk Object Identification Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we compute potential fields by assigning different energy levels according to the semantic labels obtained from BEV semantic segmentation. |
Pang-Yuan Pao; Shu-Wei Lu; Ze-Yan Lu; Yi-Ting Chen; | arxiv-cs.CV | 2024-09-24 |
336 | The BRAVO Semantic Segmentation Challenge Results in UNCV2024 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose the unified BRAVO challenge to benchmark the reliability of semantic segmentation models under realistic perturbations and unknown out-of-distribution (OOD) scenarios. |
TUAN-HUNG VU et. al. | arxiv-cs.CV | 2024-09-23 |
337 | ZeroSCD: Zero-Shot Street Scene Change Detection Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Traditional change detection methods rely on training models that take these image pairs as input and estimate the changes, which requires large amounts of annotated data, a costly and time-consuming process. To overcome this, we propose ZeroSCD, a zero-shot scene change detection framework that eliminates the need for training. |
Shyam Sundar Kannan; Byung-Cheol Min; | arxiv-cs.RO | 2024-09-23 |
338 | MOSE: Monocular Semantic Reconstruction Using NeRF-Lifted Noisy Priors Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose MOSE, a neural field semantic reconstruction approach to lift inferred image-level noisy priors to 3D, producing accurate semantics and geometry in both 3D and 2D space. |
Zhenhua Du; Binbin Xu; Haoyu Zhang; Kai Huo; Shuaifeng Zhi; | arxiv-cs.CV | 2024-09-21 |
339 | Infield Disease Detection in Citrus Plants: Integrating Semantic Segmentation and Dynamic Deep Learning Object Detection Model for Enhanced Agricultural Yield Related Papers Related Patents Related Grants Related Venues Related Experts View |
N. Rani; Arun Sri Krishna; M. Sunag; M. A. Sangamesha; B. R. Pushpa; | Neural Comput. Appl. | 2024-09-21 |
340 | CUS3D :CLIP-based Unsupervised 3D Segmentation Via Object-level Denoise Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, unlike previous research that ignores the “noise” raised during feature projection from 2D to 3D, we propose a novel distillation learning framework named CUS3D. |
Fuyang Yu; Runze Tian; Zhen Wang; Xiaochuan Wang; Xiaohui Liang; | arxiv-cs.CV | 2024-09-20 |
341 | A Bottom-Up Approach to Class-Agnostic Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present a novel bottom-up formulation for addressing the class-agnostic segmentation problem. |
Sebastian Dille; Ari Blondal; Sylvain Paris; Yağız Aksoy; | arxiv-cs.CV | 2024-09-20 |
342 | HS3-Bench: A Benchmark and Strong Baseline for Hyperspectral Semantic Segmentation in Driving Scenarios Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Even though some datasets exist, there is no standard benchmark available to systematically measure progress on this task and evaluate the benefit of hyperspectral data. In this paper, we work towards closing this gap by providing the HyperSpectral Semantic Segmentation benchmark (HS3-Bench). |
Nick Theisen; Robin Bartsch; Dietrich Paulus; Peer Neubert; | arxiv-cs.CV | 2024-09-17 |
343 | Fuse4Seg: Image-Level Fusion Based Multi-Modality Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We argue the current feature-level fusion strategy is prone to semantic inconsistencies and misalignments across various imaging modalities because it merges features at intermediate layers in a neural network without evaluative control. To mitigate this, we introduce a novel image-level fusion based multi-modality medical image segmentation method, Fuse4Seg, which is a bi-level learning framework designed to model the intertwined dependencies between medical image segmentation and medical image fusion. |
Yuchen Guo; Weifeng Su; | arxiv-cs.CV | 2024-09-16 |
344 | Semantic2D: A Semantic Dataset for 2D Lidar Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a 2D lidar semantic segmentation dataset to enhance the semantic scene understanding for mobile robots in different indoor robotics applications. |
Zhanteng Xie; Philip Dames; | arxiv-cs.RO | 2024-09-15 |
345 | Resolving Inconsistent Semantics in Multi-Dataset Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing methods struggle with this setting, particularly when evaluated on label spaces mixed from the individual training sets. To overcome these issues, we introduce a simple yet effective multi-dataset training approach by integrating language-based embeddings of class names and label space-specific query embeddings. |
Qilong Zhangli; Di Liu; Abhishek Aich; Dimitris Metaxas; Samuel Schulter; | arxiv-cs.CV | 2024-09-15 |
346 | Multi-Scale Grouped Prototypes for Interpretable Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a method for interpretable semantic segmentation that leverages multi-scale image representation for prototypical part learning. |
Hugo Porta; Emanuele Dalsasso; Diego Marcos; Devis Tuia; | arxiv-cs.CV | 2024-09-14 |
347 | AFFSegNet: Adaptive Feature Fusion Segmentation Network for Microtumors and Multi-Organ Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce an augmented multi-layer perceptron within the encoder to explicitly model long-range dependencies during feature extraction. |
FUCHEN ZHENG et. al. | arxiv-cs.CV | 2024-09-12 |
348 | UNIT: Unsupervised Online Instance Segmentation Through Time Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To that end, we leverage an instance segmentation backbone and propose a new training recipe that enables the online tracking of objects. |
Corentin Sautier; Gilles Puy; Alexandre Boulch; Renaud Marlet; Vincent Lepetit; | arxiv-cs.CV | 2024-09-12 |
349 | Segmentation By Factorization: Unsupervised Semantic Segmentation for Pathology By Factorizing Foundation Model Features Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce Segmentation by Factorization (F-SEG), an unsupervised segmentation method for pathology that generates segmentation masks from pre-trained deep learning models. |
Jacob Gildenblat; Ofir Hadar; | arxiv-cs.CV | 2024-09-09 |
350 | Enhanced Generative Data Augmentation for Semantic Segmentation Via Stronger Guidance Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce an effective data augmentation pipeline for semantic segmentation using Controllable Diffusion model. |
Quang-Huy Che; Duc-Tri Le; Bich-Nga Pham; Duc-Khai Lam; Vinh-Tiep Nguyen; | arxiv-cs.CV | 2024-09-09 |
351 | SGSeg: Enabling Text-free Inference in Language-guided Segmentation of Chest X-rays Via Self-guidance Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this study, we propose a self-guided segmentation framework (SGSeg) that leverages language guidance for training (multi-modal) while enabling text-free inference (uni-modal), which is the first that enables text-free inference in language-guided segmentation. |
Shuchang Ye; Mingyuan Meng; Mingjian Li; Dagan Feng; Jinman Kim; | arxiv-cs.CV | 2024-09-07 |
352 | ISeg: An Iterative Refinement-based Framework for Training-free Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To fully utilize self-attention map, we present a deep experimental analysis on iteratively refining cross-attention map with self-attention map, and propose an effective iterative refinement framework for training-free segmentation, named iSeg. |
Lin Sun; Jiale Cao; Jin Xie; Fahad Shahbaz Khan; Yanwei Pang; | arxiv-cs.CV | 2024-09-04 |
353 | Evaluation Study on SAM 2 for Class-agnostic Instance-level Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Segment Anything Model (SAM) has demonstrated powerful zero-shot segmentation performance in natural scenes. |
Jialun Pei; Zhangjun Zhou; Tiantian Zhang; | arxiv-cs.CV | 2024-09-04 |
354 | AllWeatherNet:Unified Image Enhancement for Autonomous Driving Under Adverse Weather and Lowlight-conditions Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing methods have limited effectiveness in improving essential computer vision tasks, such as semantic segmentation, and often focus on only one specific condition, such as removing rain or translating nighttime images into daytime ones. To address these limitations, we propose a method to improve the visual quality and clarity degraded by such adverse conditions. |
CHENGHAO QIAN et. al. | arxiv-cs.CV | 2024-09-03 |
355 | Segmenting Object Affordances: Reproducibility and Sensitivity to Scale Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, experimental setups are often not reproducible, thus leading to unfair and inconsistent comparisons. In this work, we benchmark these methods under a reproducible setup on two single objects scenarios, tabletop without occlusions and hand-held containers, to facilitate future comparisons. |
Tommaso Apicella; Alessio Xompero; Paolo Gastaldo; Andrea Cavallaro; | arxiv-cs.CV | 2024-09-03 |
356 | Fast Semantic Segmentation of Ultra-High-Resolution Remote Sensing Images Via Score Map and Fast Transformer-Based Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: For ultra-high-resolution (UHR) image semantic segmentation, striking a balance between computational efficiency and storage space is a crucial research direction. This paper … |
Yihao Sun; Mingrui Wang; Xiaoyi Huang; Chengshu Xin; Yinan Sun; | Remote. Sens. | 2024-09-02 |
357 | Transferring Multi-Modal Domain Knowledge to Uni-Modal Domain for Urban Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Synthetic data (i.e., source domain) have been widely adopted to improve the semantic segmentation performance for real-world images (i.e., target domain), since obtaining … |
PENG LIU et. al. | IEEE Transactions on Intelligent Transportation Systems | 2024-09-01 |
358 | Multi-source Domain Adaptation for Panoramic Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, these methods struggle to understand the panoramic structure using only real pinhole images and lack real-world scene perception with only synthetic panoramic images. Therefore, in this paper, we propose a new task, Multi-source Domain Adaptation for Panoramic Semantic Segmentation (MSDA4PASS), which leverages both real pinhole and synthetic panoramic images to improve segmentation on unlabeled real panoramic images. |
JING JIANG et. al. | arxiv-cs.CV | 2024-08-29 |
359 | DQFormer: Towards Unified LiDAR Panoptic Segmentation with Decoupled Queries Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose decoupling things/stuff queries according to their intrinsic properties for individual decoding and disentangling classification/segmentation to mitigate ambiguity. |
YU YANG et. al. | arxiv-cs.CV | 2024-08-28 |
360 | SPNet: Dual-Branch Network with Spatial Supplementary Information for Building and Water Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is primarily employed to generate accurate prediction labels for each pixel of the input image, and then classify the images according to the generated … |
WENYU ZHAO et. al. | Remote. Sens. | 2024-08-27 |
361 | Handling Geometric Domain Shifts in Semantic Segmentation of Surgical RGB and Hyperspectral Images Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While model development and validation are primarily conducted on idealistic scenes, geometric domain shifts, such as occlusions of the situs, are common in real-world open surgeries. To close this gap, we (1) present the first analysis of state-of-the-art (SOA) semantic segmentation models when faced with geometric out-of-distribution (OOD) data, and (2) propose an augmentation technique called Organ Transplantation, to enhance generalizability. |
SILVIA SEIDLITZ et. al. | arxiv-cs.CV | 2024-08-27 |
362 | MROVSeg: Breaking The Resolution Curse of Vision-Language Models in Open-Vocabulary Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: A typical solution is to employ additional image backbones for high-resolution inputs, but it also introduce significant computation overhead. Therefore, we propose MROVSeg, a multi-resolution training framework for open-vocabulary image segmentation with a single pretrained CLIP backbone, that uses sliding windows to slice the high-resolution input into uniform patches, each matching the input size of the well-trained image encoder. |
YUANBING ZHU et. al. | arxiv-cs.CV | 2024-08-27 |
363 | ICFRNet: Image Complexity Prior Guided Feature Refinement for Real-time Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we leverage image complexity as a prior for refining segmentation features to achieve accurate real-time semantic segmentation. |
Xin Zhang; Teodor Boyadzhiev; Jinglei Shi; Jufeng Yang; | arxiv-cs.CV | 2024-08-25 |
364 | FusionSAM: Latent Space Driven Segment Anything Model for Multimodal Fusion and Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce SAM into multimodal image segmentation for the first time, proposing a novel framework that combines Latent Space Token Generation (LSTG) and Fusion Mask Prompting (FMP) modules to enhance SAM’s multimodal fusion and segmentation capabilities. |
DAIXUN LI et. al. | arxiv-cs.CV | 2024-08-25 |
365 | Accuracy Improvement of Cell Image Segmentation Using Feedback Former Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This tendency leads to a lack of detailed information for segmentation. Therefore, to supplement or reinforce the missing detailed information, we hypothesized that feedback processing in the human visual cortex should be effective. |
Hinako Mitsuoka; Kazuhiro Hotta; | arxiv-cs.CV | 2024-08-23 |
366 | Image Segmentation in Foundation Model Era: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We investigate two basic lines of research — generic image segmentation (i.e., semantic segmentation, instance segmentation, panoptic segmentation), and promptable image segmentation (i.e., interactive segmentation, referring segmentation, few-shot segmentation) — by delineating their respective task settings, background concepts, and key challenges. |
TIANFEI ZHOU et. al. | arxiv-cs.CV | 2024-08-23 |
367 | The 2nd Solution for LSVOS Challenge RVOS Track: Spatial-temporal Refinement for Consistent Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: During testing, while these models can effectively process information over short time steps, they struggle to maintain consistent perception over prolonged time sequences, leading to inconsistencies in the resulting semantic segmentation masks. To address this challenge, we take a step further in this work by leveraging the tracking capabilities of the newly introduced Segment Anything Model version 2 (SAM-v2) to enhance the temporal consistency of the referring object segmentation model. |
Tuyen Tran; | arxiv-cs.CV | 2024-08-22 |
368 | Improved Semi-Supervised Attention GAN for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is one of the cornerstone problems in computer vision that involves assigning each image pixel to a specific semantic class. Traditional supervised learning … |
Nusrat Jahan; Thangarajah Akilan; Thanh Minh Nguyen; | 2024 IEEE Pacific Rim Conference on Communications, … | 2024-08-21 |
369 | Rethinking Video Segmentation with Masked Video Consistency: Did The Model Learn As Intended? Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This leads to inconsistent segmentation results across frames. To address these issues, we propose a training strategy Masked Video Consistency, which enhances spatial and temporal feature aggregation. |
Chen Liang; Qiang Guo; Xiaochao Qu; Luoqi Liu; Ting Liu; | arxiv-cs.CV | 2024-08-20 |
370 | 3D-Aware Instance Segmentation and Tracking in Egocentric Videos Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Egocentric videos present unique challenges for 3D scene understanding due to rapid camera motion, frequent object occlusions, and limited object visibility. This paper introduces a novel approach to instance segmentation and tracking in first-person video that leverages 3D awareness to overcome these obstacles. |
YASH BHALGAT et. al. | arxiv-cs.CV | 2024-08-19 |
371 | OVOSE: Open-Vocabulary Semantic Segmentation in Event-Based Cameras Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce OVOSE, the first Open-Vocabulary Semantic Segmentation algorithm for Event cameras. |
Muhammad Rameez Ur Rahman; Jhony H. Giraldo; Indro Spinelli; Stéphane Lathuilière; Fabio Galasso; | arxiv-cs.CV | 2024-08-18 |
372 | Depth-guided Texture Diffusion for Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we introduce a Depth-guided Texture Diffusion approach that effectively tackles the outlined challenge. |
Wei Sun; Yuan Li; Qixiang Ye; Jianbin Jiao; Yanzhao Zhou; | arxiv-cs.CV | 2024-08-17 |
373 | Tuning A SAM-Based Model with Multi-Cognitive Visual Adapter to Remote Sensing Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, a Multi-Cognitive SAM-Based Instance Segmentation Model (MC-SAM SEG) is introduced to employ SAM on remote sensing domain. |
Linghao Zheng; Xinyang Pu; Feng Xu; | arxiv-cs.CV | 2024-08-16 |
374 | SAM2-UNet: Segment Anything 2 Makes Strong Encoder for Natural and Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a simple but effective framework, termed SAM2-UNet, for versatile image segmentation. |
XINYU XIONG et. al. | arxiv-cs.CV | 2024-08-16 |
375 | HEFANet: Hierarchical Efficient Fusion and Aggregation Segmentation Network for Enhanced Rgb-thermal Urban Scene Parsing Related Papers Related Patents Related Grants Related Venues Related Experts View |
ZHENGWEN SHEN et. al. | Appl. Intell. | 2024-08-14 |
376 | MetaSeg: MetaFormer-based Global Contexts-aware Network for Efficient Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a powerful semantic segmentation network, MetaSeg, which leverages the Metaformer architecture from the backbone to the decoder. |
Beoungwoo Kang; Seunghun Moon; Yubin Cho; Hyunwoo Yu; Suk-Ju Kang; | arxiv-cs.CV | 2024-08-14 |
377 | Enhancing Autonomous Vehicle Perception in Adverse Weather Through Image Augmentation During Semantic Segmentation Training Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We trained encoder-decoder UNet models to perform semantic segmentation. |
Ethan Kou; Noah Curran; | arxiv-cs.CV | 2024-08-13 |
378 | MacFormer: Semantic Segmentation with Fine Object Boundaries Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While Vision Transformer-based models have made significant progress, current semantic segmentation methods often struggle with precise predictions in localized areas like object boundaries. To tackle this challenge, we introduce a new semantic segmentation architecture, “MacFormer”, which features two key components. |
GUOAN XU et. al. | arxiv-cs.CV | 2024-08-11 |
379 | TOSS: Real-time Tracking and Moving Object Segmentation for Static Scene Mapping Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose an integrated real-time framework that combines online tracking-based moving object segmentation with static map building. |
SEOYEON JANG et. al. | arxiv-cs.RO | 2024-08-10 |
380 | Embodied Uncertainty-Aware Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To deal with uncertainty in robot perception, we propose a method for generating a hypothesis distribution of object segmentation. |
Xiaolin Fang; Leslie Pack Kaelbling; Tomás Lozano-Pérez; | arxiv-cs.RO | 2024-08-08 |
381 | SegXAL: Explainable Active Learning for Semantic Segmentation in Driving Scene Scenarios Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, there are certain challenges that hinder the deployment of AI models in-the-wild scenarios, i.e., inefficient use of unlabeled data, lack of incorporation of human expertise, and lack of interpretation of the results. To mitigate these challenges, we propose a novel Explainable Active Learning (XAL) model, XAL-based semantic segmentation model SegXAL, that can (i) effectively utilize the unlabeled data, (ii) facilitate the Human-in-the-loop paradigm, and (iii) augment the model decisions in an interpretable way. |
Sriram Mandalika; Athira Nambiar; | arxiv-cs.CV | 2024-08-08 |
382 | Biomedical SAM 2: Segment Anything in Biomedical Images and Videos Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To explore the performance of SAM-2 in biomedical applications, we designed three evaluation pipelines for single-frame 2D image segmentation, multi-frame 3D image segmentation and multi-frame video segmentation with varied prompt designs, revealing SAM-2’s limitations in medical contexts. |
ZHILING YAN et. al. | arxiv-cs.CV | 2024-08-06 |
383 | Query3D: LLM-Powered Open-Vocabulary Scene Segmentation with Language Embedded 3D Gaussian Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper introduces a novel method for open-vocabulary 3D scene querying in autonomous driving by combining Language Embedded 3D Gaussians with Large Language Models (LLMs). |
Amirhosein Chahe; Lifeng Zhou; | arxiv-cs.CV | 2024-08-06 |
384 | Segmentation Style Discovery: Application to Skin Lesion Images Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce the problem of segmentation style discovery, and propose StyleSeg, a segmentation method that learns plausible, diverse, and semantically consistent segmentation styles from a corpus of image-mask pairs without any knowledge of annotator correspondence. |
Kumar Abhishek; Jeremy Kawahara; Ghassan Hamarneh; | arxiv-cs.CV | 2024-08-05 |
385 | Pixel-Level Domain Adaptation: A New Perspective for Enhancing Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we argue that the distribution discrepancy between the discriminative and the non-discriminative parts of objects prevents the model from producing complete and precise pseudo masks as ground truths. |
Ye Du; Zehua Fu; Qingjie Liu; | arxiv-cs.CV | 2024-08-04 |
386 | Bridging LiDAR Gaps: A Multi-LiDARs Domain Adaptation Dataset for 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We focus on the domain adaptation problem for 3D semantic segmentation, addressing the challenge of data variability in point clouds collected by different LiDARs. |
SHAOYANG CHEN et. al. | ijcai | 2024-08-03 |
387 | PPTFormer: Pseudo Multi-Perspective Transformer for UAV Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Traditional segmentation algorithms falter as they cannot accurately mimic the complexity of UAV perspectives, and the cost of obtaining multi-perspective labeled datasets is prohibitive. To address these issues, we introduce the PPTFormer, a novel Pseudo Multi-Perspective Transformer network that revolutionizes UAV image segmentation. |
Deyi Ji; Wenwei Jin; Hongtao Lu; Feng Zhao; | ijcai | 2024-08-03 |
388 | MISA: MIning Saliency-Aware Semantic Prior for Box Supervised Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To boost the BSIS model’s perceptual ability for object shape and contour, we introduce MISA, that is, MIning Saliency-Aware semantic prior from a well-optimized box supervised semantic segmentation (BSSS) network, and incorporating cross-model guidance into the learning process of BSIS. |
HAO ZHU et. al. | ijcai | 2024-08-03 |
389 | Aggregation and Purification: Dual Enhancement Network for Point Cloud Few-shot Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we design a novel Dual Enhancement Network (DENet) to comprehensively tackle different kinds of scene discrepancies in a coherent and synergistic framework. |
GUOXIN XIONG et. al. | ijcai | 2024-08-03 |
390 | Efficient Dual-Stream Fusion Network for Real-Time Railway Scene Understanding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Railway scene understanding is key to autonomous train operation and important in active train perception. However, most railway scene understanding methods focus on track … |
ZHIWEI CAO et. al. | IEEE Transactions on Intelligent Transportation Systems | 2024-08-01 |
391 | Prompt Learning for Light Field Semantic Segmentation in The Consumer-Centric Internet of Intelligent Computing Things Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Light field semantic segmentation accurately identifies the semantic information of the scene, providing solutions for various intelligent computing tasks in consumer electronics … |
CHEN JIA et. al. | IEEE Transactions on Consumer Electronics | 2024-08-01 |
392 | Multi-unit Stacked Architecture: An Urban Scene Segmentation Network Based on UNet and ShuffleNetv2 Related Papers Related Patents Related Grants Related Venues Related Experts View |
Dian Liu; Jianchao Du; Chuhan Li; Chenglong Yu; Mingjin Zhang; | Appl. Soft Comput. | 2024-08-01 |
393 | Medical SAM 2: Segment Medical Images As Video Via Segment Anything Model 2 IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce Medical SAM 2 (MedSAM-2), a generalized auto-tracking model for universal 2D and 3D medical image segmentation. |
Jiayuan Zhu; Abdullah Hamdi; Yunli Qi; Yueming Jin; Junde Wu; | arxiv-cs.CV | 2024-08-01 |
394 | Point-supervised Brain Tumor Segmentation with Box-prompted MedSAM Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Although recent vision foundational models, such as the medical segment anything model (MedSAM), have made significant advancements in bounding-box-prompted segmentation, it is not straightforward to utilize point annotation, and is prone to semantic ambiguity. In this preliminary study, we introduce an iterative framework to facilitate semantic-aware point-supervised MedSAM. |
Xiaofeng Liu; Jonghye Woo; Chao Ma; Jinsong Ouyang; Georges El Fakhri; | arxiv-cs.CV | 2024-08-01 |
395 | MaskUno: Switch-Split Block For Enhancing Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In all the proposed variations to date, the problem of competing kernels (each class aims to maximize its own accuracy) persists when models try to synchronously learn numerous classes. In this paper, we propose mitigating this problem by replacing mask prediction with a Switch-Split block that processes refined ROIs, classifies them, and assigns them to specialized mask predictors. |
Jawad Haidar; Marc Mouawad; Imad Elhajj; Daniel Asmar; | arxiv-cs.CV | 2024-07-31 |
396 | 3D-GRES: Generalized 3D Referring Expression Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, current approaches are limited to segmenting a single target, restricting the versatility of the task. To overcome this limitation, we introduce Generalized 3D Referring Expression Segmentation (3D-GRES), which extends the capability to segment any number of instances based on natural language instructions. |
CHANGLI WU et. al. | arxiv-cs.CV | 2024-07-30 |
397 | Leveraging Adaptive Implicit Representation Mapping for Ultra High-Resolution Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Secondly, SIRMF is shared across all samples, which limits its ability to generalize and handle diverse inputs. To address these limitations, we propose a novel approach that leverages the newly proposed Adaptive Implicit Representation Mapping (AIRM) for ultra-high-resolution Image Segmentation. |
Ziyu Zhao; Xiaoguang Li; Pingping Cai; Canyu Zhang; Song Wang; | arxiv-cs.CV | 2024-07-30 |
398 | Fine-grained Metrics for Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Because of this, the majority of categories and large objects are favored in the existing evaluation metrics. This paper suggests fine-grained mIoU and mAcc for a more thorough assessment of point cloud segmentation algorithms in order to address these issues. |
Zhuheng Lu; Ting Wu; Yuewei Dai; Weiqing Li; Zhiyong Su; | arxiv-cs.CV | 2024-07-30 |
399 | Learning Ordinality in Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: While existing deep learning approaches achieve high accuracy, they often overlook the ordinal relationships between classes, which can provide critical domain knowledge (e.g., the pupil lies within the iris, and lane markings are part of the road). This paper introduces novel methods for spatial ordinal segmentation that explicitly incorporate these inter-class dependencies. |
Ricardo P. M. Cruz; Rafael Cristino; Jaime S. Cardoso; | arxiv-cs.CV | 2024-07-30 |
400 | ASI-Seg: Audio-Driven Surgical Instrument Segmentation with Surgeon Intention Understanding Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The recent Segment Anything Model (SAM) reveals the capability to segment objects following prompts, but the manual annotations for prompts are impractical during the surgery. To address these limitations in operating rooms, we propose an audio-driven surgical instrument segmentation framework, named ASI-Seg, to accurately segment the required surgical instruments by parsing the audio commands of surgeons. |
ZHEN CHEN et. al. | arxiv-cs.CV | 2024-07-28 |
401 | RefMask3D: Language-Guided Transformer for 3D Referring Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose RefMask3D to explore the comprehensive multi-modal feature interaction and understanding. |
Shuting He; Henghui Ding; | arxiv-cs.CV | 2024-07-25 |
402 | SMPISD-MTPNet: Scene Semantic Prior-Assisted Infrared Ship Detection Using Multi-Task Perception Networks Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: For the training process, we introduce the Soft Fine-tuning training strategy to suppress the distortion caused by data augmentation. |
CHEN HU et. al. | arxiv-cs.CV | 2024-07-25 |
403 | Navigating Uncertainty in Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We address the selection and evaluation of uncertain segmentation methods in medical imaging and present two case studies: prostate segmentation, illustrating that for minimal annotator variation simple deterministic models can suffice, and lung lesion segmentation, highlighting the limitations of the Generalized Energy Distance (GED) in model selection. |
Kilian Zepf; Jes Frellsen; Aasa Feragen; | arxiv-cs.CV | 2024-07-23 |
404 | Deformable Convolution Based Road Scene Semantic Segmentation of Fisheye Images in Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study investigates the effectiveness of modern Deformable Convolutional Neural Networks (DCNNs) for semantic segmentation tasks, particularly in autonomous driving scenarios with fisheye images. |
ANAM MANZOOR et. al. | arxiv-cs.CV | 2024-07-23 |
405 | Disentangling Spatio-temporal Knowledge for Weakly Supervised Object Detection and Segmentation in Surgical Video Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper introduces Video Spatio-Temporal Disentanglement Networks (VDST-Net), a framework to disentangle spatiotemporal information using semi-decoupled knowledge distillation to predict high-quality class activation maps (CAMs). |
Guiqiu Liao; Matjaz Jogan; Sai Koushik; Eric Eaton; Daniel A. Hashimoto; | arxiv-cs.CV | 2024-07-22 |
406 | Augmented Efficiency: Reducing Memory Footprint and Accelerating Inference for 3D Semantic Segmentation Through Hybrid Vision Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces a novel approach to 3D semantic segmentation, distinguished by incorporating a hybrid blend of 2D and 3D computer vision techniques, enabling a streamlined, efficient process. |
Aditya Krishnan; Jayneel Vora; Prasant Mohapatra; | arxiv-cs.CV | 2024-07-22 |
407 | GaussianBeV: 3D Gaussian Representation Meets Perception Models for BeV Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose GaussianBeV, a novel method for transforming image features to BeV by finely representing the scene using a set of 3D gaussians located and oriented in 3D space. |
Florian Chabot; Nicolas Granger; Guillaume Lapouge; | arxiv-cs.CV | 2024-07-19 |
408 | Panoptic Segmentation of Mammograms with Text-To-Image Diffusion Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We aim to harness their capabilities for breast lesion segmentation in a panoptic setting, which encompasses both semantic and instance-level predictions. |
Kun Zhao; Jakub Prokop; Javier Montalt Tordera; Sadegh Mohammadi; | arxiv-cs.CV | 2024-07-19 |
409 | ViLLa: Video Reasoning Segmentation with Large Language Model Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To bridge the gap between image and video, in this work, we propose a new video segmentation task – video reasoning segmentation. |
RONGKUN ZHENG et. al. | arxiv-cs.CV | 2024-07-18 |
410 | MeshSegmenter: Zero-Shot Mesh Semantic Segmentation Via Texture Synthesis Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present MeshSegmenter, a simple yet effective framework designed for zero-shot 3D semantic segmentation. |
ZIMING ZHONG et. al. | arxiv-cs.CV | 2024-07-18 |
411 | OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird’s-eye-view Vehicle Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, there are still two issues: 1) a lack of effective understanding and enhancement of BEV space features, particularly in accurately capturing long-distance environmental features and 2) recognizing fine details of target objects. To address these issues, we propose OE-BevSeg, an end-to-end multimodal framework that enhances BEV segmentation performance through global environment-aware perception and local target object enhancement. |
JIAN SUN et. al. | arxiv-cs.CV | 2024-07-17 |
412 | FoodMem: Near Real-time and Precise Food Video Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present FoodMem, a novel framework designed to segment food items from video sequences of 360-degree unbounded scenes. |
Ahmad AlMughrabi; Adrián Galán; Ricardo Marques; Petia Radeva; | arxiv-cs.CV | 2024-07-16 |
413 | VISA: Reasoning Video Object Segmentation Via Large Language Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a new task, Reasoning Video Object Segmentation (ReasonVOS). |
CILIN YAN et. al. | arxiv-cs.CV | 2024-07-15 |
414 | RAPiD-Seg: Range-Aware Pointwise Distance Distribution Networks for 3D LiDAR Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To effectively embed high-dimensional RAPiD features, we propose a double-nested autoencoder structure with a novel class-aware embedding objective to encode high-dimensional features into manageable voxel-wise embeddings. |
Li Li; Hubert P. H. Shum; Toby P. Breckon; | arxiv-cs.CV | 2024-07-14 |
415 | FANet: Feature Amplification Network for Semantic Segmentation in Cluttered Background Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing deep learning approaches leave out the semantic cues that are crucial in semantic segmentation present in complex scenarios including cluttered backgrounds and translucent objects, etc. To handle these challenges, we propose a feature amplification network (FANet) as a backbone network that incorporates semantic information using a novel feature enhancement module at multi-stages. |
Muhammad Ali; Mamoona Javaid; Mubashir Noman; Mustansar Fiaz; Salman Khan; | arxiv-cs.CV | 2024-07-12 |
416 | Enriching Information and Preserving Semantic Consistency in Expanding Curvilinear Object Segmentation Datasets Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Curvilinear object segmentation plays a crucial role across various applications, yet datasets in this domain often suffer from small scale due to the high costs associated with data acquisition and annotation. To address these challenges, this paper introduces a novel approach for expanding curvilinear object segmentation datasets, focusing on enhancing the informativeness of generated data and the consistency between semantic maps and generated images. |
Qin Lei; Jiang Zhong; Qizhu Dai; | arxiv-cs.CV | 2024-07-11 |
417 | CycleSAM: One-Shot Surgical Scene Segmentation Using Cycle-Consistent Feature Matching to Prompt SAM Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose CycleSAM, an approach for one-shot surgical scene segmentation that uses the training image-mask pair at test-time to automatically identify points in the test images that correspond to each object class, which can then be used to prompt SAM to produce object masks. |
Aditya Murali; Pietro Mascagni; Didier Mutter; Nicolas Padoy; | arxiv-cs.CV | 2024-07-09 |
418 | LuSNAR:A Lunar Segmentation, Navigation and Reconstruction Dataset Based on Muti-sensor for Autonomous Exploration Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Most of the existing lunar datasets are targeted at a single task, lacking diverse scenes and high-precision ground truth labels. To address this issue, we propose a multi-task, multi-scene, and multi-label lunar benchmark dataset LuSNAR. |
JIAYI LIU et. al. | arxiv-cs.CV | 2024-07-08 |
419 | Submodular Video Object Proposal Selection for Semantic Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes to achieve semantic video object segmentation by learning a data-driven representation which captures the synergy of multiple instances from continuous frames. |
Tinghuai Wang; | arxiv-cs.CV | 2024-07-08 |
420 | Object-Oriented Material Classification and 3D Clustering for Improved Semantic Perception and Mapping in Mobile Robots Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To this end, we propose a complementarity-aware deep learning approach for RGB-D-based material classification built on top of an object-oriented pipeline. |
Siva Krishna Ravipati; Ehsan Latif; Ramviyas Parasuraman; Suchendra M. Bhandarkar; | arxiv-cs.RO | 2024-07-08 |
421 | RHRSegNet: Relighting High-Resolution Night-Time Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose RHRSegNet, implementing a relighting model over a High-Resolution Network for semantic segmentation. |
Sarah Elmahdy; Rodaina Hebishy; Ali Hamdi; | arxiv-cs.CV | 2024-07-08 |
422 | Prototype-Guided Structural Learning from Visual Foundation Model for Few-Shot Aerial Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Few-shot aerial image semantic segmentation aims to segment query images with few annotated support samples. It is challenging due to intra-class variations and complex object … |
Qixiong Wang; Hongxiang Jiang; Jiaqi Feng; Guangyun Zhang; Jihao Yin; | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
423 | Enhancing Label-efficient Medical Image Segmentation with Text-guided Diffusion Models Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Aside from offering state-of-the-art performance in medical image generation, denoising diffusion probabilistic models (DPM) can also serve as a representation learner to capture … |
Chun-Mei Feng; | International Conference on Medical Image Computing and … | 2024-07-07 |
424 | CaRe-Ego: Contact-aware Relationship Modeling for Egocentric Interactive Hand-object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To make up for the shortcomings of existing methods, we propose a novel method called CaRe-Ego that achieves state-of-the-art performance by emphasizing the contact between hands and objects from two aspects. |
Yuejiao Su; Yi Wang; Lap-Pui Chau; | arxiv-cs.CV | 2024-07-07 |
425 | Knowledge-Enhancement Module for RGB-T Semantic Segmentation in Remote Sensing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In accomplishing the task of semantic segmentation of RGB-T remote sensing images, there is a great challenge due to severe occlusion, long-tailed data distribution, and … |
QINGWANG WANG et. al. | IGARSS 2024 – 2024 IEEE International Geoscience and Remote … | 2024-07-07 |
426 | Self-supervised Learning Via Cluster Distance Prediction for Operating Room Context Awareness Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a new 3D self-supervised task for OR scene understanding utilizing OR scene images captured with ToF cameras. |
Idris Hamoud; Alexandros Karargyris; Aidean Sharghi; Omid Mohareri; Nicolas Padoy; | arxiv-cs.CV | 2024-07-07 |
427 | LMSeg: A Deep Graph Message-passing Network for Efficient and Accurate Semantic Segmentation of Large-scale 3D Landscape Meshes Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents an end-to-end deep graph message-passing network, LMSeg, designed to efficiently and accurately perform semantic segmentation on large-scale 3D landscape meshes. |
Zexian Huang; Kourosh Khoshelham; Gunditj Mirring Traditional Owners Corporation; Martin Tomko; | arxiv-cs.CV | 2024-07-05 |
428 | Attention Normalization Impacts Cardinality Generalization in Slot Attention Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we demonstrate that design decisions on normalizing the aggregated values in the attention architecture have considerable impact on the capabilities of Slot Attention to generalize to a higher number of slots and objects as seen during training. |
Markus Krimmel; Jan Achterhold; Joerg Stueckler; | arxiv-cs.CV | 2024-07-04 |
429 | ISWSST: Index-space-wave State Superposition Transformers for Multispectral Remotely Sensed Imagery Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Currently the semantic segmentation task of multispectral remotely sensed imagery (MSRSI) faces the following problems: 1) Usually, only single domain feature (i.e., space domain … |
Chang Li; Pengfei Zhang; Yu Wang; | arxiv-cs.CV | 2024-07-03 |
430 | Multi-Grained Contrast for Data-Efficient Unsupervised Representation Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we aim to learn multi-grained representations, which can effectively describe the image on various granularity levels, thus improving generalization on extensive downstream tasks. |
Chengchao Shen; Jianzhong Chen; Jianxin Wang; | arxiv-cs.CV | 2024-07-02 |
431 | Fast and Efficient: Mask Neural Fields for 3D Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents MaskField, which enables efficient 3D open-vocabulary segmentation with neural fields from a novel perspective. |
ZIHAN GAO et. al. | arxiv-cs.CV | 2024-07-01 |
432 | Multiple Resolutions Detail Enhancement Network for Real-Time Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Real-time image semantic segmentation (ISS) draws the attentions of more and more researchers as a basis of scene understanding, and it has been applied in many fields that need … |
JING GU et. al. | IEEE Transactions on Artificial Intelligence | 2024-07-01 |
433 | Image Semantic Segmentation of Indoor Scenes: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts View |
Ronny Velastegui; Maxim Tatarchenko; Sezer Karaoglu; Theo Gevers; | Comput. Vis. Image Underst. | 2024-07-01 |
434 | Joint Optimization of Crack Segmentation With An Adaptive Dynamic Threshold Module Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Crack segmentation is a critical component in structural health monitoring. Conventional crack segmentation models usually focus on optimizing the cross-entropy-based objective … |
Qin Lei; Jiang Zhong; Chen Wang; | IEEE Transactions on Intelligent Transportation Systems | 2024-07-01 |
435 | PanopticRecon: Leverage Open-vocabulary Instance Segmentation for Zero-shot Panoptic Reconstruction Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel zero-shot panoptic reconstruction method from RGB-D images of scenes. |
XUAN YU et. al. | arxiv-cs.CV | 2024-07-01 |
436 | Multi-Level Object-Aware Guidance Network for Biomedical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Most state-of-the-art models for biomedical image segmentation are developed based on U-shape architecture, which has two renowned, yet mutually affected, shortcomings: 1) … |
Huisi Wu; Baiming Zhang; Junquan Pan; Jing Qin; | IEEE Transactions on Automation Science and Engineering | 2024-07-01 |
437 | Multi-modal LiDAR Point Cloud Semantic Segmentation with Salience Refinement and Boundary Perception Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Point cloud segmentation is essential for scene understanding, which provides advanced information for many applications, such as autonomous driving, robots, and virtual reality. … |
YONG ZHOU et. al. | ACM Transactions on Multimedia Computing, Communications … | 2024-07-01 |
438 | Segment Anything Model for Automated Image Data Annotation: Empirical Studies Using Text Prompts from Grounding DINO Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we perform empirical studies on six publicly available datasets across different domains and reveal that these errors consistently follow a predictable pattern and can, thus, be mitigated by a simple strategy. |
Fuseini Mumuni; Alhassan Mumuni; | arxiv-cs.CV | 2024-06-27 |
439 | SimTxtSeg: Weakly-Supervised Medical Image Segmentation with Simple Text Cues Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a novel framework, SimTxtSeg, that leverages simple text cues to generate high-quality pseudo-labels and study the cross-modal fusion in training segmentation models, simultaneously. |
Yuxin Xie; Tao Zhou; Yi Zhou; Geng Chen; | arxiv-cs.CV | 2024-06-27 |
440 | Artwork Segmentation in Eye-Tracking Experiments: Challenges and Future Directions Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Eye-tracking technology has gained prominence in cultural heritage studies, facilitating behavioral analysis and visitor engagement assessments. This paper explores the challenges … |
Alessio Ferrato; Carla Limongelli; M. Mezzini; Giuseppe Sansonetti; A. Micarelli; | Adjunct Proceedings of the 32nd ACM Conference on User … | 2024-06-27 |
441 | Principal Component Clustering for Semantic Segmentation in Synthetic Data Generation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This technical report outlines our method for generating a synthetic dataset for semantic segmentation using a latent diffusion model. |
Felix Stillger; Frederik Hasecke; Tobias Meisen; | arxiv-cs.CV | 2024-06-25 |
442 | A Lightweight Underwater Fish Image Semantic Segmentation Model Based on U-Net Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of underwater fish images is vital for monitoring fish stocks, assessing marine resources, and sustaining fisheries. To tackle challenges such as low … |
Zhenkai Zhang; Wanghua Li; Boon-Chong Seet; | IET Image Process. | 2024-06-25 |
443 | Exploring Image Fusion Techniques for Off-Road Semantic Segmentation in Harsh Lighting Conditions. A Multispectral Imagery Analysis Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, we have witnessed significant progress in the field of autonomous mobility. However, these advancements have been highly limited to urban environments. Autonomous … |
Pankaj Deoli; Shubham Abhay Deshpande; A. Vierling; Karsten Berns; | 2024 21st International Conference on Ubiquitous Robots (UR) | 2024-06-24 |
444 | SegNet4D: Effective and Efficient 4D LiDAR Semantic Segmentation in Autonomous Driving Environments Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: 4D LiDAR semantic segmentation, also referred to as multi-scan semantic segmentation, plays a crucial role in enhancing the environmental understanding capabilities of autonomous … |
NENG WANG et. al. | ArXiv | 2024-06-24 |
445 | SegNet4D: Efficient Instance-Aware 4D Semantic Segmentation for LiDAR Point Cloud Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce SegNet4D, a novel real-time 4D semantic segmentation network offering both efficiency and strong semantic understanding. |
NENG WANG et. al. | arxiv-cs.CV | 2024-06-23 |
446 | Bidirectional Feature Fusion and Enhanced Alignment Based Multimodal Semantic Segmentation for Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Image–text multimodal deep semantic segmentation leverages the fusion and alignment of image and text information and provides more prior knowledge for segmentation tasks. It is … |
Qianqian Liu; Xili Wang; | Remote. Sens. | 2024-06-22 |
447 | Seg-LSTM: Performance of XLSTM for Semantic Segmentation of Remotely Sensed Images Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Our study found that Vision-LSTM’s performance in semantic segmentation was limited and generally inferior to Vision-Transformers-based and Vision-Mamba-based models in most comparative tests. |
Qinfeng Zhu; Yuanzhi Cai; Lei Fan; | arxiv-cs.CV | 2024-06-20 |
448 | Reparameterizable Dual-Resolution Network for Real-time Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Although existing real-time semantic segmentation models achieve a commendable balance between accuracy and speed, their multi-path blocks still affect overall speed. To address this issue, this study proposes a Reparameterizable Dual-Resolution Network (RDRNet) dedicated to real-time semantic segmentation. |
Guoyu Yang; Yuan Wang; Daming Shi; | arxiv-cs.CV | 2024-06-18 |
449 | RailPC: A Large-scale Railway Point Cloud Semantic Segmentation Dataset Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation in the context of 3D point clouds for the railway environment holds a significant economic value, but its development is severely hindered by the lack of … |
TENGPING JIANG et. al. | CAAI Trans. Intell. Technol. | 2024-06-17 |
450 | Point-Supervised Semantic Segmentation of Natural Scenes Via Hyperspectral Imaging Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Natural scene semantic segmentation is an important task in computer vision. While training accurate models for semantic segmentation relies heavily on detailed and accurate … |
Tianqi Ren; Qiu Shen; Ying Fu; Shaodi You; | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-06-17 |
451 | Generalized Foggy-Scene Semantic Segmentation By Frequency Decoupling Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Foggy-scene semantic segmentation (FSSS) is highly challenging due to the diverse effects of fog on scene properties and the limited training data. Existing research has mainly … |
Qi Bi; Shaodi You; Theo Gevers; | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-06-17 |
452 | OoDIS: Anomaly Instance Segmentation and Detection Benchmark Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We provide a competition and benchmark website under https://vision.rwth-aachen.de/oodis |
ALEXEY NEKRASOV et. al. | arxiv-cs.CV | 2024-06-17 |
453 | GSAM+Cutie: Text-Promptable Tool Mask Annotation for Endoscopic Video Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Machine learning approaches for multi-view geometric scene understanding in endoscopic surgery often assume temporal consistency across the frames to limit challenges that … |
ROGER D. SOBERANIS-MUKUL et. al. | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-06-17 |
454 | Noisy Annotations in Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study sheds light on the quality of segmentation masks produced by various models and challenges the efficacy of popular methods designed to address learning with label noise. |
Moshe Kimhi; Omer Kerem; Eden Grad; Ehud Rivlin; Chaim Baskin; | arxiv-cs.CV | 2024-06-16 |
455 | PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, the actual multi-scale feature fusion often comes with the semantic redundancy issue due to homogeneous semantic contents in pyramid features. To handle this issue, we propose a novel Mamba-based segmentation network, namely PyramidMamba. |
LIBO WANG et. al. | arxiv-cs.CV | 2024-06-16 |
456 | Bias-Compensation Augmentation Learning for Semantic Segmentation in UAV Networks Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In the realm of emergency disaster relief, it is paramount to attain a thorough comprehension of the semantic information associated with the local disaster scene for strategic … |
TIANKUO YU et. al. | IEEE Internet of Things Journal | 2024-06-15 |
457 | Unlocking The Potential of Pre-trained Vision Transformers for Few-Shot Semantic Segmentation Through Relationship Descriptors Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The recent advent of pre-trained vision transformers has unveiled a promising property: their inherent capability to group semantically related visual concepts. In this paper we explore to harnesses this emergent feature to tackle few-shot semantic segmentation a task focused on classifying pixels in a test image with a few example data. |
Ziqin Zhou; Hai-Ming Xu; Yangyang Shu; Lingqiao Liu; | cvpr | 2024-06-13 |
458 | Building A Strong Pre-Training Baseline for Universal 3D Large-Scale Perception Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Such inconsiderate consistency greatly hampers the promising path of reaching an universal pre-training framework: (1) The cross-scene semantic self-conflict \textit i.e. the intense collision between primitive segments of the same semantics from different scenes; (2) Lacking a globally unified bond that pushes the cross-scene semantic consistency into 3D representation learning. To address above challenges we propose a CSC framework that puts a scene-level semantic consistency in the heart bridging the connection of the similar semantic segments across various scenes. |
HAOMING CHEN et. al. | cvpr | 2024-06-13 |
459 | LiSA: LiDAR Localization with Semantic Awareness Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: For example dynamic objects and repeating structures often negatively impact SCR. To address this problem we introduce LiSA the first method that incorporates semantic awareness into SCR to boost the localization robustness and accuracy. |
BOCHUN YANG et. al. | cvpr | 2024-06-13 |
460 | SAI3D: Segment Any Instance in 3D Scenes IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we introduce SAI3D a novel zero-shot 3D instance segmentation approach that synergistically leverages geometric priors and semantic cues derived from Segment Anything Model (SAM). |
YINGDA YIN et. al. | cvpr | 2024-06-13 |
461 | Segment Every Out-of-Distribution Object Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper introduces a method to convert anomaly Score To segmentation Mask called S2M a simple and effective framework for OoD detection in semantic segmentation. |
Wenjie Zhao; Jia Li; Xin Dong; Yu Xiang; Yunhui Guo; | cvpr | 2024-06-13 |
462 | MRFS: Mutually Reinforcing Image Fusion and Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper proposes a coupled learning framework to break the performance bottleneck of infrared-visible image fusion and segmentation called MRFS. |
Hao Zhang; Xuhui Zuo; Jie Jiang; Chunchao Guo; Jiayi Ma; | cvpr | 2024-06-13 |
463 | Lift3D: Zero-Shot Lifting of Any 2D Vision Model to 3D Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we ask the question of whether any 2D vision model can be lifted to make 3D consistent predictions. |
MUKUND VARMA T et. al. | cvpr | 2024-06-13 |
464 | ToNNO: Tomographic Reconstruction of A Neural Network’s Output for Weakly Supervised Segmentation of 3D Medical Images Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a novel approach ToNNO which is based on the Tomographic reconstruction of a Neural Network’s Output. |
Marius Schmidt-Mengin; Alexis Benichoux; Shibeshih Belachew; Nikos Komodakis; Nikos Paragios; | cvpr | 2024-06-13 |
465 | SANeRF-HQ: Segment Anything for NeRF in High Quality Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper we introduce the Segment Anything for NeRF in High Quality (SANeRF-HQ) to achieve high-quality 3D segmentation of any target object in a given scene. |
Yichen Liu; Benran Hu; Chi-Keung Tang; Yu-Wing Tai; | cvpr | 2024-06-13 |
466 | Image-Text Co-Decomposition for Text-Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We notice that there is a discrepancy between text alignment and semantic segmentation: A text often consists of multiple semantic concepts whereas semantic segmentation strives to create semantically homogeneous segments. To address this issue we propose a novel framework Image-Text Co-Decomposition (CoDe) where the paired image and text are jointly decomposed into a set of image regions and a set of word segments respectively and contrastive learning is developed to enforce region-word alignment. |
JI-JIA WU et. al. | cvpr | 2024-06-13 |
467 | Density-Guided Semi-Supervised 3D Semantic Segmentation with Dual-Space Hardness Sampling Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However existing point-to-point contrastive learning techniques in literature are generally sensitive to outliers resulting in insufficient modeling of the point-wise representations. To address this problem we propose a method named DDSemi for semi-supervised 3D semantic segmentation where a density-guided contrastive learning technique is explored. |
Jianan Li; Qiulei Dong; | cvpr | 2024-06-13 |
468 | Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper to solve the mentioned challenge we analyze the gap between the capability of the CLIP model and the requirement of the zero-shot semantic segmentation task. |
Yi Zhang; Meng-Hao Guo; Miao Wang; Shi-Min Hu; | cvpr | 2024-06-13 |
469 | Unsupervised Universal Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose an Unsupervised Universal Segmentation model (U2Seg) adept at performing various image segmentation tasks—instance semantic and panoptic—using a novel unified framework. |
DANTONG NIU et. al. | cvpr | 2024-06-13 |
470 | APSeg: Auto-Prompt Network for Cross-Domain Few-Shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end we propose to leverage the cutting-edge foundation model the Segment Anything Model (SAM) for generalization enhancement. |
WEIZHAO HE et. al. | cvpr | 2024-06-13 |
471 | GSNeRF: Generalizable Semantic Neural Radiance Fields with Enhanced 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work we introduce a Generalizable Semantic Neural Radiance Field (GSNeRF) which uniquely takes image semantics into the synthesis process so that both novel view images and the associated semantic maps can be produced for unseen scenes. |
Zi-Ting Chou; Sheng-Yu Huang; I-Jieh Liu; Yu-Chiang Frank Wang; | cvpr | 2024-06-13 |
472 | Flattening The Parent Bias: Hierarchical Semantic Segmentation in The Poincare Ball Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We find that on the new testing domains a flat (non-hierarchical) segmentation network in which the parents are inferred from the children has superior segmentation accuracy to the hierarchical approach across the board. Complementing these findings and inspired by the intrinsic properties of hyperbolic spaces we study a more principled approach to hierarchical segmentation using the Poincare ball model. |
Simon Weber; Bar?? Zöngür; Nikita Araslanov; Daniel Cremers; | cvpr | 2024-06-13 |
473 | USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The main challenge in open-vocabulary image segmentation now lies in accurately classifying these segments into text-defined categories. In this paper we introduce the Universal Segment Embedding (USE) framework to address this challenge. |
XIAOQI WANG et. al. | cvpr | 2024-06-13 |
474 | Hierarchical Intra-modal Correlation Learning for Label-free 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However these methods usually suffer from inconsistent and noisy pseudo-labels provided by the vision language models. To address this issue we present a hierarchical intra-modal correlation learning framework that captures visual and geometric correlations in 3D scenes at three levels: intra-set intra-scene and inter-scene to help learn more compact 3D representations. |
Xin Kang; Lei Chu; Jiahao Li; Xuejin Chen; Yan Lu; | cvpr | 2024-06-13 |
475 | GP-NeRF: Generalized Perception NeRF for Context-Aware 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However by rendering semantic/instance labels per pixel without considering the contextual information of the rendered image these methods usually suffer from unclear boundary segmentation and abnormal segmentation of pixels within an object. To solve this problem we propose Generalized Perception NeRF (GP-NeRF) a novel pipeline that makes the widely used segmentation model and NeRF work compatibly under a unified framework for facilitating context-aware 3D scene perception. |
HAO LI et. al. | cvpr | 2024-06-13 |
476 | Curriculum Point Prompting for Weakly-Supervised Referring Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Nevertheless we observe that simply integrating SAM yields limited benefits and can even lead to performance regression due to the inevitable noise issues and challenges in excessive focus on object parts. In this paper we present an innovative framework Point PrompTing (PPT) incorporated with the proposed multi-source curriculum learning strategy to address these challenges. |
Qiyuan Dai; Sibei Yang; | cvpr | 2024-06-13 |
477 | Open-World Semantic Segmentation Including Class Similarity Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a novel approach that performs accurate closed-world semantic segmentation and at the same time can identify new categories without requiring any additional training data. |
Matteo Sodano; Federico Magistri; Lucas Nunes; Jens Behley; Cyrill Stachniss; | cvpr | 2024-06-13 |
478 | PanoOcc: Unified Occupancy Representation for Camera-based 3D Panoptic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This divide-and-conquer strategy simplifies the algorithm development process but comes at the cost of losing an end-to-end unified solution to the problem. In this work we address this limitation by studying camera-based 3D panoptic segmentation aiming to achieve a unified occupancy representation for camera-only 3D scene understanding. |
Yuqi Wang; Yuntao Chen; Xingyu Liao; Lue Fan; Zhaoxiang Zhang; | cvpr | 2024-06-13 |
479 | CAT-Seg: Cost Aggregation for Open-Vocabulary Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work we introduce a novel cost-based approach to adapt vision-language foundation models notably CLIP for the intricate task of semantic segmentation. |
SEOKJU CHO et. al. | cvpr | 2024-06-13 |
480 | Benchmarking Segmentation Models with Mask-Preserved Attribute Editing Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Different from the previous evaluation paradigms only in consideration of global attribute variations (e.g. adverse weather) we investigate both local and global attribute variations for robustness evaluation. To achieve this we construct a mask-preserved attribute editing pipeline to edit visual attributes of real images with precise control of structural information. |
Zijin Yin; Kongming Liang; Bing Li; Zhanyu Ma; Jun Guo; | cvpr | 2024-06-13 |
481 | PEM: Prototype-based Efficient MaskFormer for Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To achieve such impressive performance these architectures employ intensive operations and require substantial computational resources which are often not available especially on edge devices. To fill this gap we propose Prototype-based Efficient MaskFormer (PEM) an efficient transformer-based architecture that can operate in multiple segmentation tasks. |
NICCOLÒ CAVAGNERO et. al. | cvpr | 2024-06-13 |
482 | MRFP: Learning Generalizable Semantic Segmentation from Sim-2-Real with Multi-Resolution Feature Perturbation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However the large domain-specific inconsistencies between simulated and real-world data pose a significant generalization challenge in semantic segmentation. In this work to alleviate this problem we propose a novel Multi-Resolution Feature Perturbation (MRFP) technique to randomize domain-specific fine-grained features and perturb style of coarse features. |
Sumanth Udupa; Prajwal Gurunath; Aniruddh Sikdar; Suresh Sundaram; | cvpr | 2024-06-13 |
483 | Style Blind Domain Generalized Semantic Segmentation Via Covariance Alignment and Semantic Consistence Contrastive Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However these approaches struggle with the entanglement of style and content which may lead to the unintentional removal of crucial content information causing performance degradation. This study addresses this limitation by proposing BlindNet a novel DGSS approach that blinds the style without external modules or datasets. |
Woo-Jin Ahn; Geun-Yeong Yang; Hyun-Duck Choi; Myo-Taeg Lim; | cvpr | 2024-06-13 |
484 | SED: A Simple Encoder-Decoder for Open-Vocabulary Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper we propose a simple encoder-decoder named SED for open-vocabulary semantic segmentation which comprises a hierarchical encoder-based cost map generation and a gradual fusion decoder with category early rejection. |
Bin Xie; Jiale Cao; Jin Xie; Fahad Shahbaz Khan; Yanwei Pang; | cvpr | 2024-06-13 |
485 | Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To achieve this semantic knowledge is distilled by learning to correlate randomly sampled features from images across an entire dataset. In this work we build upon these advances by incorporating information about the structure of the scene into the training process through the use of depth information. |
Leon Sick; Dominik Engel; Pedro Hermosilla; Timo Ropinski; | cvpr | 2024-06-13 |
486 | EAGLE: Eigen Aggregation Learning for Object-Centric Unsupervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This technical limitation often leads to inadequate segmentation of complex objects with diverse structures. To address this gap we present a novel approach EAGLE which emphasizes object-centric representation learning for unsupervised semantic segmentation. |
Chanyoung Kim; Woojung Han; Dayun Ju; Seong Jae Hwang; | cvpr | 2024-06-13 |
487 | SatSynth: Augmenting Image-Mask Pairs Through Diffusion Models for Aerial Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work we explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks. |
Aysim Toker; Marvin Eisenberger; Daniel Cremers; Laura Leal-Taixé; | cvpr | 2024-06-13 |
488 | Traffic Scene Parsing Through The TSP6K Dataset Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However little effort has been put into improving the traffic monitoring scene understanding mainly due to the lack of specific datasets. To fill this gap we introduce a specialized traffic monitoring dataset termed TSP6K containing images from the traffic monitoring scenario with high-quality pixel-level and instance-level annotations. |
PENG-TAO JIANG et. al. | cvpr | 2024-06-13 |
489 | Scribble-Supervised Semantic Segmentation with Prototype-based Feature Augmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing methods often ignore the features of classified pixels during feature propagation. To address these limitations, this paper proposes a prototype-based feature augmentation method that leverages feature prototypes to augment scribble supervision. |
Guiyang Chan; Pengcheng Zhang; Hai Dong; Shunhui Ji; Bainian Chen; | icml | 2024-06-12 |
490 | BLO-SAM: Bi-level Optimization Based Finetuning of The Segment Anything Model for Overfitting-Preventing Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Current solutions to these problems, which involve finetuning SAM, often lead to overfitting, a notable issue in scenarios with very limited data, like in medical imaging. To overcome these limitations, we introduce BLO-SAM, which finetunes SAM based on bi-level optimization (BLO). |
Li Zhang; Youwei Liang; Ruiyi Zhang; Amirhosein Javadi; Pengtao Xie; | icml | 2024-06-12 |
491 | SimSAM: Simple Siamese Representations Based Semantic Affinity Matrix for Unsupervised Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Inspired by the non-contrastive SSL approach (SimSiam), we introduce a novel framework SIMSAM to compute the Semantic Affinity Matrix, which is significant for unsupervised image segmentation. |
Chanda Grover Kamra; Indra Deep Mastan; Nitin Kumar; Debayan Gupta; | arxiv-cs.CV | 2024-06-12 |
492 | PanoSSC: Exploring Monocular Panoptic 3D Scene Reconstruction for Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we investigate panoptic segmentation on 3D voxel scenarios and propose an instance-aware occupancy network, PanoSSC. |
YINING SHI et. al. | arxiv-cs.CV | 2024-06-11 |
493 | Beyond Bare Queries: Open-Vocabulary Object Grounding with 3D Scene Graph Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Existing CLIP-based open-vocabulary methods successfully perform 3D object grounding with simple (bare) queries, but cannot cope with ambiguous descriptions that demand an understanding of object relations. To tackle this problem, we propose a modular approach called BBQ (Beyond Bare Queries), which constructs 3D scene graph representation with metric and semantic spatial edges and utilizes a large language model as a human-to-agent interface through our deductive scene reasoning algorithm. |
SERGEY LINOK et. al. | arxiv-cs.CV | 2024-06-11 |
494 | U-Net Ensemble for Enhanced Semantic Segmentation in Remote Sensing Imagery Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of remote sensing imagery stands as a fundamental task within the domains of both remote sensing and computer vision. Its objective is to generate a … |
I. Dimitrovski; Vlatko Spasev; S. Loskovska; Ivan Kitanovski; | Remote. Sens. | 2024-06-08 |
495 | 1st Place Winner of The 2024 Pixel-level Video Understanding in The Wild (CVPR’24 PVUW) Challenge in Video Panoptic Segmentation and Best Long Video Consistency of Video Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper details our research work that achieved the 1st place winner in the PVUW’24 VPS challenge, establishing state of art results in all metrics, including the Video Panoptic Quality (VPQ) and Segmentation and Tracking Quality (STQ). |
Qingfeng Liu; Mostafa El-Khamy; Kee-Bong Song; | arxiv-cs.CV | 2024-06-08 |
496 | USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The main challenge in open-vocabulary image segmentation now lies in accurately classifying these segments into text-defined categories. In this paper, we introduce the Universal Segment Embedding (USE) framework to address this challenge. |
XIAOQI WANG et. al. | arxiv-cs.CV | 2024-06-07 |
497 | 1st Place Solution for MOSE Track in CVPR 2024 PVUW Workshop: Complex Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The motivation behind the MOSE dataset is how to clearly recognize and distinguish objects in complex scenes. In this challenge, we propose a semantic embedding video object segmentation model and use the salient features of objects as query representations. |
Deshui Miao; Xin Li; Zhenyu He; Yaowei Wang; Ming-Hsuan Yang; | arxiv-cs.CV | 2024-06-06 |
498 | Frequency-based Matcher for Long-tailed Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Although the long-tailed phenomenon has been investigated in many fields, e.g., classification and object detection, it has not received enough attention in semantic segmentation and has become a non-negligible obstacle to applying semantic segmentation technology in autonomous driving and virtual reality. Therefore, in this work, we focus on a relatively under-explored task setting, long-tailed semantic segmentation (LTSS). |
Shan Li; Lu Yang; Pu Cao; Liulei Li; Huadong Ma; | arxiv-cs.CV | 2024-06-06 |
499 | Learning Semantic Traversability with Egocentric Video and Automated Annotation Strategy Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we present an effective methodology for training a semantic traversability estimator using egocentric videos and an automated annotation process. |
YUNHO KIM et. al. | arxiv-cs.RO | 2024-06-05 |
500 | DiffCut: Catalyzing Zero-Shot Semantic Segmentation with Diffusion Features and Recursive Normalized Cut Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we use a diffusion UNet encoder as a foundation vision encoder and introduce DiffCut, an unsupervised zero-shot segmentation method that solely harnesses the output features from the final self-attention block. |
Paul Couairon; Mustafa Shukor; Jean-Emmanuel Haugeard; Matthieu Cord; Nicolas Thome; | arxiv-cs.CV | 2024-06-04 |
501 | EAGLE: Efficient Adaptive Geometry-based Learning in Cross-view Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: At present, there are limited studies analyzing cross-view learning. To address this problem, we introduce a novel Unsupervised Cross-view Adaptation Learning approach to modeling the geometric structural change across views in Semantic Scene Understanding. |
THANH-DAT TRUONG et. al. | arxiv-cs.CV | 2024-06-03 |
502 | Diffusion Features to Bridge Domain Gap for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: By leveraging the strength of text-to-image generation capability, we introduce a new training framework designed to implicitly learn posterior knowledge from it. |
YUXIANG JI et. al. | arxiv-cs.CV | 2024-06-02 |
503 | 2nd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In order to deal with the task of video panoptic segmentation in the wild, we propose a robust integrated video panoptic segmentation solution. |
BIAO WU et. al. | arxiv-cs.CV | 2024-06-01 |
504 | PGGNet: Pyramid Gradual-guidance Network for RGB-D Indoor Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
WUJIE ZHOU et. al. | Signal Process. Image Commun. | 2024-06-01 |
505 | Semi-supervised Video Semantic Segmentation Using Unreliable Pseudo Labels for PVUW2024 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we adopt semi-supervised video semantic segmentation method based on unreliable pseudo labels. |
BIAO WU et. al. | arxiv-cs.CV | 2024-06-01 |
506 | Token-word Mixer Meets Object-aware Transformer for Referring Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Zhenliang Zhang; Zhu Teng; Jack Fan; Baopeng Zhang; Jianping Fan; | Pattern Recognit. | 2024-06-01 |
507 | Attention-Based Multi-Kernelized and Boundary-Aware Network for Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Xuanchen Zhou; Gengshen Wu; Xin Sun; Pengpeng Hu; Yi Liu; | Neurocomputing | 2024-06-01 |
508 | MCDS-VSS: Moving Camera Dynamic Scene Video Semantic Segmentation By Filtering with Self-Supervised Geometry and Motion Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose MCDS-VSS, a structured filter model that learns in a self-supervised manner to estimate scene geometry and ego-motion of the camera, while also estimating the motion of external objects. |
Angel Villar-Corrales; Moritz Austermann; Sven Behnke; | arxiv-cs.CV | 2024-05-30 |
509 | DenseSeg: Joint Learning for Semantic Segmentation and Landmark Detection Using Dense Image-to-Shape Representation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Methods: In this work, we propose a dense image-to-shape representation that enables the joint learning of landmarks and semantic segmentation by employing a fully convolutional architecture. |
RON KEUTH et. al. | arxiv-cs.CV | 2024-05-30 |
510 | SemFlow: Binding Semantic Segmentation and Image Synthesis Via Rectified Flow Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: For image synthesis, we propose a finite perturbation approach to enhance the diversity of generated results without changing the semantic categories. |
CHAOYANG WANG et. al. | arxiv-cs.CV | 2024-05-30 |
511 | View-Consistent Hierarchical 3D Segmentation Using Ultrametric Feature Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we address the challenging task of lifting multi-granular and view-inconsistent image segmentations into a hierarchical and 3D-consistent representation. |
Haodi He; Colton Stearns; Adam W. Harley; Leonidas J. Guibas; | arxiv-cs.CV | 2024-05-30 |
512 | Reasoning3D — Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation Via Large Vision-Language Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that transcends limitations for previous category-specific 3D semantic segmentation, 3D instance segmentation, and open-vocabulary 3D segmentation. |
TIANRUN CHEN et. al. | arxiv-cs.CV | 2024-05-29 |
513 | Reasoning3D – Grounding and Reasoning in 3D: Fine-Grained Zero-Shot Open-Vocabulary 3D Reasoning Part Segmentation Via Large Vision-Language Models Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we introduce a new task: Zero-Shot 3D Reasoning Segmentation for parts searching and localization for objects, which is a new paradigm to 3D segmentation that … |
TIANRUN CHEN et. al. | ArXiv | 2024-05-29 |
514 | CRIS: Collaborative Refinement Integrated with Segmentation for Polyp Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we propose an approach that integrates mask refinement and binary semantic segmentation, leveraging a novel collaborative training strategy that surpasses current widely-used refinement strategies. |
Ankush Gajanan Arudkar; Bernard J. E. Evans; | arxiv-cs.CV | 2024-05-29 |
515 | RT-GS2: Real-Time Generalizable Semantic Segmentation for 3D Gaussian Representations of Radiance Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce RT-GS2, the first generalizable semantic segmentation method employing Gaussian Splatting. |
Mihnea-Bogdan Jurca; Remco Royen; Ion Giosan; Adrian Munteanu; | arxiv-cs.CV | 2024-05-28 |
516 | Zero-Shot Video Semantic Segmentation Based on Pre-Trained Diffusion Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce the first zero-shot approach for Video Semantic Segmentation (VSS) based on pre-trained diffusion models. |
QIAN WANG et. al. | arxiv-cs.CV | 2024-05-27 |
517 | Competing for Pixels: A Self-play Algorithm for Weakly-supervised Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Leveraging reinforcement learning (RL) self-play, we propose a novel WSS method that gamifies image segmentation of a ROI. |
SHAHEER U. SAEED et. al. | arxiv-cs.CV | 2024-05-26 |
518 | Multi-view Remote Sensing Image Segmentation With SAM Priors Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Subsequently, we introduce SAM features via a transformer into the INF of the scene, supplementing the semantic information. |
ZIPENG QI et. al. | arxiv-cs.CV | 2024-05-23 |
519 | BiomedParse: A Biomedical Foundation Model for Image Parsing of Everything Everywhere All at Once IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Here, we propose BiomedParse, a biomedical foundation model for imaging parsing that can jointly conduct segmentation, detection, and recognition for 82 object types across 9 imaging modalities. |
THEODORE ZHAO et. al. | arxiv-cs.CV | 2024-05-21 |
520 | Research on Efficient Asymmetric Attention Module for Real-Time Semantic Segmentation Networks in Urban Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Currently, numerous high-precision models have been proposed for semantic segmentation, but the model parameters are large and the segmentation speed is slow. Real-time semantic … |
Xu Su; Lihong Li; Jiejie Xiao; Pengtao Wang; | J. Adv. Comput. Intell. Intell. Informatics | 2024-05-20 |
521 | CLFusion:3D Semantic Segmentation Based on Camera and Lidar Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In the field of autonomous driving, semantic segmentation is crucial for scene understanding. Currently, there are two main methods: camera-based and Lidar-based approaches. To … |
TIANYUE WANG et. al. | 2024 IEEE International Symposium on Circuits and Systems … | 2024-05-19 |
522 | Universal Organizer of SAM for Unsupervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Recently, a robust framework called the segment anything model (SAM) has been proven to deliver precise boundary object masks. Therefore, this paper proposes a universal organizer based on SAM, termed as UO-SAM, to enhance the mask quality of USS models. |
TINGTING LI et. al. | arxiv-cs.MM | 2024-05-19 |
523 | Hybrid Shunted Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Huacong Zhou; Xiangling Xiao; Huihui Li; Xiaoyong Liu; Peng Liang; | Neural Comput. Appl. | 2024-05-18 |
524 | CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose CM-UNet, comprising a CNN-based encoder for extracting local image features and a Mamba-based decoder for aggregating and integrating global information, facilitating efficient semantic segmentation of remote sensing images. |
MUSHUI LIU et. al. | arxiv-cs.CV | 2024-05-17 |
525 | Fourier Boundary Features Network with Wider Catchers for Glass Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We proposed the Fourier Boundary Features Network with Wider Catchers (FBWC), which might be the first attempt to utilize sufficiently wide horizontal shallow branches without vertical deepening for guiding the fine granularity segmentation boundary through primary glass semantic information. |
XIAOLIN QIN et. al. | arxiv-cs.CV | 2024-05-15 |
526 | UDA4Inst: Unsupervised Domain Adaptation for Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce UDA4Inst, a powerful framework for synth-to-real UDA in instance segmentation. |
Yachan Guo; Yi Xiao; Danna Xue; Jose L. Gomez; Antonio M. Lopez; | arxiv-cs.CV | 2024-05-15 |
527 | CLIP with Quality Captions: A Strong Pretraining for Vision Tasks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we find that simply improving the quality of captions in image-text datasets improves the quality of CLIP’s visual representations, resulting in significant improvement on downstream dense prediction vision tasks. |
Pavan Kumar Anasosalu Vasu; Hadi Pouransari; Fartash Faghri; Oncel Tuzel; | arxiv-cs.CV | 2024-05-14 |
528 | Noisy Few-shot 3D Point Cloud Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: 3D scene semantic segmentation plays a crucial role in robotics by enabling robots to understand and interpret their environment in a detailed and context-aware manner, … |
Hao Huang; Shuaihang Yuan; Congcong Wen; Yu Hao; Yi Fang; | 2024 IEEE International Conference on Robotics and … | 2024-05-13 |
529 | Zero Shot Context-Based Object Segmentation Using SLIP (SAM+CLIP) Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present SLIP (SAM+CLIP), an enhanced architecture for zero-shot object segmentation. |
Saaketh Koundinya Gundavarapu; Arushi Arora; Shreya Agarwal; | arxiv-cs.CV | 2024-05-12 |
530 | Global Motion Understanding in Large-Scale Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we show that transferring knowledge from other domains of video understanding combined with large-scale learning can improve robustness of Video Object Segmentation (VOS) under complex circumstances. |
Volodymyr Fedynyak; Yaroslav Romanus; Oles Dobosevych; Igor Babin; Roman Riazantsev; | arxiv-cs.CV | 2024-05-11 |
531 | A Novel Approach to Optimizing Convolutional Neural Networks for Improved Digital Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: To divide a digital image into individual parts that share similar characteristics is known as digital image segmentation, and it is a vital research subject in the field of … |
Kongduo Xing; Junhua Ku; Jie Zhao; | Int. J. Intell. Syst. | 2024-05-08 |
532 | Weakly-supervised Semantic Segmentation Via Dual-stream Contrastive Learning of Cross-image Contextual Information Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Weakly supervised semantic segmentation (WSSS) aims at learning a semantic segmentation model with only image-level tags. |
Qi Lai; Chi-Man Vong; | arxiv-cs.CV | 2024-05-08 |
533 | Exploration of An Open Vocabulary Model on Semantic Segmentation for Street Scene Imagery Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This study investigates the efficacy of an open vocabulary, multi-modal, foundation model for the semantic segmentation of images from complex urban street scenes. Unlike … |
Zichao Zeng; Jan Boehm; | ISPRS Int. J. Geo Inf. | 2024-05-05 |
534 | Few-Shot Fruit Segmentation Via Transfer Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we develop a few-shot semantic segmentation framework for infield fruits using transfer learning. |
Jordan A. James; Heather K. Manching; Amanda M. Hulse-Kemp; William J. Beksi; | arxiv-cs.CV | 2024-05-04 |
535 | Explainable AI (XAI) in Image Segmentation in Medicine, Industry, and Beyond: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present the first comprehensive survey on XAI in semantic image segmentation. |
Rokas Gipiškis; Chun-Wei Tsai; Olga Kurasova; | arxiv-cs.CV | 2024-05-02 |
536 | Domain Adaptive Remote Sensing Image Semantic Segmentation with Prototype Guidance Related Papers Related Patents Related Grants Related Venues Related Experts View |
WANKANG ZENG et. al. | Neurocomputing | 2024-05-01 |
537 | Trimodal Navigable Region Segmentation Model: Grounding Navigation Instructions in Urban Areas Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this study, we develop a model that enables mobilities to have more friendly interactions with users. Specifically, we focus on the referring navigable regions task in which a … |
NAOKI HOSOMI et. al. | IEEE Robotics and Automation Letters | 2024-05-01 |
538 | Recognizing Pawing Behavior of Prepartum Doe Using Semantic Segmentation and Motion History Image (MHI) Features Related Papers Related Patents Related Grants Related Venues Related Experts View |
ZIKANG CHEN et. al. | Expert Syst. Appl. | 2024-05-01 |
539 | An Energy-Efficient, Unified CNN Accelerator for Real-Time Multi-Object Semantic Segmentation for Autonomous Vehicle Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: An energy-efficient, unified convolutional neural network (CNN) accelerator is proposed with a lightweight RGB-D network to achieve real-time, multi-object semantic segmentation … |
Jueun Jung; Seung-Ju Kim; Wuyoung Jang; Bokyoung Seo; K. Lee; | IEEE Transactions on Circuits and Systems I: Regular Papers | 2024-05-01 |
540 | On The Use of GNN-based Structural Information to Improve CNN-based Semantic Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Patty Coupeau; Jean-Baptiste Fasquel; M. Dinomais; | J. Vis. Commun. Image Represent. | 2024-05-01 |
541 | Break The Bias: Delving Semantic Transform Invariance for Few-Shot Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Few-shot semantic segmentation (FSS) aims to segment objects of unseen classes in query images with only a few annotated support images. Existing FSS algorithms typically focus on … |
Qinglong Cao; Yuntian Chen; Chao Ma; Xiaokang Yang; | IEEE Transactions on Circuits and Systems for Video … | 2024-05-01 |
542 | Remote Sensing Image Semantic Segmentation Via Class-guided Structural Interaction and Boundary Perception Related Papers Related Patents Related Grants Related Venues Related Experts View |
Xin He; Yong Zhou; Bing Liu; Jiaqi Zhao; Rui Yao; | Expert Syst. Appl. | 2024-05-01 |
543 | CLFT: Camera-LiDAR Fusion Transformer for Semantic Segmentation in Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Specifically, the vision transformer is the novel ground-breaker that successfully brought the multi-head-attention mechanism to computer vision applications. Therefore, we propose a vision-transformer-based network to carry out camera-LiDAR fusion for semantic segmentation applied to autonomous driving. |
Junyi Gu; Mauro Bellone; Tomáš Pivoňka; Raivo Sell; | arxiv-cs.CV | 2024-04-27 |
544 | Semantic Segmentation Refiner for Ultrasound Applications with Zero-Shot Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we address the performance degradation of segmentation models in low-data regimes and propose a prompt-less segmentation method harnessing the ability of segmentation foundation models to segment abstract shapes. |
HEDDA COHEN INDELMAN et. al. | arxiv-cs.CV | 2024-04-25 |
545 | Boosting Unsupervised Semantic Segmentation with Principal Mask Proposals Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present PriMaPs – Principal Mask Proposals – decomposing images into semantically meaningful masks based on their feature representation. |
Oliver Hahn; Nikita Araslanov; Simone Schaub-Meyer; Stefan Roth; | arxiv-cs.CV | 2024-04-25 |
546 | Semantic Segmentation of Remote Sensing Images Based on Dual-channel Attention Mechanism Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Due to the inadequate utilization of data correlation and complementarity in the feature extraction process of multimodal remote sensing images, the paper proposes a deep learning … |
Jionghui Jiang; Xi’an Feng; Hui Huang; | IET Image Process. | 2024-04-25 |
547 | Survey on Segmentation of Brain Abnormalities in MRI Scan Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: -Medical image segmentation plays an important role in disease monitoring, such as tumor growth, dosage control of medication, and radiation exposure in the human body. Image … |
Idrees Ibraheem Ahmed; Omar M. Hussien Al Okashi; | 2024 21st International Multi-Conference on Systems, … | 2024-04-22 |
548 | OccFeat: Self-supervised Occupancy Feature Prediction for Pretraining BEV Segmentation Networks Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce a self-supervised pretraining method, called OccFeat, for camera-only Bird’s-Eye-View (BEV) segmentation networks. |
SOPHIA SIRKO-GALOUCHENKO et. al. | arxiv-cs.CV | 2024-04-22 |
549 | Clio: Real-time Task-Driven Open-Set 3D Scene Graphs IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While related work implicitly chooses a level of granularity by tuning thresholds for object detection, we argue that such a choice is intrinsically task-dependent. The first contribution of this paper is to propose a task-driven 3D scene understanding problem, where the robot is given a list of tasks in natural language and has to select the granularity and the subset of objects and scene structure to retain in its map that is sufficient to complete the tasks. |
DOMINIC MAGGIO et. al. | arxiv-cs.RO | 2024-04-21 |
550 | Show and Grasp: Few-shot Semantic Segmentation for Robot Grasping Through Zero-shot Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, this often comes at the cost of limited performance and fine-tuning is required to be effective in robot grasping scenarios. In this work, we propose to overcome all these limitations by combining the impressive generalization capability reached by foundation models with a high-performing few-shot classifier, working as a score function to select the segmentation that is closer to the support set. |
Leonardo Barcellona; Alberto Bacchin; Matteo Terreran; Emanuele Menegatti; Stefano Ghidoni; | arxiv-cs.RO | 2024-04-19 |
551 | Weakly Supervised LiDAR Semantic Segmentation Via Scatter Image Annotation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Specifically, we propose employing scatter images to annotate LiDAR point clouds, combining a pre-trained optical flow estimation network with a foundation image segmentation model to rapidly propagate manual annotations into dense labels for both images and point clouds. |
YILONG CHEN et. al. | arxiv-cs.CV | 2024-04-19 |
552 | BACS: Background Aware Continual Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper proposes a Backward Background Shift Detector (BACS) to detect previously observed classes based on their distance in the latent space from the foreground centroids of previous steps. |
Mostafa ElAraby; Ali Harakeh; Liam Paull; | arxiv-cs.CV | 2024-04-19 |
553 | Contrastive Gaussian Clustering: Weakly Supervised 3D Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce Contrastive Gaussian Clustering, a novel approach capable of provide segmentation masks from any viewpoint and of enabling 3D segmentation of the scene. |
Myrna C. Silva; Mahtab Dahaghin; Matteo Toso; Alessio Del Bue; | arxiv-cs.CV | 2024-04-19 |
554 | Group-On: Boosting One-Shot Segmentation with Supportive Query Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel and effective approach for ONE-shot semantic segmentation, called Group-On, which packs multiple query images in batches for the benefit of mutual knowledge support within the same category. |
Hanjing Zhou; Mingze Yin; Danny Chen; Jian Wu; JinTai Chen; | arxiv-cs.CV | 2024-04-17 |
555 | Neuromorphic Vision-based Motion Segmentation with Graph Transformer Neural Network Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a novel event-based motion segmentation algorithm using a Graph Transformer Neural Network, dubbed GTNN. |
Yusra Alkendi; Rana Azzam; Sajid Javed; Lakmal Seneviratne; Yahya Zweiri; | arxiv-cs.CV | 2024-04-16 |
556 | Vocabulary-free Image Classification and Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This assumption is impractical in scenarios with unknown or evolving semantic context. Here, we address this issue and introduce the Vocabulary-free Image Classification (VIC) task, which aims to assign a class from an unconstrained language-induced semantic space to an input image without needing a known vocabulary. |
ALESSANDRO CONTI et. al. | arxiv-cs.CV | 2024-04-16 |
557 | ECLAIR: A High-Fidelity Aerial LiDAR Dataset for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We introduce ECLAIR (Extended Classification of Lidar for AI Recognition), a new outdoor large-scale aerial LiDAR dataset designed specifically for advancing research in point cloud semantic segmentation. |
Iaroslav Melekhov; Anand Umashankar; Hyeong-Jin Kim; Vladislav Serkov; Dusty Argyle; | arxiv-cs.CV | 2024-04-16 |
558 | Conformal Semantic Image Segmentation: Post-hoc Quantification of Predictive Uncertainty Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: We propose a post-hoc, computationally lightweight method to quantify predictive uncertainty in semantic image segmentation. Our approach uses conformal prediction to generate … |
Luca Mossina; Joseba Dalmau; L’eo And’eol; | 2024 IEEE/CVF Conference on Computer Vision and Pattern … | 2024-04-16 |
559 | Cross-Image Distillation for Semi-Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Moreover, scarce annotated data usually exhibits a biased distribution against the desired one, hindering performance improvement. To address these challenging problems, we propose a novel cross-image distillation framework for semi-supervised semantic segmentation. |
N. ZHANG et. al. | icassp | 2024-04-15 |
560 | YOLO-Med : Multi-Task Interaction Network for Biomedical Images Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we propose an efficient end-to-end multi-task network capable of concurrently performing object detection and semantic segmentation called YOLO-Med. |
S. Huang; | icassp | 2024-04-15 |
561 | RD-NERF: Neural Robust Distilled Feature Fields for Sparse-View Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose Neural Robust Distilled Feature Fields (RD-NeRF) for achieving robust 3D semantic feature distillation and 3D consistent scene segmentation with sparse-view labels. |
Y. Ma; B. Dou; T. Zhang; Z. Yuan; | icassp | 2024-04-15 |
562 | Domain-Adaptive Semantic Segmentation Emerges From Vision-Language Supervised Domain-Debiased Self-Training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Even worse, some classes exhibit the extreme domain gap, where the feature distributions undergo a complete shift between the two domains. To alleviate it, we propose a domain-debiased self-training strategy with CLIP to distill its domain-agnostic knowledge. |
H. WANG et. al. | icassp | 2024-04-15 |
563 | Language-Driven Open-Vocabulary 3D Semantic Segmentation with Knowledge Distillation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: 3D open-vocabulary semantic segmentation is a challenge in the task of 3D scene understanding, as most current models trained on closed-set datasets struggle to effectively identify categories that were not seen during training. To address this, we introduce a framework called LSWKD. |
Y. Wu; X. -F. Han; G. Xiao; | icassp | 2024-04-15 |
564 | Semantic Segmentation for Multi-Scene Remote Sensing Images with Noisy Labels Based on Uncertainty Perception Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, a semantic segmentation method for remote sensing images based on uncertainty perception with noisy labels is proposed. |
X. Lyu; L. Zhang; | icassp | 2024-04-15 |
565 | CALSeg: Improving Calibration of Medical Image Segmentation Via Variational Label Smoothing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, medical image segmentation typically relies on hard labels (one-hot vectors), and when minimizing the cross-entropy loss, the model’s softmax predictions are compelled to align with hard labels, resulting in over-confident predictions. To alleviate above problems, this study proposes a novel framework on calibration of medical image segmentation, called CALSeg. |
X. Guo; Y. Yang; C. Ye; G. Cai; T. Ma; | icassp | 2024-04-15 |
566 | The Revenge of BiSeNet: Efficient Multi-Task Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing research has primarily concentrated on single-task settings, especially on semantic segmentation, leading to redundant efforts and specialized architectures for different tasks. To address this limitation, we propose a novel architecture for efficient multi-task image segmentation, capable of handling various segmentation tasks without sacrificing efficiency or accuracy. |
Gabriele Rosi; Claudia Cuttano; Niccolò Cavagnero; Giuseppe Averta; Fabio Cermelli; | arxiv-cs.CV | 2024-04-15 |
567 | Language-Guided Few-Shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose an innovative solution to tackle the challenge of few-shot semantic segmentation using only language information, i.e.image-level text labels. |
J. Wang; Y. Liu; Q. Zhou; F. Wang; | icassp | 2024-04-15 |
568 | SGT: Self-Guided Transformer for Few-Shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, they often overlook the fact that there is variability in different regions of the same object, and intra-image similarity is higher than inter-image similarity. To address these limitations, a Self-Guided Transformer (SGT) is proposed by leveraging intra-image similarity to improve intra-object inconsistencies in this paper. |
K. Ai; H. Hu; Q. Zhou; Q. Guan; | icassp | 2024-04-15 |
569 | Gaga: Group Any Gaussians Via 3D-aware Memory Bank Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce Gaga, a framework that reconstructs and segments open-world 3D scenes by leveraging inconsistent 2D masks predicted by zero-shot class-agnostic segmentation models. |
Weijie Lyu; Xueting Li; Abhijit Kundu; Yi-Hsuan Tsai; Ming-Hsuan Yang; | arxiv-cs.CV | 2024-04-11 |
570 | Exploiting Object-based and Segmentation-based Semantic Features for Deep Learning-based Indoor Scene Classification Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To overcome such issues, gathering semantic information has been shown to be a promising source of information towards a more complete and discriminative feature representation of indoor scenes. Therefore, the work described in this paper uses both semantic information, obtained from object detection, and semantic segmentation techniques. |
Ricardo Pereira; Luís Garrote; Tiago Barros; Ana Lopes; Urbano J. Nunes; | arxiv-cs.CV | 2024-04-11 |
571 | O2V-Mapping: Online Open-Vocabulary Mapping with Neural Implicit Representation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: For the purpose of preserving consistency in 3D object properties across different viewpoints, we propose a spatial adaptive voxel adjustment mechanism and a multi-view weight selection method. |
MUER TIE et. al. | arxiv-cs.CV | 2024-04-10 |
572 | QueSTMaps: Queryable Semantic Topological Maps for 3D Scene Understanding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, such methods struggle to segment out topological regions like kitchen in the scene. In this work, we introduce a two-step pipeline to solve this problem. |
YASH MEHAN et. al. | arxiv-cs.CV | 2024-04-09 |
573 | DaF-BEVSeg: Distortion-aware Fisheye Camera Based Bird’s Eye View Segmentation with Occlusion Reasoning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We extend the model with an occlusion reasoning module, which is critical for estimating in BEV space. |
Senthil Yogamani; David Unger; Venkatraman Narayanan; Varun Ravi Kumar; | arxiv-cs.CV | 2024-04-09 |
574 | Evaluating The Efficacy of Cut-and-Paste Data Augmentation in Semantic Segmentation for Satellite Imagery Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In order to mitigate those issues, our study explores the effectiveness of a Cut-and-Paste augmentation technique for semantic segmentation in satellite images. We adapt this augmentation, which usually requires labeled instances, to the case of semantic segmentation. |
Ionut M. Motoi; Leonardo Saraceni; Daniele Nardi; Thomas A. Ciarfuglia; | arxiv-cs.CV | 2024-04-08 |
575 | D2SL: Decouple Defogging and Semantic Learning for Foggy Domain-Adaptive Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite making some progress, there are still two main drawbacks: (1) the coupling of segmentation and defogging feature representations, resulting in a decrease in semantic representation capability, and (2) the failure to leverage real fog priors in unlabeled foggy data, leading to insufficient model generalization ability. To address these issues, we propose a novel training framework, Decouple Defogging and Semantic learning, called D2SL, aiming to alleviate the adverse impact of defogging tasks on the final segmentation task. |
Xuan Sun; Zhanfu An; Yuyu Liu; | arxiv-cs.CV | 2024-04-07 |
576 | HawkDrive: A Transformer-driven Visual Perception System for Autonomous Driving in Night Scene Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Many established vision perception systems for autonomous driving scenarios ignore the influence of light conditions, one of the key elements for driving safety. To address this problem, we present HawkDrive, a novel perception system with hardware and software solutions. |
Ziang Guo; Stepan Perminov; Mikhail Konenkov; Dzmitry Tsetserukou; | arxiv-cs.CV | 2024-04-06 |
577 | Panoptic Perception: A Novel Task and Fine-grained Dataset for Universal Remote Sensing Image Interpretation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Panoptic Perception, a novel task and a new fine-grained dataset (FineGrip) to achieve a more thorough and universal interpretation for RSIs. |
DANPEI ZHAO et. al. | arxiv-cs.CV | 2024-04-06 |
578 | Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce Sigma, a Siamese Mamba network for multi-modal semantic segmentation, utilizing the Selective Structured State Space Model, Mamba. |
ZIFU WAN et. al. | arxiv-cs.CV | 2024-04-05 |
579 | Background Noise Reduction of Attention Map for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The proposed method successfully reduces background noise, leading to improved accuracy of pseudo labels. |
Izumi Fujimori; Masaki Oono; Masami Shishibori; | arxiv-cs.CV | 2024-04-04 |
580 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Indeed, point cloud and 3D meshes typically have a lower resolution than images and the reconstructed 3D scene geometry might not project well to the underlying 2D image sequences used to compute pixel-aligned CLIP features. To address these challenges, we propose OpenNeRF which naturally operates on posed images and directly encodes the VLM features within the NeRF. |
FRANCIS ENGELMANN et. al. | arxiv-cs.CV | 2024-04-04 |
581 | Flattening The Parent Bias: Hierarchical Semantic Segmentation in The Poincaré Ball Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We find that on the new testing domains, a flat (non-hierarchical) segmentation network, in which the parents are inferred from the children, has superior segmentation accuracy to the hierarchical approach across the board. Complementing these findings and inspired by the intrinsic properties of hyperbolic spaces, we study a more principled approach to hierarchical segmentation using the Poincar\’e ball model. |
Simon Weber; Barış Zöngür; Nikita Araslanov; Daniel Cremers; | arxiv-cs.CV | 2024-04-04 |
582 | A Weakly Supervised End-to-end Framework for Semantic Segmentation of Cancerous Area in Whole Slide Image Related Papers Related Patents Related Grants Related Venues Related Experts View |
Yanbo Feng; Adel Hafiane; Hélène Laurent; | Pattern Anal. Appl. | 2024-04-02 |
583 | Segmentation of Road Negative Obstacles Based on Dual Semantic-Feature Complementary Fusion for Autonomous Driving Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Segmentation of road negative obstacles (i.e., potholes and cracks) is important to the safety of autonomous driving. Although existing RGB-D fusion networks could achieve … |
Zhen Feng; Yanning Guo; Yuxiang Sun; | IEEE Transactions on Intelligent Vehicles | 2024-04-01 |
584 | Smooth Fusion of Multi-spectral Images Via Total Variation Minimization for Traffic Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
YING LI et. al. | Eng. Appl. Artif. Intell. | 2024-04-01 |
585 | Training-Free Semantic Segmentation Via LLM-Supervision Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces a new approach to text-supervised semantic segmentation using supervision by a large language model (LLM) that does not require extra training. |
Wenfang Sun; Yingjun Du; Gaowen Liu; Ramana Kompella; Cees G. M. Snoek; | arxiv-cs.CV | 2024-03-31 |
586 | MedCLIP-SAM: Bridging Text and Image Towards Universal Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a novel framework, called MedCLIP-SAM that combines CLIP and SAM models to generate segmentation of clinical scans using text prompts in both zero-shot and weakly supervised settings. |
Taha Koleilat; Hojat Asgariandehkordi; Hassan Rivaz; Yiming Xiao; | arxiv-cs.CV | 2024-03-29 |
587 | Segmentation Re-thinking Uncertainty Estimation Metrics for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, our investigation identifies three core deficiencies within the PAvPU framework and proposes robust solutions aimed at refining the metric. By addressing these issues, we aim to enhance the reliability and applicability of uncertainty quantification, especially in scenarios that demand high levels of safety and accuracy, thus contributing to the advancement of semantic segmentation methodologies in critical applications. |
Qitian Ma; Shyam Nanda Rai; Carlo Masone; Tatiana Tommasi; | arxiv-cs.AI | 2024-03-28 |
588 | I2CKD : Intra- and Inter-Class Knowledge Distillation for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a new knowledge distillation method tailored for image semantic segmentation, termed Intra- and Inter-Class Knowledge Distillation (I2CKD). |
Ayoub Karine; Thibault Napoléon; Maher Jridi; | arxiv-cs.CV | 2024-03-27 |
589 | Segment Anything Model (SAM) Meets Object Detected Box Prompts Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Segmenting images is an intricate and exceptionally demanding field within computer vision. Instance Segmentation is one of the subfields of image segmentation that segments … |
Erdal Akin; Héctor Caltenco; K. Adewole; Reza Malekian; Jan A. Persson; | 2024 IEEE International Conference on Industrial Technology … | 2024-03-25 |
590 | SatSynth: Augmenting Image-Mask Pairs Through Diffusion Models for Aerial Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks. |
Aysim Toker; Marvin Eisenberger; Daniel Cremers; Laura Leal-Taixé; | arxiv-cs.CV | 2024-03-25 |
591 | Learning Generalized Segmentation for Foggy-Scenes By Bi-directional Wavelet Guidance IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Learning scene semantics that can be well generalized to foggy conditions is important for safety-crucial applications such as autonomous driving. Existing methods need both … |
Qi Bi; Shaodi You; Theo Gevers; | AAAI Conference on Artificial Intelligence | 2024-03-24 |
592 | SM2C: Boost The Semi-supervised Segmentation for Medical Image By Using Meta Pseudo Labels and Mixed Images Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we introduce a novel method called Scaling-up Mix with Multi-Class (SM2C). |
Yifei Wang; Chuhong Zhu; | arxiv-cs.CV | 2024-03-24 |
593 | Improved MLP Point Cloud Processing with High-Dimensional Positional Encoding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Multi-Layer Perceptron (MLP) models are the bedrock of contemporary point cloud processing. However, their complex network architectures obscure the source of their strength. We … |
Yanmei Zou; Hongshan Yu; Zhengeng Yang; Zechuan Li; Naveed Akhtar; | AAAI Conference on Artificial Intelligence | 2024-03-24 |
594 | WeatherProof: Leveraging Language Guidance for Semantic Segmentation in Adverse Weather Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a method to infer semantic segmentation maps from images captured under adverse weather conditions. |
BLAKE GELLA et. al. | arxiv-cs.CV | 2024-03-21 |
595 | MTP: Advancing Remote Sensing Foundation Model Via Multitask Pretraining IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Foundation models have reshaped the landscape of remote sensing (RS) by enhancing various image interpretation tasks. Pretraining is an active research topic, encompassing … |
DI WANG et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2024-03-20 |
596 | MTP: Advancing Remote Sensing Foundation Model Via Multi-Task Pretraining Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, transferring the pretrained models to downstream tasks may encounter task discrepancy due to their formulation of pretraining as image classification or object discrimination tasks. In this study, we explore the Multi-Task Pretraining (MTP) paradigm for RS foundation models to address this issue. |
DI WANG et. al. | arxiv-cs.CV | 2024-03-20 |
597 | CUS3D: A New Comprehensive Urban-Scale Semantic-Segmentation 3D Benchmark Dataset Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the continuous advancement of the construction of smart cities, the availability of large-scale and semantically enriched datasets is essential for enhancing the machine’s … |
LIN GAO et. al. | Remote. Sens. | 2024-03-19 |
598 | Reflectivity Is All You Need!: Advancing LiDAR Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Building upon our previous work, this paper explores the advantages of employing calibrated intensity (also referred to as reflectivity) within learning-based LiDAR semantic segmentation frameworks. |
Kasi Viswanath; Peng Jiang; Srikanth Saripalli; | arxiv-cs.CV | 2024-03-19 |
599 | TTT-KD: Test-Time Training for 3D Semantic Segmentation Through Knowledge Distillation from Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose the first TTT method for 3D semantic segmentation, TTT-KD, which models Knowledge Distillation (KD) from foundation models (e.g. DINOv2) as a self-supervised objective for adaptation to distribution shifts at test-time. |
Lisa Weijler; Muhammad Jehanzeb Mirza; Leon Sick; Can Ekkazan; Pedro Hermosilla; | arxiv-cs.CV | 2024-03-18 |
600 | BEVCar: Camera-Radar Fusion for BEV Map and Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Semantic scene segmentation from a bird’s-eye-view (BEV) perspective plays a crucial role in facilitating planning and decision-making for mobile robots. Although recent … |
JONAS SCHRAMM et. al. | ArXiv | 2024-03-18 |
601 | Segment Any Object Model (SAOM): Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To allow our Segment Any Object Model (SAOM) to work in the everything mode, we propose the novel nearest neighbour assignment method, updating point embeddings for each ground-truth mask. |
MARIIA KHAN et. al. | arxiv-cs.CV | 2024-03-15 |
602 | TransLandSeg: A Transfer Learning Approach for Landslide Semantic Segmentation Based on Vision Foundation Model Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose TransLandSeg, which is a transfer learning approach for landslide semantic segmentation based on a vision foundation model (VFM). |
CHANGHONG HOU et. al. | arxiv-cs.CV | 2024-03-15 |
603 | Annotation Free Semantic Segmentation with Vision Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we generate free annotations for any semantic segmentation dataset using existing foundation models. |
Soroush Seifi; Daniel Olmeda Reino; Fabien Despinoy; Rahaf Aljundi; | arxiv-cs.CV | 2024-03-14 |
604 | ASPP+-LANet: A Multi-Scale Context Extraction Network for Semantic Segmentation of High-Resolution Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of remote sensing (RS) images is a pivotal branch in the realm of RS image processing, which plays a significant role in urban planning, building extraction, … |
Lei Hu; Xun Zhou; Jiachen Ruan; Supeng Li; | Remote. Sens. | 2024-03-14 |
605 | When Semantic Segmentation Meets Frequency Aliasing Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing research only separates an image into easy and hard regions and empirically observes the latter are associated with object boundaries. In this paper, we conduct a comprehensive analysis of hard pixel errors, categorizing them into three types: false responses, merging mistakes, and displacements. |
Linwei Chen; Lin Gu; Ying Fu; | arxiv-cs.CV | 2024-03-13 |
606 | Average Calibration Error: A Differentiable Loss for Improved Reliability in Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose to use marginal L1 average calibration error (mL1-ACE) as a novel auxiliary loss function to improve pixel-wise calibration without compromising segmentation quality. |
Theodore Barfoot; Luis Garcia-Peraza-Herrera; Ben Glocker; Tom Vercauteren; | arxiv-cs.CV | 2024-03-11 |
607 | Refining Segmentation On-the-Fly: An Interactive Framework for Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Concretely, we presents the first interactive framework for point cloud semantic segmentation, named InterPCSeg, which seamlessly integrates with off-the-shelf semantic segmentation networks without offline re-training, enabling it to run in an on-the-fly manner. |
Peng Zhang; Ting Wu; Jinsheng Sun; Weiqing Li; Zhiyong Su; | arxiv-cs.CV | 2024-03-10 |
608 | Multi-Grained Cross-modal Alignment for Learning Open-vocabulary Semantic Segmentation from Text Supervision Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we introduce a Multi-Grained Cross-modal Alignment (MGCA) framework, which explicitly learns pixel-level alignment along with object- and region-level alignment to bridge the granularity gap without any dense annotations. |
Yajie Liu; Pu Ge; Qingjie Liu; Di Huang; | arxiv-cs.CV | 2024-03-06 |
609 | Semi-Supervised Semantic Segmentation Based on Pseudo-Labels: A Survey Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This review aims to provide a first comprehensive and organized overview of the state-of-the-art research results on pseudo-label methods in the field of semi-supervised semantic segmentation, which we categorize from different perspectives and present specific methods for specific application areas. |
Lingyan Ran; Yali Li; Guoqiang Liang; Yanning Zhang; | arxiv-cs.CV | 2024-03-04 |
610 | RISeg: Robot Interactive Object Segmentation Via Body Frame-Invariant Features Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In order to successfully perform manipulation tasks in new environments, such as grasping, robots must be proficient in segmenting unseen objects from the background and/or other … |
HOWARD H. QIAN et. al. | 2024 IEEE International Conference on Robotics and … | 2024-03-04 |
611 | Auxiliary Tasks Enhanced Dual-affinity Learning for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Prior works have commonly used an off-line heuristic thresholding process that combines the CAM maps with off-the-shelf saliency maps produced by a general pre-trained saliency model to produce more accurate pseudo-segmentation labels. We propose AuxSegNet+, a weakly supervised auxiliary learning framework to explore the rich information from these saliency maps and the significant inter-task correlation between saliency detection and semantic segmentation. |
LIAN XU et. al. | arxiv-cs.CV | 2024-03-02 |
612 | Building Energy Efficient Semantic Segmentation in Intelligent Edge Computing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is a critical area in computer vision, which needs voluminous image data streaming from user devices. Usually, it is challenging to process semantic … |
Xingyu Yuan; He Li; K. Ota; M. Dong; | IEEE Transactions on Green Communications and Networking | 2024-03-01 |
613 | FGMNet: Feature Grouping Mechanism Network for RGB-D Indoor Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Yuming Zhang; Wujie Zhou; L. Ye; Lu Yu; Ting Luo; | Digit. Signal Process. | 2024-03-01 |
614 | Contrastive Learning-based Knowledge Distillation for RGB-thermal Urban Scene Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Xiaodong Guo; Wujie Zhou; Tong Liu; | Knowl. Based Syst. | 2024-03-01 |
615 | FusionVision: A Comprehensive Approach of 3D Object Reconstruction and Segmentation from RGB-D Cameras Using YOLO and Fast Segment Anything Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In the realm of computer vision, the integration of advanced techniques into the processing of RGB-D camera inputs poses a significant challenge, given the inherent complexities arising from diverse environmental conditions and varying object appearances. Therefore, this paper introduces FusionVision, an exhaustive pipeline adapted for the robust 3D segmentation of objects in RGB-D imagery. |
Safouane El Ghazouali; Youssef Mhirit; Ali Oukhrid; Umberto Michelucci; Hichem Nouira; | arxiv-cs.CV | 2024-02-29 |
616 | PEM: Prototype-based Efficient MaskFormer for Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To achieve such impressive performance, these architectures employ intensive operations and require substantial computational resources, which are often not available, especially on edge devices. To fill this gap, we propose Prototype-based Efficient MaskFormer (PEM), an efficient transformer-based architecture that can operate in multiple segmentation tasks. |
NICCOLÒ CAVAGNERO et. al. | arxiv-cs.CV | 2024-02-29 |
617 | YOLO-MED : Multi-Task Interaction Network for Biomedical Images Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we propose an efficient end-to-end multi-task network capable of concurrently performing object detection and semantic segmentation called YOLO-Med. |
SUIZHI HUANG et. al. | arxiv-cs.CV | 2024-02-29 |
618 | An Automated Learning Method of Semantic Segmentation for Train Autonomous Driving Environment Understanding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This article proposes an automated machine learning method for semantic segmentation that can be used for automated training of models in fields such as autonomous driving. This … |
Yang Wang; Jin Zhang; Yihao Chen; Hao Yuan; Cheng Wu; | IEEE Transactions on Industrial Informatics | 2024-02-29 |
619 | Spannotation: Enhancing Semantic Segmentation for Autonomous Navigation with Efficient Image Annotation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Unlike other popular annotation tools that requires about 40 seconds to annotate an image for semantic segmentation in a typical navigation task, Spannotation achieves similar result in about 6.03 seconds. The tools utility was validated through the utilization of its generated masks to train a U-Net model which achieved a validation accuracy of 98.27% and mean Intersection Over Union (mIOU) of 96.66%. |
Samuel O. Folorunsho; William R. Norris; | arxiv-cs.CV | 2024-02-28 |
620 | DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present DFormer, a novel RGB-D pretraining framework to learn transferable representations for RGB-D segmentation tasks. |
BOWEN YIN et. al. | iclr | 2024-02-26 |
621 | BLO-SAM: Bi-level Optimization Based Overfitting-Preventing Finetuning of SAM Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Current solutions to these problems, which involve finetuning SAM, often lead to overfitting, a notable issue in scenarios with very limited data, like in medical imaging. To overcome these limitations, we introduce BLO-SAM, which finetunes SAM based on bi-level optimization (BLO). |
Li Zhang; Youwei Liang; Ruiyi Zhang; Amirhosein Javadi; Pengtao Xie; | arxiv-cs.CV | 2024-02-26 |
622 | Rainy Day Image Semantic Segmentation Based on Two-stage Progressive Network Related Papers Related Patents Related Grants Related Venues Related Experts View |
Heng Zhang; Dongli Jia; Hui Ma; | Vis. Comput. | 2024-02-26 |
623 | P2Seg: Pointly-supervised Segmentation Via Mutual Distillation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we design a Mutual Distillation Module (MDM) to leverage the complementary strengths of both instance position and semantic information and achieve accurate instance-level object perception. |
ZIPENG WANG et. al. | iclr | 2024-02-26 |
624 | EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, generating fine-grained segmentation masks with diffusion models often requires additional training on annotated datasets, leaving it unclear to what extent pre-trained diffusion models alone understand the semantic relations of their generated images. To address this question, we leverage the semantic knowledge extracted from Stable Diffusion (SD) and aim to develop an image segmentor capable of generating fine-grained segmentation maps without any additional training. |
Koichi Namekata; Amirmojtaba Sabour; Sanja Fidler; Seung Wook Kim; | iclr | 2024-02-26 |
625 | ConSept: Continual Semantic Segmentation Via Adapter-based Vision Transformer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we delve into the realm of vision transformers for continual semantic segmentation, a problem that has not been sufficiently explored in previous literature. … |
Bowen Dong; Guanglei Yang; W. Zuo; Lei Zhang; | ArXiv | 2024-02-26 |
626 | Placing Objects in Context Via Inpainting for Out-of-distribution Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose the Placing Objects in Context (POC) pipeline to realistically add any object into any image via diffusion models. |
Pau de Jorge; Riccardo Volpi; Puneet K. Dokania; Philip H. S. Torr; Gregory Rogez; | arxiv-cs.CV | 2024-02-26 |
627 | Convolution Meets LoRA: Parameter Efficient Finetuning for Segment Anything Model IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While it exhibits remarkable zero-shot generalization in typical scenarios, its advantage diminishes when applied to specialized domains like medical imagery and remote sensing. To address this limitation, this paper introduces Conv-LoRA, a simple yet effective parameter-efficient fine-tuning approach. |
Zihan Zhong; Zhiqiang Tang; Tong He; Haoyang Fang; Chun Yuan; | iclr | 2024-02-26 |
628 | OpenNeRF: Open Set 3D Neural Scene Segmentation with Pixel-Wise Features and Rendered Novel Views IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Indeed, point cloud and 3D meshes typically have a lower resolution than images and the reconstructed 3D scene geometry might not project well to the underlying 2D image sequences used to compute pixel-aligned CLIP features. To address these challenges, we propose OpenNeRF which naturally operates on posed images and directly encodes the VLM features within the NeRF. |
Francis Engelmann; Fabian Manhardt; Michael Niemeyer; Keisuke Tateno; Federico Tombari; | iclr | 2024-02-26 |
629 | Task Specific Pretraining with Noisy Labels for Remote Sensing Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose to exploit noisy semantic segmentation maps for model pretraining. |
Chenying Liu; Conrad M Albrecht; Yi Wang; Xiao Xiang Zhu; | arxiv-cs.CV | 2024-02-25 |
630 | Cross-CBAM: A Lightweight Network for Real-time Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Zhengbin Zhang; Zhenhao Xu; Xingsheng Gu; Juan Xiong; | J. Real Time Image Process. | 2024-02-24 |
631 | A New CNN-based Semantic Object Segmentation for Autonomous Vehicles in Urban Traffic Scenes Related Papers Related Patents Related Grants Related Venues Related Experts View |
Gürkan Doğan; B. Ergen; | Int. J. Multim. Inf. Retr. | 2024-02-23 |
632 | QIS : Interactive Segmentation Via Quasi-Conformal Mappings Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose the quasi-conformal interactive segmentation (QIS) model, which incorporates user input in the form of positive and negative clicks. |
Han Zhang; Daoping Zhang; Lok Ming Lui; | arxiv-cs.CV | 2024-02-22 |
633 | DeiSAM: Segment Anything with Deictic Prompting Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, deep learning approaches cannot reliably interpret such deictic representations due to their lack of reasoning capabilities in complex scenarios. To remedy this issue, we propose DeiSAM — a combination of large pre-trained neural networks with differentiable logic reasoners — for deictic promptable segmentation. |
HIKARU SHINDO et. al. | arxiv-cs.LG | 2024-02-21 |
634 | Multi-Modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To this end, this paper proposes a simple yet effective scene-level weakly supervised point cloud segmentation method with a newly introduced multi-modality point affinity inference module. |
XIAWEI LI et. al. | aaai | 2024-02-20 |
635 | W2P: Switching from Weak Supervision to Partial Supervision for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper underscores the significant influence of noisy pseudo-labels on segmentation network performance, particularly in boundary region. To address above issues, we introduce a novel paradigm: Weak to Partial Supervision (W2P). |
Fangyuan Zhang; Tianxiang Pan; Jun-Hai Yong; Bin Wang; | aaai | 2024-02-20 |
636 | A Novel Radial Kernel Watershed Basis Segmentation Algorithm for Color Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
C. Kumari; A. Mustafi; | Wireless Personal Communications | 2024-02-20 |
637 | CGMGM: A Cross-Gaussian Mixture Generative Model for Few-Shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Consequently, they result in incomplete segmentation of foreground objects and mis-segmentation of the complex background. To overcome this issue, we propose the Cross Gaussian Mixture Generative Model (CGMGM), a novel Gaussian Mixture Models~(GMMs)-based FSS method, which establishes the joint distribution of pixel and category in both the support and query images. |
JUNAO SHEN et. al. | aaai | 2024-02-20 |
638 | X4D-SceneFormer: Enhanced Scene Understanding on 4D Point Cloud Videos Through Cross-Modal Knowledge Transfer Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Moreover, the irregularity of point cloud poses a difficulty in aligning temporal information within video sequences. To address these issues, we propose a novel cross-modal knowledge transfer framework, called X4D-SceneFormer. |
LINGLIN JING et. al. | aaai | 2024-02-20 |
639 | Less Is More: Label Recommendation for Weakly Supervised Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, unconstrained or heuristic weakly supervised annotation forms may lead to suboptimal label efficiency. To address this issue, we propose a novel label recommendation framework for weakly supervised point cloud semantic segmentation. |
Zhiyi Pan; Nan Zhang; Wei Gao; Shan Liu; Ge Li; | aaai | 2024-02-20 |
640 | Progressive Feature Self-Reinforcement for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: A typical manifestation is the diminished precision on object boundaries, leading to deteriorated accuracy of WSSS. To alleviate this issue, we propose to adaptively partition the image content into certain regions (e.g., confident foreground and background) and uncertain regions (e.g., object boundaries and misclassified categories) for separate processing. |
JINGXUAN HE et. al. | aaai | 2024-02-20 |
641 | Semantic-Aware Autoregressive Image Modeling for Visual Representation Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This is mainly caused by the challenge that images are not sequential signals and lack a natural order when applying autoregressive modeling. In this study, inspired by human beings’ way of grasping an image, i.e., focusing on the main object first, we present a semantic-aware autoregressive image modeling (SemAIM) method to tackle this challenge. |
Kaiyou Song; Shan Zhang; Tong Wang; | aaai | 2024-02-20 |
642 | Learning Content-Enhanced Mask Transformer for Domain Generalized Urban-Scene Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a Content-enhanced Mask TransFormer (CMFormer) for domain-generalized USSS. |
Qi Bi; Shaodi You; Theo Gevers; | aaai | 2024-02-20 |
643 | Variance-Insensitive and Target-Preserving Mask Refinement for Interactive Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce a novel method, Variance-Insensitive and Target-Preserving Mask Refinement to enhance segmentation quality with fewer user inputs. |
CHAOWEI FANG et. al. | aaai | 2024-02-20 |
644 | Weakly Supervised Semantic Segmentation for Driving Scenes Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose solutions for each issue as follows. |
Dongseob Kim; Seungho Lee; Junsuk Choe; Hyunjung Shim; | aaai | 2024-02-20 |
645 | Real-time 3D Semantic Scene Perception for Egocentric Robots with Binocular Vision Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we present an end-to-end pipeline with instance segmentation, feature matching, and point-set registration for egocentric robots with binocular vision, and demonstrate the robot’s grasping capability through the proposed pipeline. |
K. Nguyen; T. Dang; M. Huber; | arxiv-cs.RO | 2024-02-19 |
646 | ISCUTE: Instance Segmentation of Cables Using Text Embedding Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a foundation model-based DLO instance segmentation technique that is text-promptable and user-friendly. |
Shir Kozlovsky; Omkar Joglekar; Dotan Di Castro; | arxiv-cs.CV | 2024-02-19 |
647 | Semantically-aware Neural Radiance Fields for Visual Scene Understanding: A Comprehensive Review Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This review thoroughly examines the role of semantically-aware Neural Radiance Fields (NeRFs) in visual scene understanding, covering an analysis of over 250 scholarly papers. |
Thang-Anh-Quan Nguyen; Amine Bourki; Mátyás Macudzinski; Anthony Brunel; Mohammed Bennamoun; | arxiv-cs.CV | 2024-02-16 |
648 | BEFUnet: A Hybrid CNN-Transformer Architecture for Precise Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While transformers have achieved state-of-the-art results in natural language processing and image recognition, they face challenges in medical image segmentation due to image locality and translational invariance issues. To address these challenges, this paper proposes an innovative U-shaped network called BEFUnet, which enhances the fusion of body and edge information for precise medical image segmentation. |
Omid Nejati Manzari; Javad Mirzapour Kaleybar; Hooman Saadat; Shahin Maleki; | arxiv-cs.CV | 2024-02-13 |
649 | Moving Object Proposals with Deep Learned Optical Flow for Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a state of art architecture of neural networks to accurately and efficiently get the moving object proposals (MOP). |
Ge Shi; Zhili Yang; | arxiv-cs.CV | 2024-02-13 |
650 | Hybridnet for Depth Estimation and Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, depth estimation and semantic segmentation are addressed together from a single input image through a hybrid convolutional network. |
Dalila Sánchez-Escobedo; Xiao Lin; Josep R. Casas; Montse Pardàs; | arxiv-cs.CV | 2024-02-09 |
651 | Early Fusion of Features for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces a novel segmentation framework that integrates a classifier network with a reverse HRNet architecture for efficient image segmentation. |
Anupam Gupta; Ashok Krishnamurthy; Lisa Singh; | arxiv-cs.CV | 2024-02-08 |
652 | Quasi-Dense Matching for Oblique Stereo Images Through Semantic Segmentation and Local Feature Enhancement Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper proposes a quasi-dense feature matching algorithm that combines image semantic segmentation and local feature enhancement networks to address the problem of the poor … |
GUOBIAO YAO et. al. | Remote. Sens. | 2024-02-08 |
653 | Multi-Scale Semantic Segmentation with Modified MBConv Blocks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper introduces a novel adaptation of MBConv blocks specifically tailored for semantic segmentation. |
Xi Chen; Yang Cai; Yuan Wu; Bo Xiong; Taesung Park; | arxiv-cs.CV | 2024-02-07 |
654 | On The Effect of Image Resolution on Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we demonstrate that a streamlined model capable of directly producing high-resolution segmentations can match the performance of more complex systems that generate lower-resolution results. |
Ritambhara Singh; Abhishek Jain; Pietro Perona; Shivani Agarwal; Junfeng Yang; | arxiv-cs.CV | 2024-02-07 |
655 | SGS-SLAM: Semantic Gaussian Splatting For Neural Dense SLAM IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present SGS-SLAM, the first semantic visual SLAM system based on Gaussian Splatting. |
MINGRUI LI et. al. | arxiv-cs.CV | 2024-02-05 |
656 | Instance Segmentation XXL-CT Challenge of A Historic Airplane Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The challenge aimed to explore automatic or interactive instance segmentation methods for an efficient delineation of the different aircraft components, such as screws, rivets, metal sheets or pressure tubes. We report the organization and outcome of this challenge and describe the capabilities and limitations of the submitted segmentation methods. |
ROLAND GRUBER et. al. | arxiv-cs.CV | 2024-02-05 |
657 | Applying Unsupervised Semantic Segmentation to High-Resolution UAV Imagery for Enhanced Road Scene Parsing Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: There are two challenges presented in parsing road scenes from UAV images: the complexity of processing high-resolution images and the dependency on extensive manual annotations required by traditional supervised deep learning methods to train robust and accurate models. In this paper, a novel unsupervised road parsing framework that leverages advancements in vision language models with fundamental computer vision techniques is introduced to address these critical challenges. |
Zihan Ma; Yongshang Li; Ronggui Ma; Chen Liang; | arxiv-cs.CV | 2024-02-05 |
658 | Few-Shot Semantic Segmentation for Consumer Electronics: An Inter-Class Relation Mining Approach Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Few-shot semantic segmentation (FSS), which can perform segmentation using only a limited number of annotated examples, is a promising technique that has been embedded in many … |
HUAFEI HUANG et. al. | IEEE Transactions on Consumer Electronics | 2024-02-01 |
659 | Vision-enhanced Peg-in-Hole for Automotive Body Parts Using Semantic Image Segmentation and Object Detection Related Papers Related Patents Related Grants Related Venues Related Experts View |
M. SILEO et. al. | Eng. Appl. Artif. Intell. | 2024-02-01 |
660 | Multi-branch Residual Image Semantic Segmentation Combined with Inverse Weight Gated-control Related Papers Related Patents Related Grants Related Venues Related Experts View |
Haicheng Qu; Xiaona Wang; Ying Wang; Yao Chen; | Image Vis. Comput. | 2024-02-01 |
661 | An Adaptive Post-Processing Network With The Global-Local Aggregation for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Current semantic segmentation methods mainly focus on modeling the context of the global image to obtain high-quality segmentation results. However, they ignore the role of local … |
GUILIN ZHU et. al. | IEEE Transactions on Circuits and Systems for Video … | 2024-02-01 |
662 | Low-Rank Sparse Generative Adversarial Unsupervised Domain Adaptation for Multitarget Traffic Scene Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation in diverse real-world traffic scenes is a challenging task for autonomous vehicles to have a reliable understanding of the outside environment. Although deep … |
M. Saffari; Mahdi Khodayar; | IEEE Transactions on Industrial Informatics | 2024-02-01 |
663 | Small Target Augmentation for Urban Remote Sensing Image Real-Time Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Urban remote sensing (URS) image segmentation is very important for many applications from automotive navigation to infrastructure monitoring, and urban management. There are … |
Shasha Ren; Qiong Liu; | IEEE Transactions on Intelligent Transportation Systems | 2024-02-01 |
664 | FRPNet: An Improved Faster-ResNet with PASPP for Real-time Semantic Segmentation in The Unstructured Field Scene Related Papers Related Patents Related Grants Related Venues Related Experts View |
BIAO YANG et. al. | Comput. Electron. Agric. | 2024-02-01 |
665 | Multi-Level Medical Image Segmentation Network Based on Multi-Scale and Context Information Fusion Strategy Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Accurate segmentation of human tissue structure from medical images is one of the critical links in medical image diagnosis. However, due to the medical image scale of different … |
DAYU TAN et. al. | IEEE Transactions on Emerging Topics in Computational … | 2024-02-01 |
666 | SubPipe: A Submarine Pipeline Inspection Dataset for Segmentation and Visual-inertial Localization Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents SubPipe, an underwater dataset for SLAM, object detection, and image segmentation. |
OLAYA ÁLVAREZ-TUÑÓN et. al. | arxiv-cs.RO | 2024-01-31 |
667 | CAFCT-Net: A CNN-Transformer Hybrid Network with Contextual and Attentional Feature Fusion for Liver Tumor Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a Contextual and Attentional feature Fusions enhanced Convolutional Neural Network (CNN) and Transformer hybrid network (CAFCT-Net) for liver tumor segmentation. |
Ming Kang; Chee-Ming Ting; Fung Fung Ting; Raphaël Phan; | arxiv-cs.CV | 2024-01-30 |
668 | Synthetic Data Enables Faster Annotation and Robust Segmentation for Multi-object Grasping in Clutter Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we propose a synthetic data generation method that minimizes human intervention and makes downstream image segmentation algorithms more robust by combining a generated synthetic dataset with a smaller real-world dataset (hybrid dataset). |
Dongmyoung Lee; Wei Chen; Nicolas Rojas; | arxiv-cs.CV | 2024-01-24 |
669 | Self-Supervised Vision Transformers Are Efficient Segmentation Learners for Imperfect Labels Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This study demonstrates a cost-effective approach to semantic segmentation using self-supervised vision transformers (SSVT). |
Seungho Lee; Seoungyoon Kang; Hyunjung Shim; | arxiv-cs.CV | 2024-01-23 |
670 | DatUS^2: Data-driven Unsupervised Semantic Segmentation with Pre-trained Self-supervised Vision Transformer Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, unsupervised dense semantic segmentation has not been explored as a downstream task, which can utilize and evaluate the quality of semantic information introduced in patch-level feature representations during self-supervised training of a vision transformer. Therefore, this paper proposes a novel data-driven approach for unsupervised semantic segmentation (DatUS^2) as a downstream task. |
Sonal Kumar; Arijit Sur; Rashmi Dutta Baruah; | arxiv-cs.CV | 2024-01-23 |
671 | Semantic Prompt Learning for Weakly-Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To address the issues, we propose a Semantic Prompt Learning for WSSS (SemPLeS) framework, which learns to effectively prompt the CLIP latent space to enhance the semantic alignment between the segmented regions and the target object categories. |
Ci-Siang Lin; Chien-Yi Wang; Yu-Chiang Frank Wang; Min-Hung Chen; | arxiv-cs.CV | 2024-01-22 |
672 | Concealed Object Segmentation with Hierarchical Coherence Modeling Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Despite achieving remarkable success, existing COS segmenters still struggle to achieve complete segmentation results in extremely concealed scenarios. In this paper, we propose a Hierarchical Coherence Modeling (HCM) segmenter for COS, aiming to address this incomplete segmentation limitation. |
Fengyang Xiao; Pan Zhang; Chunming He; Runze Hu; Yutao Liu; | arxiv-cs.CV | 2024-01-22 |
673 | MetaSeg: Content-Aware Meta-Net for Omni-Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Inspired by recent advances in meta learning, we argue that rather than struggling to tolerate noise hidden behind clean labels passively, a more feasible solution would be to find out the noisy regions actively, so as to simply ignore them during model optimization. With this in mind, this work presents a novel meta learning based semantic segmentation method, MetaSeg, that comprises a primary content-aware meta-net (CAM-Net) to sever as a noise indicator for an arbitrary segmentation model counterpart. |
SHENWANG JIANG et. al. | arxiv-cs.CV | 2024-01-22 |
674 | S$^{3}$M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation and stereo matching are two essential components of 3D environmental perception systems for autonomous driving. Nevertheless, conventional approaches often … |
ZHIYUAN WU et. al. | IEEE Transactions on Intelligent Vehicles | 2024-01-21 |
675 | S$^3$M-Net: Joint Learning of Semantic Segmentation and Stereo Matching for Autonomous Driving Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Hence, in this article, we introduce S$^3$M-Net, a novel joint learning framework developed to perform semantic segmentation and stereo matching simultaneously. |
ZHIYUAN WU et. al. | arxiv-cs.CV | 2024-01-21 |
676 | Spatial Structure Constraints for Weakly Supervised Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose spatial structure constraints (SSC) for weakly supervised semantic segmentation to alleviate the unwanted object over-activation of attention expansion. |
TAO CHEN et. al. | arxiv-cs.CV | 2024-01-20 |
677 | XAI-Enhanced Semantic Segmentation Models for Visual Quality Inspection Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents a framework to bolster visual quality inspection by using CAM-based explanations to refine semantic segmentation models. |
Tobias Clement; Truong Thanh Hung Nguyen; Mohamed Abdelaal; Hung Cao; | arxiv-cs.CV | 2024-01-18 |
678 | OMG-Seg: Is One Model Good Enough For All Segmentation? IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we address various segmentation tasks, each traditionally tackled by distinct or partially unified models. |
XIANGTAI LI et. al. | arxiv-cs.CV | 2024-01-18 |
679 | Uncertainty Estimates for Semantic Segmentation: Providing Enhanced Reliability for Automated Motor Claims Handling Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We explore the use of a meta-classification model to empirically assess the precision of segments predicted by a model trained for the semantic segmentation of car body parts. |
Jan Küchler; Daniel Kröll; Sebastian Schoenen; Andreas Witte; | arxiv-cs.CV | 2024-01-17 |
680 | Importance-Aware Image Segmentation-based Semantic Communication for Autonomous Driving Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This article studies the problem of image segmentation-based semantic communication in autonomous driving. In real traffic scenes, detecting the key objects (e.g., vehicles, … |
JIE LV et. al. | ArXiv | 2024-01-16 |
681 | Semantic Scene Segmentation for Robotics IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, the recent advances in deep learning combined with the boost in the computational capacity and the availability of large-scale labeled datasets have led to significant advances in semantic segmentation. In this chapter, we introduce the task of semantic segmentation and present the deep learning techniques that have been proposed to address this task over the years. |
Juana Valeria Hurtado; Abhinav Valada; | arxiv-cs.RO | 2024-01-15 |
682 | Learning Segmented 3D Gaussians Via Efficient Feature Unprojection for Zero-shot Neural Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This issue stems primarily from their redundant learnable attributes assigned on individual Gaussians, leading to a lack of robustness against the 3D-inconsistencies in zero-shot generated raw labels. To address this problem, our work, named Compact Segmented 3D Gaussians (CoSegGaussians), proposes the Feature Unprojection and Fusion module as the segmentation field, which utilizes a shallow decoder generalizable for all Gaussians based on high-level features. |
Bin Dou; Tianyu Zhang; Zhaohui Wang; Yongjia Ma; Zejian Yuan; | arxiv-cs.CV | 2024-01-11 |
683 | Attention-based Prohibited Item Detection in X-ray Images During Security Checking Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper focuses on the intelligent detection of prohibited items in X‐ray images during the security checking process. An intelligent semantic segmentation model of prohibited … |
Haigang Zhang; Zihao Zhao; Jinfeng Yang; | IET Image Process. | 2024-01-10 |
684 | Generic Knowledge Boosted Pretraining for Remote Sensing Images IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Deep learning models are essential for scene classification, change detection, land cover segmentation, and other remote sensing (RS) image understanding tasks. Most backbones of … |
Ziyue Huang; Mingming Zhang; Yuan Gong; Qingjie Liu; Yunhong Wang; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-09 |
685 | Shadow-Robust Semantic Segmentation for Autonomous Navigation in Walking Space Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Extensive research has been conducted on autonomous navigation systems for last-mile outdoor transportation. A method in previous work utilizes semantic segmentation to classify … |
Kota Hayashi; Hiroaki Nakamichi; H. Yoshitake; Motoki Shino; | 2024 IEEE/SICE International Symposium on System … | 2024-01-08 |
686 | Primitive Geometry Segment Pre-training for 3D Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We thus present the Primitive Geometry Segment Pre-training (PrimGeoSeg) method to enable the learning of 3D semantic features by pre-training segmentation tasks using only primitive geometric objects for 3D medical image segmentation. |
Ryu Tadokoro; Ryosuke Yamada; Kodai Nakashima; Ryo Nakamura; Hirokatsu Kataoka; | arxiv-cs.CV | 2024-01-07 |
687 | Scene Simplification for Simulated Prosthetic Vision with Improved Scene Understanding Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Visual impairment or blindness affects millions of people worldwide, causing significant challenges in their daily activities and communication. These people are potentially a … |
Shijie Yang; Dehao Han; Jingbang Wu; Xiaoming Chen; Vera Chung; | 2024 IEEE International Conference on Consumer Electronics … | 2024-01-06 |
688 | CSMB-VSS: Video Scene Segmentation with Cosine Similarity Matrix Related Papers Related Patents Related Grants Related Venues Related Experts View |
Zeyu Chen; Xinbo Wang; Ji Wang; Yi Zhang; Xiang Cao; | Multim. Tools Appl. | 2024-01-06 |
689 | Systematic Review of Image Segmentation Using Complex Networks Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This review presents various image segmentation methods using complex networks. |
Amin Rezaei; Fatemeh Asadi; | arxiv-cs.CV | 2024-01-05 |
690 | DeVOS: Flow-Guided Deformable Transformer for Video Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The recent works on Video Object Segmentation achieved remarkable results by matching dense semantic and instance-level features between the current and previous frames for … |
VOLODYMYR FEDYNYAK et. al. | 2024 IEEE/CVF Winter Conference on Applications of Computer … | 2024-01-03 |
691 | Temporally-Consistent Video Semantic Segmentation with Bidirectional Occlusion-guided Feature Propagation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Despite recent progress in static image segmentation, video segmentation is still challenging due to the need for an accurate, fast, and temporally consistent model. Conducting … |
Razieh Kaviani Baghbaderani; Yuanxin Li; Shuangquan Wang; Hairong Qi; | 2024 IEEE/CVF Winter Conference on Applications of Computer … | 2024-01-03 |
692 | Dual-Guided Frequency Prototype Network for Few-Shot Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Few-shot semantic segmentation is a challenging task that aims to segment novel classes in the query images given only a few annotated support samples. Most existing … |
Chunlin Wen; Hui Huang; Yan Ma; Feiniu Yuan; Hongqing Zhu; | IEEE Transactions on Multimedia | 2024-01-01 |
693 | Swin-CDSA: The Semantic Segmentation of Remote Sensing Images Based on Cascaded Depthwise Convolution and Spatial Attention Mechanism Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: As an important task in remote sensing image processing, semantic segmentation of remote sensing images has broad application prospects in many fields such as disaster warning and … |
YUHAN KANG et. al. | IEEE Geoscience and Remote Sensing Letters | 2024-01-01 |
694 | Research on Image Semantic Segmentation Based on Hybrid Cascade Feature Fusion and Detailed Attention Mechanism Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In view of the low segmentation accuracy for small-scale object and insufficient segmentation of local boundary for semantic segmentation methods based on Deep Learning, this … |
Zuoqiang Du; Yuan Liang; | IEEE Access | 2024-01-01 |
695 | Semantic Image Segmentation By Dynamic Discriminative Prototypes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation achieves significant success through large-scale training data. Meanwhile, few-shot semantic segmentation was proposed to segment image regions of novel … |
Kaipeng Zhang; Yoichi Sato; | IEEE Transactions on Multimedia | 2024-01-01 |
696 | Evaluation of Global-Scale and Local-Scale Optimized Segmentation Algorithms in GEOBIA With SAM on Land Use and Land Cover Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Segmentation is crucial in geographic object-based image analysis for accurate land use and land cover mapping. However, obtaining outstanding segmentation results in all … |
Tao He; Jianyu Chen; Linchong Kang; Qiankun Zhu; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
697 | Semantic Segmentation-Based Intelligent Threshold-Free Feeder Detection Method for Single-Phase Ground Fault in Distribution Networks Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Feeder detection for single-phase ground fault (SPGF) is challenging in a resonant grounded system due to the difference in feeder capacitance to ground and the influence of the … |
Cui Hong; Heng-Yi Qiu; Jian-Hong Gao; Shuyue Lin; Moufa Guo; | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
698 | Detail-Optimized Super-Resolution Reconstruction-Based Multistage Training Strategy for Remote Sensing Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Low resolution is a major factor that negatively impacts the accuracy of remote sensing (RS) interpretation. High-quality super-resolution reconstruction (SRR) can help alleviate … |
BAIKAI SUI et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
699 | Densely Multiscale Fusion Network for Lightweight and Accurate Semantic Segmentation of Railway Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation in railway scenes constitutes a fundamental task in the context of autonomous train driving, serving as a critical source of positional information for other … |
LIRONG LIAN et. al. | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
700 | HBSeNet: A Hybrid Bilateral Network for Accurate Semantic Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of aerial and satellite images plays a crucial role in a wide range of applications and services, catering to the increasing needs of environmental resource … |
Thien Huynh-The; Son Ngoc Truong; Gia-Vuong Nguyen; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
701 | TransSea: Hybrid CNN–Transformer With Semantic Awareness for 3-D Brain Tumor Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Accurate segmentation of brain tumors in multimodal magnetic resonance imaging (MRI) plays a crucial role in clinical quantitative assessments, diagnostic processes, and the … |
Yu Liu; Yize Ma; Zhiqin Zhu; Juan Cheng; Xun Chen; | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
702 | A Stepwise Refining Image-Level Weakly Supervised Semantic Segmentation Method for Detecting Exposed Surface for Buildings (ESB) From Very High-Resolution Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Exposed surface for buildings (ESB), which refers to exposed surfaces with traces of building construction, often leads to urban dust. Accurate ESB detection is important for … |
Xin Huang; Wenrui Wang; Jiayi Li; Leiguang Wang; Xing Xie; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
703 | A Review of Optical and SAR Image Deep Feature Fusion in Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the advent of the era of high-resolution remote sensing, semantic segmentation methods for solving pixel-level classification have been widely studied. Deep learning has … |
CHENFANG LIU et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
704 | Enhanced Visual SLAM for Construction Robots By Efficient Integration of Dynamic Object Segmentation and Scene Semantics IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View |
Liu Yang; Hubo Cai; | Adv. Eng. Informatics | 2024-01-01 |
705 | Robust 3D Semantic Segmentation Based on Multi-Phase Multi-Modal Fusion for Intelligent Vehicles Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: 3D semantic segmentation is a key technology for intelligent vehicles. Recently, great efforts have been made to achieve accurate and robust 3D semantic segmentation results … |
PEIZHOU NI et. al. | IEEE Transactions on Intelligent Vehicles | 2024-01-01 |
706 | Point-Based Weakly Supervised Deep Learning for Semantic Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Weakly supervised semantic segmentation methods can effectively alleviate the problem of high cost and difficult access to annotation in traditional methods. Among these … |
Yuanhao Zhao; Genyun Sun; Ziyan Ling; A. Zhang; Xiuping Jia; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
707 | Prototype Comparison Convolutional Networks for One-Shot Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In few-shot semantic segmentation (FSS), the key challenges are efficiently tuning the interaction between the support set and the query set and distinguishing between context, … |
LINGBO LI et. al. | IEEE Access | 2024-01-01 |
708 | Image Semantic Segmentation Approach Based on DeepLabV3 Plus Network with An Attention Mechanism IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View |
YANYAN LIU et. al. | Eng. Appl. Artif. Intell. | 2024-01-01 |
709 | Multilateral Semantic With Dual Relation Network for Remote Sensing Images Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of remote sensing images is an extensively employed and demanding task. Although deep convolutional neural networks have significantly increased the accuracy … |
Weiheng Zhao; Jiannong Cao; Xueyan Dong; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
710 | DSHNet: A Semantic Segmentation Model of Remote Sensing Images Based on Dual Stream Hybrid Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is an important issue in intelligent interpretation of remote sensing, playing an important role in applications such as Earth observation and land data … |
Yujia Fu; Xiangrong Zhang; Mingyang Wang; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
711 | Cin-Seg: Causal Invariance for Tag-Supervised Segmentation on Medical Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Weakly supervised semantic segmentation (WSSS) is the method of learning a segmentation model with only weak labels, e.g., image-level labels. For WSSS methods, the segmentation … |
Zhang Chen; Zhiqiang Tian; Jihua Zhu; Shaoyi Du; Qindong Sun; | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
712 | RailCloud-HdF: A Large-Scale Point Cloud Dataset for Railway Scene Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: : Semantic scene perception is critical for various applications, including railway systems where safety and efficiency are paramount. Railway applications demand precise … |
Mahdi Abid; Mathis Teixeira; Ankur Mahtani; Thomas Laurent; | VISIGRAPP : VISAPP | 2024-01-01 |
713 | DETisSeg: A Dual-encoder Network for Tissue Semantic Segmentation of Histopathology Image Related Papers Related Patents Related Grants Related Venues Related Experts View |
Penghui He; Aiping Qu; Shuomin Xiao; Meidan Ding; | Biomed. Signal Process. Control. | 2024-01-01 |
714 | End-to-End Instance-Level Human Parsing By Segmenting Persons Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Instance-level human parsing is aimed at separately partitioning the human body into different semantic parts for each individual, which remains a challenging task due to human … |
Zhuang Li; Leilei Cao; Hongbin Wang; Lihong Xu; | IEEE Transactions on Multimedia | 2024-01-01 |
715 | On Exploring Shape and Semantic Enhancements for RGB-X Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The robustness of scene segmentation can be enhanced with the aid of other modality information, e.g., thermal or/and depth, under poor environmental conditions. In this context, … |
Yuanjian Yang; Caifeng Shan; Fang Zhao; Wenli Liang; Jungong Han; | IEEE Transactions on Intelligent Vehicles | 2024-01-01 |
716 | Semantic Anything in 3D Gaussians IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: 3D Gaussian Splatting has emerged as an alternative 3D representation of Neural Radiance Fields (NeRFs), bene-fiting from its high-quality rendering results and real-time … |
XU HU et. al. | ArXiv | 2024-01-01 |
717 | BSVOS: Background Interference Suppression Strategy for Satellite Video Multiobject Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Video satellites provide dynamic real-time monitoring of hotspot areas and objects by continuously imaging objects within a specified sequence, providing dynamic information over … |
Longxuan Kou; Shengyang Li; Jian Yang; Yixuan Lv; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
718 | Semantic Transition Detection for Self-supervised Video Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Lu Chen; Jiawei Tan; Pingan Yang; Hongxing Wang; | Conference on Multimedia Modeling | 2024-01-01 |
719 | Canonical Plane Segmentation Without Annotating Pixel-Level Object Regions for Image Registration Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Two-dimensional (2D) image registration is a natural choice for simultaneous object pose estimation and object recognition. However, it was not designed to perform object … |
Shunsuke Yoneda; Go Irie; Masashi Nishiyama; | IEEE Access | 2024-01-01 |
720 | Mutual Dual-Task Generator With Adaptive Attention Fusion for Image Inpainting Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Image segmentation can reveal the semantic structure information in an image, which is helpful guidance information for image inpainting. Notably, it can help mitigate the … |
Yongle Zhang; Yimin Liu; Ruotong Hu; Qiang Wu; Jian Zhang; | IEEE Transactions on Multimedia | 2024-01-01 |
721 | Bolstering Performance Evaluation of Image Segmentation Models With Efficacy Metrics in The Absence of A Gold Standard Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Image segmentation using deep learning has become overwhelmingly widespread. However, routine model testing methods can encounter evaluation inconsistencies or bias, largely due … |
LINA TANG et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
722 | A Deep Learning Image Augmentation Method for Field Agriculture Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Vision-based smart agriculture is an important way to improve the efficiency of agricultural production. Labeling images for deep learning in complex field photos is a difficult … |
Kunlin Zou; Yi Shan; Xun Zhao; De Cai Ran; Xiaoxi Che; | IEEE Access | 2024-01-01 |
723 | Oriented Object Detection for Remote Sensing Images Via Object-Wise Rotation-Invariant Semantic Representation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Oriented object detection (OOD) in remote sensing images (RSIs) remains a challenging work due to an arbitrary orientation of instances. Learning rotation-invariant features is … |
Shangdong Zheng; Zebin Wu; Q. Du; Yang Xu; Zhihui Wei; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
724 | GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation plays a pivotal role in interpreting high-resolution remote sensing images (RSIs), where contextual information is essential for achieving accurate … |
Yong Cao; Chunlei Huo; Shiming Xiang; Chunhong Pan; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
725 | RSBEV: Multiview Collaborative Segmentation of 3-D Remote Sensing Scenes With Bird’s-Eye-View Representation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Perception of 3-D remote sensing scenes plays a crucial role in accurately recognizing and locating ground objects, as it enables a deeper understanding of complex environments by … |
Baihong Lin; Zhengxia Zou; Z. Shi; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
726 | A Sample Augmentation Method for Side-Scan Sonar Full-Class Images That Can Be Used for Detection and Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: To solve the problems of small samples, acquisition difficulties, under representation and labeling difficulties in object detection, recognition, and segmentation tasks for … |
Zhiwei Yang; Jianhu Zhao; Yongcan Yu; Chao Huang; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
727 | Design of Forward-Looking Sonar System for Real-Time Image Segmentation With Light Multiscale Attention Net IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Forward-looking sonar is a commonly used underwater detection device. However, due to the complex underwater environment, small target areas, and blurred features, the detection … |
DONGDONG ZHAO et. al. | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
728 | Stealthy Adversarial Examples for Semantic Segmentation in Remote Sensing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Deep learning methods have been proven effective in remote sensing image analysis and interpretation, where semantic segmentation plays a vital role. These deep segmentation … |
Tao Bai; Yiming Cao; Yonghao Xu; Bihan Wen; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
729 | Scenario-Based Segmentation: Traffic Image Segmentation By GNN Based Driver’s Scenario Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper introduces the Scenario-Based Segmentation Network (SBS-Net), which highlights significant advances in autonomous driving. Through the integration of the Scenario … |
Seungwoo Nham; Jinho Lee; Seongryul Yang; Jihun Kim; Shunsuke Kamijo; | IEEE Access | 2024-01-01 |
730 | PCNN Orchard Heterologous Image Fusion with Semantic Segmentation of Significance Regions Related Papers Related Patents Related Grants Related Venues Related Experts View |
Wubo Xu; Liqun Liu; | Comput. Electron. Agric. | 2024-01-01 |
731 | SAMPE: Auto-Prompting SAM for Generalizable Power Equipment Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Power equipment image segmentation is challenging as it involves objects of various scales/sizes, illustration conditions, and imaging angles, making task-specific deep learning … |
YUANSHAN GUO et. al. | IEEE Access | 2024-01-01 |
732 | SAM2-PATH: A Better Segment Anything Model for Semantic Segmentation in Digital Pathology Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The semantic segmentation task in pathology plays an indispensable role in assisting physicians in determining the condition of tissue lesions. Foundation models, such as the SAM … |
MINGYA ZHANG et. al. | ArXiv | 2024-01-01 |
733 | A Knowledge Distillation-Based Ground Feature Classification Network With Multiscale Feature Fusion in Remote-Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: As a fundamental task in remote-sensing interpretation, semantic segmentation of remote-sensing images intends to allocate a definite class to each pixel in the image. Fast and … |
Yang Yang; Yanhui Wang; Junwu Dong; Bibo Yu; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
734 | Low-Rank Adaptation of Segment Anything Model for Surgical Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Jay N. Paranjape; S. Sikder; S. Vedula; Vishal M. Patel; | International Conference on Pattern Recognition | 2024-01-01 |
735 | Advancing Data-Efficient Exploitation for Semi-Supervised Remote Sensing Images Semantic Segmentation IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: To reduce the dependence of remote sensing (RS) image semantic segmentation models on extensive pixel-level annotated images, this article aims to address the issue of … |
Liang Lv; Lefei Zhang; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
736 | Causality-Guided Stepwise Intervention and Reweighting for Remote Sensing Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is one of the most significant tasks in remote sensing (RS) image interpretation, which focuses on learning global and local information to infer the … |
SHUTING SHI et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
737 | A Novel SegNet Model for Crack Image Semantic Segmentation in Bridge Inspection Related Papers Related Patents Related Grants Related Venues Related Experts View |
RONG PANG et. al. | Pacific-Asia Conference on Knowledge Discovery and Data … | 2024-01-01 |
738 | Unbalanced Class Learning Network With Scale-Adaptive Perception for Complicated Scene in Remote Sensing Images Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The semantic segmentation of wide-field remote sensing images (RSIs) plays a significant role in many fields. However, due to the complexity of the content of RSIs, the dataset … |
HE WANG et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
739 | SliceMamba for Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Despite the progress made in Mamba-based medical image segmentation models, current methods utilizing unidirectional or multi-directional feature scanning mechanisms fail to well … |
CHAO FAN et. al. | ArXiv | 2024-01-01 |
740 | ER-Swin: Feature Enhancement and Refinement Network Based on Swin Transformer for Semantic Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: As the field of remote sensing image processing continues to advance, semantic segmentation has become a focal point in this domain. The emergence of the swin transformer (SwinT) … |
Jiang Liu; Shuli Cheng; Anyu Du; | IEEE Geoscience and Remote Sensing Letters | 2024-01-01 |
741 | Encouraging The Mutual Interact Between Dataset-Level and Image-Level Context for Semantic Segmentation of Remote Sensing Image Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recently, semantic segmentation of remote sensing images has witnessed rapid advancement with the adoption of deep neural networks. Contextual cues, referring to the long-range … |
Ke An; Yupei Wang; Liang Chen; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
742 | Online Continual Domain Adaptation for Semantic Image Segmentation Using Internal Representations Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We develop an online UDA algorithm for semantic segmentation of images that improves model generalization on unannotated domains in scenarios where source data access is restricted during adaptation. |
Serban Stan; Mohammad Rostami; | arxiv-cs.CV | 2024-01-01 |
743 | PIF-Net: A Deep Point-Image Fusion Network for Multimodality Semantic Segmentation of Very High-Resolution Imagery and Aerial Point Cloud Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is of great significance in many applications. However, automating such a task on single-modality data is challenging in the field of remote sensing due to … |
Zhou Guo; Rui Xu; Chen‐Chieh Feng; Zhao Zeng; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
744 | Crop Identification of UAV Images Based on An Unsupervised Semantic Segmentation Method Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Crop identification is a fundamental task in remote sensing image interpretation. The rapid development of unmanned aerial vehicle (UAV) has revolutionized the acquisition of … |
Zebing Zhang; Leiguang Wang; Yuncheng Chen; Chen Zheng; | IEEE Geoscience and Remote Sensing Letters | 2024-01-01 |
745 | SWINT-RESNet: An Improved Remote Sensing Image Segmentation Model Based on Transformer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Deep neural networks have been widely used in remote sensing image segmentation. Nowadays, artificial intelligence methods are increasingly applied to remote sensing feature … |
Yue Ma; Yingli Wang; Xingya Liu; Haiying Wang; | IEEE Geoscience and Remote Sensing Letters | 2024-01-01 |
746 | Infusing Multisource Heterogeneous Knowledge for Language-Conditioned Segmentation and Grasping Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Language-conditioned segmentation and grasping (LCSG) requires the robot to simultaneously identify and grasp a specific object in accordance with human linguistic instruction. … |
JIALONG XIE et. al. | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
747 | Object Pose Estimation From RGB-D Images With Affordance-Instance Segmentation Constraint for Semantic Robot Manipulation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Object pose estimation is a crucial task for semantic robot manipulation involving the detection of suitable manipulation regions. Given the diversity of object shapes and scene … |
Zhongli Wang; Guohui Tian; | IEEE Robotics and Automation Letters | 2024-01-01 |
748 | Region-Based Unsupervised Low-Light Image Enhancement in The Wild With Explicit Domain Supervision Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Prior unsupervised low-light image enhancement methods have exhibited commendable performance within indoor environments. However, adopting them in the wild scene whose low-light … |
YINGJIE MA et. al. | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
749 | PCL: Point Contrast and Labeling for Weakly Supervised Point Cloud Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Point cloud semantic segmentation is a fundamental task in 3D scene understanding and has recently achieved remarkable progress. The success of existing approaches is attributed … |
Anan Du; Tianfei Zhou; Shuchao Pang; Qiang Wu; Jian Zhang; | IEEE Transactions on Multimedia | 2024-01-01 |
750 | DDSNet: Deep Dual-Branch Networks for Surface Defect Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of surface defects is essential to ensure product quality in intelligent manufacturing. However, due to the diversity and complexity of industrial scenarios … |
ZHENYU YIN et. al. | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
751 | A Lightweight CNN–Transformer Network With Laplacian Loss for Low-Altitude UAV Imagery Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is crucial for enabling autonomous flight and landing of low-altitude unmanned aerial vehicles (UAVs) and is indispensable for various intelligent … |
Wen Lu; Zhiqi Zhang; Minh Nguyen; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
752 | DRD-UNet, A UNet-Like Architecture for Multi-Class Breast Cancer Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Staining of histological slides with Hematoxylin and Eosin is widely used in clinical and laboratory settings as these dyes reveal nuclear structures as well as cytoplasm and … |
Mauricio Alberto Ortega-Ruiz; C. Karabağ; Edgar Roman-Rangel; C. Reyes-Aldasoro; | IEEE Access | 2024-01-01 |
753 | A Simple Framework of Few-Shot Learning Using Sparse Annotations for Semantic Segmentation of 3-D Point Clouds Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The semantic segmentation of point clouds plays a crucial role in the interpretation of 3-D scene. However, the majority of supervised learning methods needs a great number of … |
Rong Huang; Yang Gao; Yusheng Xu; L. Hoegner; X. Tong; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
754 | Adaptive Self-Supporting Prototype Learning for Remote Sensing Few-Shot Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The semantic segmentation of remote sensing images with few shots has important theoretical and application value. Most of the existing few-shot semantic segmentation frameworks … |
Weihao Shen; A. Ma; Junjue Wang; Zhuo Zheng; Yanfei Zhong; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
755 | RSSGLT: Remote Sensing Image Segmentation Network Based on Global–Local Transformer Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Remotely captured images possess an immense scale and object appearance variability due to the complex scene. It becomes challenging to capture the underlying attributes in the … |
S. Kumar; Abhishek Kumar; Dong-Gyu Lee; | IEEE Geoscience and Remote Sensing Letters | 2024-01-01 |
756 | MA-DBFAN: Multiple-attention-based Dual Branch Feature Aggregation Network for Aerial Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Haoyu Yue; Junhong Yue; Xuejun Guo; Yizhen Wang; Liancheng Jiang; | Signal Image Video Process. | 2024-01-01 |
757 | CloudFU-Net: A Fine-Grained Segmentation Method for Ground-Based Cloud Images Based on An Improved Encoder–Decoder Structure Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The segmentation of ground-based cloud image is a crucial aspect of ground-based cloud observation, with significant implications for meteorological forecasting, photovoltaic … |
CHAOJUN SHI et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
758 | Learn More and Learn Usefully: Truncation Compensation Network for Semantic Segmentation of High-Resolution Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of high-resolution remote-sensing images (HR-RSIs) focuses on classifying each pixel of input images. Recent methods have incorporated a downscaled global … |
Li Zhang; Zhenshan Tan; Guo Zhang; Wen Zhang; Zhijiang Li; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
759 | SSF-MOS: Semantic Scene Flow Assisted Moving Object Segmentation for Autonomous Vehicles Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Detecting moving objects in dynamic environments is precisely essential in autonomous driving. Existing object detection methods using point clouds have difficulties in … |
Tao Song; Yunhao Liu; Ziying Yao; Xinkai Wu; | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
760 | MSGFormer: A DeepLabv3+ Like Semantically Masked and Pixel Contrast Transformer for MouseHole Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In semantic segmentation, the efficient representation of multi-scale context is of paramount importance. Inspired by the remarkable performance of Vision Transformers (ViT) in … |
PENG YANG et. al. | IEEE Access | 2024-01-01 |
761 | Stair Fusion Network With Context-Refined Attention for Remote Sensing Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of remote sensing images is essential in various fields, such as Earth resource census, environmental pollution monitoring, and land use planning. The … |
Jia Liu; Wenyi Hua; Wenhua Zhang; Fang Liu; Liang Xiao; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
762 | Object Segmentation Using Polarization Random Feature in Passive Millimeter-Wave Imaging Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Object segmentation is an important issue in the field of passive millimeter-wave (PMMW) imaging remote sensing and detection. The brightness temperature (TB) difference of … |
YAYUN CHENG et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
763 | MFCANet: A Road Scene Segmentation Network Based on Multi-Scale Feature Fusion and Context Information Aggregation Related Papers Related Patents Related Grants Related Venues Related Experts View |
YUNFENG WANG et. al. | J. Vis. Commun. Image Represent. | 2024-01-01 |
764 | Scene-Adaptive 3D Semantic Segmentation Based on Multi-Level Boundary-Semantic-Enhancement for Intelligent Vehicles IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: 3D semantic segmentation is a key technology of scene understanding in the self-driving field, which remains challenging problems. Recent 3D segmentation methods have achieved … |
Peizhou Ni; Xu Li; Dong Kong; Xiaoqing Yin; | IEEE Transactions on Intelligent Vehicles | 2024-01-01 |
765 | MROVSeg: Breaking The Resolution Curse of Vision-Language Models in Open-Vocabulary Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Open-vocabulary semantic segmentation aims to segment and recognize semantically meaningful regions based on text-based descriptions during inference. A typical so-lution to … |
YUANBING ZHU et. al. | ArXiv | 2024-01-01 |
766 | DHRNet: A Dual-Branch Hybrid Reinforcement Network for Semantic Segmentation of Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In the field of remote sensing image processing, semantic segmentation has always been a hot research topic. Currently, deep convolutional neural networks (DCNNs) are the … |
Qinyan Bai; Xiaobo Luo; Yaxu Wang; Tengfei Wei; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
767 | SonarNet: Hybrid CNN-Transformer-HOG Framework and Multifeature Fusion Mechanism for Forward-Looking Sonar Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Forward-looking sonar (FLS) image segmentation plays a significant role in ocean engineering. However, the existing image segmentation algorithms present difficulties in … |
Ju He; Jianfeng Chen; Hu Xu; Yang Yu; | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
768 | MixUNet: A Lightweight Medical Image Segmentation Network Capturing Multidimensional Semantic Information Related Papers Related Patents Related Grants Related Venues Related Experts View |
YUFENG CHEN et. al. | Biomed. Signal Process. Control. | 2024-01-01 |
769 | Semantic Segmentation of Remote Sensing Images With Transformer-Based U-Net and Guided Focal-Axial Attention Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In the field of remote sensing, semantic segmentation of unmanned aerial vehicle (UAV) imagery is crucial for tasks such as land resource management, urban planning, precision … |
Bianca-Cerasela-Zelia Blaga; S. Nedevschi; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
770 | MASNet: Road Semantic Segmentation Based on Multiscale Modality Fusion Perception Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: With the growing complexity of driving environments, relying solely on a single sensor for scene understanding is no longer sufficient. To address this issue, this article … |
Xiaohang Li; Jianjiang Zhou; | IEEE Transactions on Instrumentation and Measurement | 2024-01-01 |
771 | MeSAM: Multiscale Enhanced Segment Anything Model for Optical Remote Sensing Images IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Segment anything model (SAM) has been widely applied to various downstream tasks for its excellent performance and generalization capability. However, SAM exhibits three … |
XICHUAN ZHOU et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
772 | HierU-Net: A Hierarchical Semantic Segmentation Method for Land Cover Mapping Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Land cover mapping is crucial for natural resource assessment, urban planning, and sustainable development. Land cover nomenclature often includes two or three hierarchical levels … |
LANFA LIU et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
773 | Boundary-Guided Lightweight Semantic Segmentation With Multi-Scale Semantic Context IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Lightweight semantic segmentation plays an essential role in image signal processing that is beneficial to many multimedia applications, such as self-driving, robotic vision, and … |
QUAN ZHOU et. al. | IEEE Transactions on Multimedia | 2024-01-01 |
774 | Dense Dual-Branch Cross Attention Network for Semantic Segmentation of Large-Scale Point Clouds Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of large-scale point clouds provides foundational knowledge for various geodetic and cartographic applications, including autonomous driving, smart cities, … |
ZIWEI LUO et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
775 | A Mamba-Diffusion Framework for Multimodal Remote Sensing Image Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recent advances in deep learning have made significant progress in multimodal remote sensing semantic segmentation. However, current methods face challenges in maintaining … |
WENLIANG DU et. al. | IEEE Geoscience and Remote Sensing Letters | 2024-01-01 |
776 | Clustering-Guided Class Activation for Weakly Supervised Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Abstract: Weakly-supervised semantic segmentation (WSSS) methods via transformer have been actively studied by leveraging their strong capability to capture the global context. However, … |
Yeong Woo Kim; Wonjun Kim; | IEEE Access | 2024-01-01 |
777 | SegCLIP: Multimodal Visual-Language and Prompt Learning for High-Resolution Remote Sensing Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Remote sensing semantic segmentation is considered a key step in the intelligent interpretation of high-resolution remote sensing (HRRS) images, with widespread applications in … |
SHIJIE ZHANG et. al. | IEEE Transactions on Geoscience and Remote Sensing | 2024-01-01 |
778 | MLU-Net: A Multi-Level Lightweight U-Net for Medical Image Segmentation Integrating Frequency Representation and MLP-Based Methods Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Medical image segmentation is a challenging and popular task in the field of medical image processing in recent decades. Most of the current mainstream segmentation networks are … |
LIPING FENG et. al. | IEEE Access | 2024-01-01 |
779 | Unified Semantic Model for Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Shuai Yuan; Jianjian Yin; Runcheng Li; Yi Chen; Yudong Zhang; | Biomed. Signal Process. Control. | 2024-01-01 |
780 | Salient-Boundary-Guided Pseudo-Pixel Supervision for Weakly-Supervised Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This letter presents an innovative approach for generating pixel-wise pseudo masks as supervision for image-level Weakly Supervised Semantic Segmentation (WSSS). This is achieved … |
Min Shi; Weizhao Deng; Qingming Yi; Weiping Liu; Aiwen Luo; | IEEE Signal Processing Letters | 2024-01-01 |
781 | TFRNet: Semantic Segmentation Network with Token Filtration and Refinement Method Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Transformer-based semantic segmentation has been developed rapidly. Vision transformer (ViT) rely on self-attention mechanism which employs all image patches to compute long-range … |
Yingdong Ma; Xiaoyu Hu; | IEEE Transactions on Multimedia | 2024-01-01 |
782 | Uncertainty-Guided Segmentation Network for Geospatial Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Geospatial objects pose significant challenges, including dense distribution, substantial interclass variations, and minimal intraclass variations. These complexities make … |
Hongyu Jia; Wenwu Yang; Lin Wang; Haolin Li; | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
783 | Unsupervised Semantic Segmentation of PolSAR Images Based on Multiview Similarity Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation is an essential task in polarimetric synthetic aperture radar (PolSAR) image interpretation. To address the issue of insufficient measurement ability of … |
MEILIN LI et. al. | IEEE Journal of Selected Topics in Applied Earth … | 2024-01-01 |
784 | 1st Place Solution for 5th LSVOS Challenge: Referring Video Object Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we integrate strengths of that leading RVOS models to build up an effective paradigm. |
ZHUOYAN LUO et. al. | arxiv-cs.CV | 2023-12-31 |
785 | Promoting Segment Anything Model Towards Highly Accurate Dichotomous Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Therefore, it is both interesting and valuable to explore whether SAM can be improved towards highly accurate object segmentation, which is known as the dichotomous image segmentation (DIS) task. To address this issue, we propose DIS-SAM, which advances SAM towards DIS with extremely accurate details. |
Xianjie Liu; Keren Fu; Yao Jiang; Qijun Zhao; | arxiv-cs.CV | 2023-12-30 |
786 | LISA++: An Improved Baseline for Reasoning Segmentation with Large Language Model Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we introduce LISA++, an update to the existing LISA model, focusing on improving core functionalities while keeping the base architecture intact. |
SENQIAO YANG et. al. | arxiv-cs.CV | 2023-12-28 |
787 | An Improved Baseline for Reasoning Segmentation with Large Language Model IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: While LISA effectively bridges the gap between segmentation and large language models to enable reasoning segmentation, it poses certain limitations: unable to distinguish … |
SENQIAO YANG et. al. | ArXiv | 2023-12-28 |
788 | Special Perceptual Parsing for Chinese Landscape Painting Scene Understanding: A Semantic Segmentation Approach Related Papers Related Patents Related Grants Related Venues Related Experts View |
RUI YANG et. al. | Neural Comput. Appl. | 2023-12-27 |
789 | 2D-Guided 3D Gaussian Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In response, this paper introduces a 3D Gaussian segmentation method implemented with 2D segmentation as supervision. |
KUN LAN et. al. | arxiv-cs.CV | 2023-12-26 |
790 | UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we end the current fragmented situation and propose UniRef++ to unify the four reference-based object segmentation tasks with a single architecture. |
JIANNAN WU et. al. | arxiv-cs.CV | 2023-12-25 |
791 | I2PN: Improved Image Projection Network for OCTA Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Optical coherence tomography (OCT) is one of the most significant advances in medical images, and OCT segmentation is an important task in medical-assisted diagnostics. However, … |
TIANLEI WANG et. al. | Proceedings of the 2023 6th International Conference on … | 2023-12-22 |
792 | Few Shot Part Segmentation Reveals Compositional Logic for Industrial Anomaly Detection IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this study, we introduce a novel component segmentation model for LA detection that leverages a few labeled samples and unlabeled images sharing logical constraints. |
SOOPIL KIM et. al. | arxiv-cs.CV | 2023-12-21 |
793 | BEVSeg2TP: Surround View Camera Bird’s-Eye-View Based Joint Vehicle Segmentation and Ego Vehicle Trajectory Prediction Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The proposed method in this paper predicts trajectories by considering perception and trajectory prediction as a unified system. |
Sushil Sharma; Arindam Das; Ganesh Sistu; Mark Halton; Ciarán Eising; | arxiv-cs.CV | 2023-12-20 |
794 | Monocular 3D Object Detection for Construction Scene Analysis Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Three‐dimensional (3D) object detection, that is, localizing and classifying all critical objects in a 3D space, is essential for downstream construction scene analysis tasks. … |
Jie Shen; Lang Jiao; Cong Zhang; Keran Peng; | Computer‐Aided Civil and Infrastructure Engineering | 2023-12-20 |
795 | MetaSegNet: Metadata-collaborative Vision-Language Representation Learning for Semantic Segmentation of Remote Sensing Images Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Non-visual data, such as text, can gather extra knowledge from the real world, which can strengthen the interpretability, reliability, and generalization of visual models. Inspired by this, we propose a novel metadata-collaborative segmentation network (MetaSegNet) that applies vision-language representation learning for semantic segmentation of remote sensing images. |
LIBO WANG et. al. | arxiv-cs.CV | 2023-12-19 |
796 | All for One, and One for All: UrbanSyn Dataset, The Third Musketeer of Synthetic Driving Scenes IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce UrbanSyn, a photorealistic dataset acquired through semi-procedurally generated synthetic urban driving scenarios. |
JOSE L. GÓMEZ et. al. | arxiv-cs.CV | 2023-12-19 |
797 | Video Semantic Segmentation Network with Low Latency Based on Deep Learning Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Recently, new advances in deep learning algorithms have yielded some fascinating results in the field of computer vision technology. As a result, it can now perform activities … |
Channappa Gowda D V; R. Kanagavalli; | Int. J. Commun. Networks Inf. Secur. | 2023-12-19 |
798 | SegRefiner: Towards Model-Agnostic Segmentation Refinement with Discrete Diffusion Process IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we explore a principal way to enhance the quality of object masks produced by different segmentation models. |
MENGYU WANG et. al. | arxiv-cs.CV | 2023-12-19 |
799 | The Endoscapes Dataset for Surgical Scene Segmentation, Object Detection, and Critical View of Safety Assessment: Official Splits and Benchmark IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this report, we provide detailed dataset statistics (size, class distribution, dataset splits, etc.) and a comprehensive performance benchmark for instance segmentation, object detection, and CVS prediction. |
ADITYA MURALI et. al. | arxiv-cs.CV | 2023-12-19 |
800 | Semantic Segmentation Using Transfer Learning on Fisheye Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: While semantic segmentation has been extensively studied in the realm of regular perspective images, its application to fisheye images remains relatively unexplored. Existing … |
S. Paul; Zachary Patterson; Nizar Bouguila; | 2023 International Conference on Machine Learning and … | 2023-12-15 |
801 | WeatherProof: A Paired-Dataset Approach to Semantic Segmentation in Adverse Weather Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We find that training on these paired clear and adverse weather frames which share an underlying scene results in improved performance on adverse weather data. With this knowledge, we propose a training pipeline which accentuates the advantages of paired-data training using consistency losses and language guidance, which leads to performance improvements by up to 18.4% as compared to standard training procedures. |
BLAKE GELLA et. al. | arxiv-cs.CV | 2023-12-14 |
802 | Semi-supervised Semantic Segmentation Meets Masked Modeling:Fine-grained Locality Learning Matters in Consistency Regularization Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This can be because it originally stems from the image classification task and lacks specialized mechanisms to capture fine-grained local semantics that prioritizes in dense prediction. To address this issue, we propose a novel framework called \texttt{MaskMatch}, which enables fine-grained locality learning to achieve better dense segmentation. |
WENTAO PAN et. al. | arxiv-cs.CV | 2023-12-13 |
803 | Transferring CLIP’s Knowledge Into Zero-Shot Point Cloud Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we focus on zero-shot point cloud semantic segmentation and propose a simple yet effective baseline to transfer the visual-linguistic knowledge implied in CLIP to point cloud encoder at both feature and output levels. |
YUANBIN WANG et. al. | arxiv-cs.CV | 2023-12-12 |
804 | SDAT-Former++: A Foggy Scene Semantic Segmentation Method with Stronger Domain Adaption Teacher for Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation based on optical images can provide comprehensive scene information for intelligent vehicle systems, thus aiding in scene perception and decision making. … |
ZIQUAN WANG et. al. | Remote. Sens. | 2023-12-12 |
805 | Densify Your Labels: Unsupervised Clustering with Bipartite Matching for Weakly Supervised Point Cloud Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a weakly supervised semantic segmentation method for point clouds that predicts per-point labels from just whole-scene annotations while achieving the performance of recent fully supervised approaches. |
SHAOBO XIA et. al. | arxiv-cs.CV | 2023-12-11 |
806 | Cataract-1K: Cataract Surgery Dataset for Scene Segmentation, Phase Recognition, and Irregularity Detection Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Particularly, surgical scene understanding and phase recognition stand as pivotal pillars within the realm of computer-assisted surgery and post-operative assessment of cataract surgery videos. In this context, we present the largest cataract surgery video dataset that addresses diverse requisites for constructing computerized surgical workflow analysis and detecting post-operative irregularities in cataract surgery. |
NEGIN GHAMSARIAN et. al. | arxiv-cs.CV | 2023-12-11 |
807 | Architectural Floorplan Recognition Via Iterative Semantic Segmentation Networks Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper presents a novel method for architectural floorplan recognition based on iterative semantic segmentation networks, effectively improving the segmentation performance of … |
Wenming Wu; | Proceedings of the 2023 7th International Conference on … | 2023-12-08 |
808 | Loss Functions in The Era of Semantic Segmentation: A Survey and Outlook Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To aid researchers in identifying the optimal loss function for their particular application, this survey provides a comprehensive and unified review of $25$ loss functions utilized in image segmentation. |
REZA AZAD et. al. | arxiv-cs.CV | 2023-12-08 |
809 | GcDLSeg: Integrating Graph-cut Into Deep Learning for Binary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To combine the strengths of both approaches, we propose in this study to integrate the graph-cut approach into a deep learning network for end-to-end learning. |
Hui Xie; Weiyu Xu; Ya Xing Wang; John Buatti; Xiaodong Wu; | arxiv-cs.CV | 2023-12-07 |
810 | Augmentation-Free Dense Contrastive Knowledge Distillation for Efficient Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing methods heavily rely on data augmentation and memory buffer, which entail high computational resource demands when applying them to handle semantic segmentation that requires to preserve high-resolution feature maps for making dense pixel-wise predictions. In order to address this problem, we present Augmentation-free Dense Contrastive Knowledge Distillation (Af-DCD), a new contrastive distillation learning paradigm to train compact and accurate deep neural networks for semantic segmentation applications. |
Jiawei Fan; Chao Li; Xiaolong Liu; Meina Song; Anbang Yao; | arxiv-cs.CV | 2023-12-07 |
811 | Strategic Improvements of SqueezeSegV2 for Road-Scene Semantic Segmentation Using 3D LiDAR Point Cloud Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of LiDAR point clouds for road-scene analysis in autonomous vehicles and driver assistance systems is a challenging task due to the confusion of categories … |
Quang-Thai Le; Quoc-Hung Tran; Thien Huynh-The; | Proceedings of the 12th International Symposium on … | 2023-12-07 |
812 | LiPoSeg: A Lightweight Encoder-Decoder Network for LiDAR-based Road-Object Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: LiDAR point cloud segmentation is one of the most challenging tasks in autonomous driving systems, as it requires a cutting-edge perception method that should be accurate, … |
Anh-Kiet Vo; Thien Huynh-The; | Proceedings of the 12th International Symposium on … | 2023-12-07 |
813 | Auto-Vocabulary Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce Auto-Vocabulary Semantic Segmentation (AVS), advancing open-ended image understanding by eliminating the necessity to predefine object categories for segmentation. |
Osman Ülger; Maksymilian Kulicki; Yuki Asano; Martin R. Oswald; | arxiv-cs.CV | 2023-12-07 |
814 | LSegDiff: A Latent Diffusion Model for Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Initially designed for image generation, diffusion models can also be effectively applied to various tasks, including semantic segmentation. However, most existing diffusion-based … |
Hung Vu Quoc; Thao Tran Le Phuong; Minh Trinh Xuan; Sang Dinh Viet; | Proceedings of the 12th International Symposium on … | 2023-12-07 |
815 | DeepPyramid+: Medical Image Segmentation Using Pyramid View Fusion and Deformable Pyramid Reception Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose a network architecture, DeepPyramid+, which addresses diverse challenges encountered in medical image and surgical video segmentation. |
Negin Ghamsarian; Sebastian Wolf; Martin Zinkernagel; Klaus Schoeffmann; Raphael Sznitman; | arxiv-cs.CV | 2023-12-06 |
816 | Novel Class Discovery Meets Foundation Models for 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The task of Novel Class Discovery (NCD) in semantic segmentation entails training a model able to accurately segment unlabelled (novel) classes, relying on the available supervision from annotated (base) classes. |
Luigi Riz; Cristiano Saltori; Yiming Wang; Elisa Ricci; Fabio Poiesi; | arxiv-cs.CV | 2023-12-06 |
817 | Foundation Model Assisted Weakly Supervised Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To this end, we propose a coarse-to-fine framework based on CLIP and SAM for generating high-quality segmentation seeds. |
Xiaobo Yang; Xiaojin Gong; | arxiv-cs.CV | 2023-12-06 |
818 | Semi-MedSeq: Semi-supervised Semantic Segmentation for Medical Image Sequences Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In clinical practice, medical imaging techniques include 2D video-based examinations that capture sequential scans, and 3D volumetric imaging that forms a comprehensive 3D … |
RUNTIAN YUAN et. al. | 2023 IEEE International Conference on Bioinformatics and … | 2023-12-05 |
819 | Morphological Guided Causal Constraint Network for Medical Image Multi-Object Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Multi-objective segmentation (MOS) in medical images is to simultaneously extract multiple regions of interest in the medical images. Due to the unbalanced distribution of samples … |
YIFAN GAO et. al. | 2023 IEEE International Conference on Bioinformatics and … | 2023-12-05 |
820 | Diff-SFCT: A Diffusion Model with Spatial-Frequency Cross Transformer for Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Most existing semantic segmentation methods primarily employ supervised learning with discriminative models. Although these methods are straightforward, they overlook the modeling … |
YUXUAN JIANG et. al. | 2023 IEEE International Conference on Bioinformatics and … | 2023-12-05 |
821 | Adaptive Thresholding Based on Multi-task Learning for Refining Binary Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Binary medical image segmentation plays a pivotal role in the diagnosis and treatment of a wide range of diseases. However, the performance of the segmentation model is closely … |
Qin Lei; Rongzhen Li; Jiang Zhong; Chen Wang; Qizhu Dai; | 2023 IEEE International Conference on Bioinformatics and … | 2023-12-05 |
822 | SAM-Assisted Remote Sensing Imagery Semantic Segmentation with Object and Boundary Constraints IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a streamlined framework aimed at leveraging the raw output of SAM by exploiting two novel concepts called SAM-Generated Object (SGO) and SAM-Generated Boundary (SGB). |
XIANPING MA et. al. | arxiv-cs.CV | 2023-12-04 |
823 | SRSNetwork: Siamese Reconstruction-Segmentation Networks Based on Dynamic-Parameter Convolution Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a high-performance deep neural network for weak target image segmentation, including medical image segmentation and infrared image segmentation. |
BINGKUN NIAN et. al. | arxiv-cs.CV | 2023-12-04 |
824 | A Review and A Robust Framework of Data-Efficient 3D Scene Parsing with Traditional/Learned 3D Descriptors Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work presents a general and simple framework to tackle point cloud understanding when labels are limited. |
Kangcheng Liu; | arxiv-cs.CV | 2023-12-02 |
825 | CellMixer: Annotation-free Semantic Cell Segmentation of Heterogeneous Cell Populations Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present CellMixer, an innovative annotation-free approach for the semantic segmentation of heterogeneous cell populations. |
MEHDI NAOUAR et. al. | arxiv-cs.CV | 2023-12-01 |
826 | Generalized Robot 3D Vision-Language Model with Fast Rendering and Pre-Training Vision-Language Alignment Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This work presents a generalized and straightforward framework for dealing with 3D scene understanding when the labeled scenes are quite limited. |
Kangcheng Liu; Yong-Jin Liu; Baoquan Chen; | arxiv-cs.CV | 2023-12-01 |
827 | ClothSeg: Semantic Segmentation Network with Feature Projection for Clothing Parsing Related Papers Related Patents Related Grants Related Venues Related Experts View |
GUANGYU TANG et. al. | J. Vis. Commun. Image Represent. | 2023-12-01 |
828 | Twin-SegNet: Dynamically Coupled Complementary Segmentation Networks for Generalized Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View |
Shahed Ahmed; Md. Kamrul Hasan; | Comput. Vis. Image Underst. | 2023-12-01 |
829 | Semantic Segmentation in Thermal Videos: A New Benchmark and Multi-Granularity Contrastive Learning-Based Framework Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Video semantic segmentation has achieved great success, which is significant for road scene understanding. However, semantic segmentation remains challenging in poor illumination … |
Yu Zheng; F. Zhou; Shangying Liang; Wentao Song; X. Bai; | IEEE Transactions on Intelligent Transportation Systems | 2023-12-01 |
830 | Efficient Multimodal Semantic Segmentation Via Dual-Prompt Learning IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing approaches often fully fine-tune a dual-branch encoder-decoder framework with a complicated feature fusion strategy for achieving multimodal semantic segmentation, which is training-costly due to the massive parameter updates in feature extraction and fusion. To address this issue, we propose a surprisingly simple yet effective dual-prompt learning network (dubbed DPLNet) for training-efficient multimodal (e.g., RGB-D/T) semantic segmentation. |
SHAOHUA DONG et. al. | arxiv-cs.CV | 2023-12-01 |
831 | A Two-step Image Segmentation Based on Clone Selection Multi-object Emperor Penguin Optimizer for Fault Diagnosis of Power Transformer Related Papers Related Patents Related Grants Related Venues Related Experts View |
Zhikai Xing; Yigang He; | Expert Syst. Appl. | 2023-12-01 |
832 | 3D Semantic Segmentation of Aerial Photogrammetry Models Based on Orthographic Projection Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of 3D scenes is one of the most important tasks in the field of computer vision and has attracted much attention. In this paper, we propose a novel framework … |
Mengqi Rong; Shuhan Shen; | IEEE Transactions on Circuits and Systems for Video … | 2023-12-01 |
833 | Temporal Feature Matching and Propagation for Semantic Segmentation on 3D Point Cloud Sequences Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In real-world LiDAR-based applications, data is generated in the form of 3D point cloud sequences or 4D point clouds. However, the topic of semantic segmentation on 4D point … |
Hanyu Shi; Ruibo Li; Fayao Liu; Guosheng Lin; | IEEE Transactions on Circuits and Systems for Video … | 2023-12-01 |
834 | A Lightweight Clustering Framework for Unsupervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We thus propose a lightweight clustering framework for unsupervised semantic segmentation. |
Yau Shing Jonathan Cheung; Xi Chen; Lihe Yang; Hengshuang Zhao; | arxiv-cs.CV | 2023-11-30 |
835 | Spherical Frustum Sparse Convolution Network for LiDAR Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To avoid quantized information loss, in this paper, we propose a novel spherical frustum structure. |
Yu Zheng; Guangming Wang; Jiuming Liu; Marc Pollefeys; Hesheng Wang; | arxiv-cs.CV | 2023-11-29 |
836 | Continual Learning for Image Segmentation with Dynamic Query IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a simple, yet effective Continual Image Segmentation method with incremental Dynamic Query (CISDQ), which decouples the representation learning of both old and new knowledge with lightweight query embedding. |
WEIJIA WU et. al. | arxiv-cs.CV | 2023-11-29 |
837 | ALSTER: A Local Spatio-Temporal Expert for Online 3D Semantic Reconstruction Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose an online 3D semantic segmentation method that incrementally reconstructs a 3D semantic map from a stream of RGB-D frames. |
SILVAN WEDER et. al. | arxiv-cs.CV | 2023-11-29 |
838 | Emergent Open-Vocabulary Semantic Segmentation from Off-the-shelf Vision-Language Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, leveraging the learned association for open-vocabulary semantic segmentation remains a challenge. In this paper, we propose a simple, yet extremely effective, training-free technique, Plug-and-Play Open-Vocabulary Semantic Segmentation (PnP-OVSS) for this task. |
Jiayun Luo; Siddhesh Khandelwal; Leonid Sigal; Boyang Li; | arxiv-cs.CV | 2023-11-28 |
839 | ScribbleGen: Generative Data Augmentation Improves Scribble-supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose ScribbleGen, a generative data augmentation method that leverages a ControlNet diffusion model conditioned on semantic scribbles to produce high-quality training data. |
Jacob Schnell; Jieke Wang; Lu Qi; Vincent Tao Hu; Meng Tang; | arxiv-cs.CV | 2023-11-28 |
840 | Image Segmentation with Traveling Waves in An Exactly Solvable Recurrent Neural Network Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We show that this network generates sophisticated spatiotemporal dynamics that can effectively divide an image into groups according to a scene’s structural characteristics. Using an exact solution of the recurrent network’s dynamics, we present a precise description of the mechanism underlying object segmentation in this network, providing a clear mathematical interpretation of how the network performs this task. |
LUISA H. B. LIBONI et. al. | arxiv-cs.CV | 2023-11-28 |
841 | PiGPDS: Perspective Independent Ground Plane Detection and Segmentation for Complex 3D Indoor Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Ground plane detection and segmentation techniques can benefit and help improve the accuracy and robustness of a wide range of computer vision applications, from 3D object … |
Ali Ebrahimi; S. Czarnuch; | 2023 International Conference on Digital Image Computing: … | 2023-11-28 |
842 | 2D Feature Distillation for Weakly- and Semi-Supervised 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose an image-guidance network (IGNet) which builds upon the idea of distilling high level feature information from a domain adapted synthetically trained 2D semantic segmentation network. |
Ozan Unal; Dengxin Dai; Lukas Hoyer; Yigit Baran Can; Luc Van Gool; | arxiv-cs.CV | 2023-11-27 |
843 | FALCON: Fairness Learning Via Contrastive Attention Approach to Continual Semantic Scene Understanding in Open World Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Continual Learning in semantic scene segmentation aims to continually learn new unseen classes in dynamic environments while maintaining previously learned knowledge. Prior … |
Thanh-Dat Truong; Utsav Prabhu; Bhiksha Raj; Jackson Cothren; Khoa Luu; | ArXiv | 2023-11-27 |
844 | Adapter Is All You Need for Tuning Visual Tasks IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To find a competitive alternative to full fine-tuning, we propose the Multi-cognitive Visual Adapter (Mona) tuning, a novel adapter-based tuning method. |
Dongshuo Yin; Leiyi Hu; Bin Li; Youqun Zhang; | arxiv-cs.CV | 2023-11-25 |
845 | Segment (Almost) Nothing: Prompt-Agnostic Adversarial Attacks on Segmentation Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose instead to generate prompt-agnostic adversarial attacks by maximizing the $\ell_2$-distance, in the latent space, between the embedding of the original and perturbed images. |
Francesco Croce; Matthias Hein; | arxiv-cs.CV | 2023-11-24 |
846 | Language-guided Few-shot Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose an innovative solution to tackle the challenge of few-shot semantic segmentation using only language information, i.e.image-level text labels. |
Jing Wang; Yuang Liu; Qiang Zhou; Fan Wang; | arxiv-cs.CV | 2023-11-23 |
847 | SegVol: Universal and Interactive Volumetric Medical Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a 3D foundation segmentation model, named SegVol, supporting universal and interactive volumetric medical image segmentation. |
Yuxin Du; Fan Bai; Tiejun Huang; Bo Zhao; | arxiv-cs.CV | 2023-11-22 |
848 | Instance-aware 3D Semantic Segmentation Powered By Shape Generators and Classifiers Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we proposed a novel instance-aware approach for 3D semantic segmentation. |
Bo Sun; Qixing Huang; Xiangru Huang; | arxiv-cs.CV | 2023-11-20 |
849 | Generalized Category Discovery in Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We propose a straightforward yet effective framework that reinterprets the GCDSS challenge as a task of mask classification. |
ZHENGYUAN PENG et. al. | arxiv-cs.CV | 2023-11-19 |
850 | Optimizing Rgb-d Semantic Segmentation Through Multi-modal Interaction and Pooling Attention Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, in indoor environments, the simple input of RGB and depth images often results in a relatively limited acquisition of semantic and spatial information, leading to suboptimal segmentation outcomes. To address this, we propose the Multi-modal Interaction and Pooling Attention Network (MIPANet), a novel approach designed to harness the interactive synergy between RGB and depth modalities, optimizing the utilization of complementary information. |
Shuai Zhang; Minghong Xie; | arxiv-cs.CV | 2023-11-19 |
851 | Self-trained Panoptic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The aim of this work is to develop a framework to perform embedding-based self-supervised panoptic segmentation using self-training in a synthetic-to-real domain adaptation problem setting. |
Shourya Verma; | arxiv-cs.CV | 2023-11-17 |
852 | Labeling Indoor Scenes with Fusion of Out-of-the-Box Perception Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We aim to develop a cost-effective labeling approach to obtain pseudo-labels for semantic segmentation and object instance detection in indoor environments, with the ultimate goal of facilitating the training of lightweight models for various downstream tasks. |
Yimeng Li; Navid Rajabi; Sulabh Shrestha; Md Alimoor Reza; Jana Kosecka; | arxiv-cs.CV | 2023-11-17 |
853 | Remote Sensing Image Semantic Segmentation Method Based on A Deep Convolutional Neural Network and Multiscale Feature Fusion Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: There are many problems with remote sensing images, such as large data scales, complex illumination conditions, occlusion, and dense targets. The existing semantic segmentation … |
Guangzhen Zhang; Wangyang Jiang; | Int. J. Semantic Web Inf. Syst. | 2023-11-16 |
854 | Unlocking Early-Exiting Semantic Segmentation with Branched Networks Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Early-exit Deep Neural Networks (DNNs) and DNNs partitioning are valuable options to ease DNN implementation for image classification in resource-constrained devices and … |
Mateus S. Gilbert; R. G. Pacheco; R. S. Couto; M. L. R. D. Campos; Miguel Elias M. Campista; | 2023 IEEE Latin-American Conference on Communications … | 2023-11-15 |
855 | Semantic Segmentation Algorithm for Remote Sensing Images Based on Attention Mechanism Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper propose to combine the attention mechanism with the U-Net model to improve the performance and accuracy of semantic segmentation tasks. The attention mechanism can … |
Jionghui Jiang; Xi’an Feng; Hui Huang; | 2023 IEEE International Conference on Signal Processing, … | 2023-11-14 |
856 | 3DFusion, A Real-time 3D Object Reconstruction Pipeline Based on Streamed Instance Segmented Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To achieve real-time performance, the paper proposes a method that effectively samples consecutive frames to reduce network load while ensuring reconstruction quality. |
Xi Sun; Derek Jacoby; Yvonne Coady; | arxiv-cs.CV | 2023-11-11 |
857 | MeshNet-SP: A Semantic Urban 3D Mesh Segmentation Network with Sparse Prior Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: A textured urban 3D mesh is an important part of 3D real scene technology. Semantically segmenting an urban 3D mesh is a key task in the photogrammetry and remote sensing field. … |
Guangyun Zhang; Rongting Zhang; | Remote. Sens. | 2023-11-11 |
858 | FDNet: Feature Decoupled Segmentation Network for Tooth CBCT Image Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose FDNet, a Feature Decoupled Segmentation Network, to excel in the face of the variable dental conditions encountered in CBCT scans, such as complex artifacts and indistinct tooth boundaries. |
XIANG FENG et. al. | arxiv-cs.CV | 2023-11-11 |
859 | U3DS3: Unsupervised 3D Semantic Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Contemporary point cloud segmentation approaches largely rely on richly annotated 3D training data. However, it is both time-consuming and challenging to obtain consistently … |
Jiaxu Liu; Zhengdi Yu; T. Breckon; Hubert P. H. Shum; | 2024 IEEE/CVF Winter Conference on Applications of Computer … | 2023-11-10 |
860 | U3DS$^3$: Unsupervised 3D Semantic Scene Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper presents U3DS$^3$, as a step towards completely unsupervised point cloud segmentation for any holistic 3D scenes. |
Jiaxu Liu; Zhengdi Yu; Toby P. Breckon; Hubert P. H. Shum; | arxiv-cs.CV | 2023-11-10 |
861 | Lidar Annotation Is All You Need Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The work described in this paper aims to improve the efficiency of image segmentation using a convolutional neural network in a multi-sensor setup. |
Dinar Sharafutdinov; Stanislav Kuskov; Saian Protasov; Alexey Voropaev; | arxiv-cs.CV | 2023-11-08 |
862 | Pelvic Floor MRI Segmentation Based on Semi-supervised Deep Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Insufficient segmentation labels limit the precise segmentation and reconstruction of pelvic floor organs. To address these issues, we propose a semi-supervised framework for pelvic organ segmentation. |
JIANWEI ZUO et. al. | arxiv-cs.CV | 2023-11-06 |
863 | The Remote Sensing Image Segmentation of Land Cover Based on Multi-scale Attention Features Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Segmentation of land cover in remote sensing images is a task that involves interpreting remote sensing data using machine vision. Satisfying segmentation results in agriculture … |
Haiyang Hu; Linnan Yang; Jiaojiao Chen; Shuang Luo; | 2023 IEEE 35th International Conference on Tools with … | 2023-11-06 |
864 | PointNAC: Copula-Based Point Cloud Semantic Segmentation Network Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Three-dimensional point cloud data generally contain complex scene information and diversified category structures. Existing point cloud semantic segmentation networks tend to … |
CHUNYUAN DENG et. al. | Symmetry | 2023-11-06 |
865 | PotholeGuard: A Pothole Detection Approach By Point Cloud Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Our research presents an innovative point cloud-based pothole segmentation architecture. |
SAHIL NAWALE et. al. | arxiv-cs.CV | 2023-11-05 |
866 | ISAR: A Benchmark for Single- and Few-Shot Object Instance Segmentation and Re-Identification Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: So far there is neither a method fulfilling all of these requirements in unison nor a benchmark that could be used to test such a method. Addressing this, we propose ISAR, a benchmark and baseline method for single- and few-shot object Instance Segmentation And Re-identification, in an effort to accelerate the development of algorithms that can robustly detect, segment, and re-identify objects from a single or a few sparse training examples. |
Nicolas Gorlo; Kenneth Blomqvist; Francesco Milano; Roland Siegwart; | arxiv-cs.CV | 2023-11-05 |
867 | MemorySeg: Online LiDAR Semantic Segmentation with A Latent Memory IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we tackle the challenge of exploiting the information from the past frames to improve the predictions of the current frame in an online fashion. |
Enxu Li; Sergio Casas; Raquel Urtasun; | arxiv-cs.CV | 2023-11-02 |
868 | Customizing SAM for Histologic Image Segmentation of Kidney Biopsy By A Detector Approach: Det-SAM Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, large models have manifested unparalleled strengths in various fields and became a major trend in AI advancement. In the field of computer vision, the Segment … |
XUJIA NING et. al. | 2023 International Conference on Cyber-Enabled Distributed … | 2023-11-02 |
869 | Real-time Semantic Segmentation in Traffic Scene Using Cross Stage Partial-based Encoder-decoder Network Related Papers Related Patents Related Grants Related Venues Related Experts View |
Liguo Zhou; Guang Chen; Lian Liu; Ruining Wang; Alois Knoll; | Eng. Appl. Artif. Intell. | 2023-11-01 |
870 | Towards Dynamic Backdoor Attacks Against LiDAR Semantic Segmentation in Autonomous Driving Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: LiDAR perception is widely deployed in high-level autonomous vehicles (AVs) to gain accurate information about the driving environment, where 3D semantic segmentation plays a … |
Shuai Li; Yu Wen; Xu Cheng; | 2023 IEEE 22nd International Conference on Trust, Security … | 2023-11-01 |
871 | FastICENet: A Real-time and Accurate Semantic Segmentation Model for Aerial Remote Sensing River Ice Image Related Papers Related Patents Related Grants Related Venues Related Experts View |
XIUWEI ZHANG et. al. | Signal Process. | 2023-11-01 |
872 | Joint Depth Prediction and Semantic Segmentation with Multi-View SAM Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: With this work we propose a Multi-View Stereo (MVS) technique for depth prediction that benefits from rich semantic features of the Segment Anything Model (SAM). |
Mykhailo Shvets; Dongxu Zhao; Marc Niethammer; Roni Sengupta; Alexander C. Berg; | arxiv-cs.CV | 2023-10-31 |
873 | SACuP: Sonar Image Augmentation with Cut and Paste Based DataBank for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we introduce Sonar image Augmentation with Cut and Paste based DataBank for semantic segmentation (SACuP), a novel data augmentation framework specifically designed … |
Sundong Park; Yoonyoung Choi; Hyoseok Hwang; | Remote. Sens. | 2023-10-31 |
874 | MMGLOTS: Multi-Modal Global-Local Transformer Segmentor for Remote Sensing Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Multi-modal semantic segmentation of remote sensing (RS) images is a challenging task due to the complex relationship between different modalities and the large intra-class … |
Yuheng Liu; Ye Wang; Yifan Zhang; Shaohui Mei; | 2023 13th Workshop on Hyperspectral Imaging and Signal … | 2023-10-31 |
875 | Train One, Generalize to All: Generalizable Semantic Segmentation from Single-Scene to All Adverse Scenes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Unsupervised Domain Adaptation (UDA) for semantic segmentation has received widespread attention for its ability to transfer knowledge from the source to target domains without a … |
ZIYANG GONG et. al. | Proceedings of the 31st ACM International Conference on … | 2023-10-26 |
876 | SmooSeg: Smoothness Prior for Unsupervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we demonstrate that the smoothness prior, asserting that close features in a metric space share the same semantics, can significantly simplify segmentation by casting unsupervised semantic segmentation as an energy minimization problem. Under this paradigm, we propose a novel approach called SmooSeg that harnesses self-supervised learning methods to model the closeness relationships among observations as smoothness signals. |
MENGCHENG LAN et. al. | arxiv-cs.CV | 2023-10-26 |
877 | Characters Link Shots: Character Attention Network for Movie Scene Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Movie scene segmentation aims to automatically segment a movie into multiple story units, i.e., scenes, each of which is a series of semantically coherent and time-continual … |
Jiawei Tan; Hongxing Wang; Junsong Yuan; | ACM Transactions on Multimedia Computing, Communications … | 2023-10-26 |
878 | 4D-Editor: Interactive Object-level Editing in Dynamic Neural Radiance Fields Via 4D Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper targets interactive object-level editing(e.g., deletion, recoloring, transformation, composition) in dynamic scenes. Recently, some methods aiming for flexible editing … |
Dadong Jiang; Zhihui Ke; Xiaobo Zhou; Xidong Shi; | ArXiv | 2023-10-25 |
879 | Segmentation of Retinal Images Using Improved Segmentation Network, MesU-Net Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Given the immense importance of medical image segmentation and the challenges associated with manual execution, a diverse range of automated medical image segmentation methods … |
Anitha T. Nair; A. M L; Arun Kumar M. N.; | Int. J. Online Biomed. Eng. | 2023-10-25 |
880 | SAMRS: Scaling-up Remote Sensing Segmentation Dataset with Segment Anything Model IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this study, we leverage SAM and existing RS object detection datasets to develop an efficient pipeline for generating a large-scale RS segmentation dataset, dubbed SAMRS. |
DI WANG et. al. | nips | 2023-10-24 |
881 | Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose $\texttt{ARCO}$, a semi-supervised contrastive learning (CL) framework with stratified group theory for medical image segmentation. |
CHENYU YOU et. al. | nips | 2023-10-24 |
882 | ClusterFomer: Clustering As A Universal Visual Learner IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents ClusterFormer, a universal vision model that is based on the Clustering paradigm with TransFormer. |
JAMES LIANG et. al. | nips | 2023-10-24 |
883 | Label-efficient Segmentation Via Affinity Propagation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we formulate the affinity modeling task as an affinity propagation process, and consequently propose both local and global pairwise affinity terms to generate accurate soft pseudo labels. |
WENTONG LI et. al. | nips | 2023-10-24 |
884 | GNeSF: Generalizable Neural Semantic Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, existing approaches still requires expensive per-scene optimization that prohibits generalization to novel scenes during inference. To circumvent this problem, we introduce a generalizable 3D segmentation framework based on implicit representation. |
Hanlin Chen; Chen Li; Mengqi Guo; Zhiwen Yan; Gim Hee Lee; | arxiv-cs.CV | 2023-10-24 |
885 | OV-PARTS: Towards Open-Vocabulary Part Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Furthermore, the large-scale vision and language models, which play a key role in the open vocabulary setting, struggle to recognize parts as effectively as objects. To comprehensively investigate and tackle these challenges, we propose an Open-Vocabulary Part Segmentation (OV-PARTS) benchmark. |
MENG WEI et. al. | nips | 2023-10-24 |
886 | Uncertainty Estimation for Safety-critical Scene Segmentation Via Fine-grained Reward Maximization Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a novel fine-grained reward maximization (FGRM) framework, to address uncertainty estimation by directly utilizing an uncertainty metric related reward function with a reinforcement learning based model tuning paradigm. |
Hongzheng Yang; Cheng Chen; Yueyao CHEN; Hon Chi Yip; DOU QI; | nips | 2023-10-24 |
887 | Open Compound Domain Adaptation with Object Style Compensation for Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes the Object Style Compensation, where we construct the Object-Level Discrepancy Memory with multiple sets of discrepancy features. |
TINGLIANG FENG et. al. | nips | 2023-10-24 |
888 | AIMS: All-Inclusive Multi-Level Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a new task, All-Inclusive Multi-Level Segmentation (AIMS), which segments visual regions into three levels: part, entity, and relation (two entities with some semantic relationships). |
LU QI et. al. | nips | 2023-10-24 |
889 | Bridging Semantic Gaps for Language-Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Such semantic misalignment circulates in pre-training, leading to inferior zero-shot performance in dense predictions due to insufficient visual concepts captured in textual representations. To close such semantic gap, we propose Concept Curation (CoCu), a pipeline that leverages CLIP to compensate for the missing semantics. |
YUN XING et. al. | nips | 2023-10-24 |
890 | Fairness Continual Learning Approach to Semantic Scene Understanding in Open-World Environments IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a novel Fairness Continual Learning approach to the semantic segmentation problem. |
Thanh-Dat Truong; Hoang-Quan Nguyen; Bhiksha Raj; Khoa Luu; | nips | 2023-10-24 |
891 | CPSeg: Finer-grained Image Semantic Segmentation Via Chain-of-Thought Language Prompting IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Natural scene analysis and remote sensing imagery offer immense potential for advancements in large-scale language-guided context-aware data utilization. This potential is … |
Lei Li; | 2024 IEEE/CVF Winter Conference on Applications of Computer … | 2023-10-24 |
892 | Bridging The Domain Gap: Self-Supervised 3D Scene Understanding with Foundation Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, their potential to enrich 3D scene representation learning is largely untapped due to the existence of the domain gap. In this work, we propose an innovative methodology called Bridge3D to address this gap by pre-training 3D models using features, semantic masks, and captions sourced from foundation models. |
Zhimin Chen; Bing Li; | nips | 2023-10-24 |
893 | Augmentation-free Dense Contrastive Distillation for Efficient Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Existing methods heavily rely on data augmentation and memory buffer, which entail high computational resource demands when applying them to handle semantic segmentation that requires to preserve high-resolution feature maps for making dense pixel-wise predications. In order to alleviate this problem, we present Augmentation-free Dense Contrastive Knowledge Distillation (Af-DCD), a new contrastive distillation learning paradigm to train compact and accurate deep neural networks for semantic segmentation applications. |
Jiawei Fan; Chao Li; Xiaolong Liu; Meina Song; Anbang Yao; | nips | 2023-10-24 |
894 | Segment Everything Everywhere All at Once IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we present SEEM, a promotable and interactive model for segmenting everything everywhere all at once in an image. |
XUEYAN ZOU et. al. | nips | 2023-10-24 |
895 | Pixel-Level Clustering Network for Unsupervised Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present a pixel-level clustering framework for segmenting images into regions without using ground truth annotations. |
Cuong Manh Hoang; Byeongkeun Kang; | arxiv-cs.CV | 2023-10-24 |
896 | SOC: Semantic-Assisted Object Cluster for Referring Video Object Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, the lack of a global view of video content leads to difficulties in effectively utilizing inter-frame relationships and understanding textual descriptions of object temporal variations. To address this issue, we propose Semantic-assisted Object Cluster (SOC), which aggregates video content and textual guidance for unified temporal modeling and cross-modal alignment. |
ZHUOYAN LUO et. al. | nips | 2023-10-24 |
897 | OpenMask3D: Open-Vocabulary 3D Instance Segmentation IF:4 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: While such a representation can be directly employed to perform semantic segmentation, existing methods have limitations in their ability to handle object instances. In this work, we address this limitation, and propose OpenMask3D, which is a zero-shot approach for open-vocabulary 3D instance segmentation. |
AYCA TAKMAZ et. al. | nips | 2023-10-24 |
898 | Weakly-Supervised Concealed Object Segmentation with SAM-based Pseudo Labeling and Multi-scale Feature Grouping IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: It remains a challenging task since (1) it is hard to distinguish concealed objects from the background due to the intrinsic similarity and (2) the sparsely-annotated training data only provide weak supervision for model learning. In this paper, we propose a new WSCOS method to address these two challenges. |
CHUNMING HE et. al. | nips | 2023-10-24 |
899 | P2AT: Pyramid Pooling Axial Transformer for Real-time Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a real-time semantic segmentation architecture named Pyramid Pooling Axial Transformer (P2AT). |
Mohammed A. M. Elhassan; Changjun Zhou; Amina Benabid; Abuzar B. M. Adam; | arxiv-cs.CV | 2023-10-23 |
900 | RT-YOSO: Revisiting YOSO for Real-time Panoptic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Panoptic segmentation, a crucial computer vision task for scene understanding, simultaneously combines semantic segmentation and instance segmentation to classify pixels and … |
Abdallah Ammar; Mahmoud I. Khalil; Cherif R. Salama; | 2023 5th Novel Intelligent and Leading Emerging Sciences … | 2023-10-21 |
901 | Weakly-Supervised Semantic Segmentation with Image-Level Labels: from Traditional Models to Foundation Models Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We scrutinize SAM in two intriguing scenarios: text prompting and zero-shot learning. We provide insights into the potential and challenges of deploying visual foundational models for WSSS, facilitating future developments in this exciting research area. |
Zhaozheng Chen; Qianru Sun; | arxiv-cs.CV | 2023-10-19 |
902 | Lidar Panoptic Segmentation and Tracking Without Bells and Whistles Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we re-think this approach and propose a surprisingly simple yet effective detection-centric network for both LPS and tracking. |
ABHINAV AGARWALLA et. al. | arxiv-cs.CV | 2023-10-19 |
903 | Minimalist and High-Performance Semantic Segmentation with Plain Vision Transformers Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: As a result, we introduce the PlainSeg, a model comprising only three 3$\times$3 convolutions in addition to the transformer layers (either encoder or decoder). |
Yuanduo Hong; Jue Wang; Weichao Sun; Huihui Pan; | arxiv-cs.CV | 2023-10-19 |
904 | SegmATRon: Embodied Adaptive Semantic Segmentation for Indoor Environment Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents an adaptive transformer model named SegmATRon for embodied image semantic segmentation. |
Tatiana Zemskova; Margarita Kichik; Dmitry Yudin; Aleksei Staroverov; Aleksandr Panov; | arxiv-cs.CV | 2023-10-18 |
905 | Loci-Segmented: Improving Scene Segmentation Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Current slot-oriented approaches for compositional scene segmentation from images and videos rely on provided background information or slot assignments. We present a segmented location and identity tracking system, Loci-Segmented (Loci-s), which does not require either of this information. |
Manuel Traub; Frederic Becker; Adrian Sauter; Sebastian Otte; Martin V. Butz; | arxiv-cs.CV | 2023-10-16 |
906 | Volumetric Medical Image Segmentation Via Scribble Annotations and Shape Priors Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Furthermore, most current methods are designed for 2D image segmentation, which do not fully leverage the volumetric information if directly applied to each image slice. In this paper, we propose a scribble-based volumetric image segmentation, Scribble2D5, which tackles 3D anisotropic image segmentation and aims to its improve boundary prediction. |
Qiuhui Chen; Haiying Lyu; Xinyue Hu; Yong Lu; Yi Hong; | arxiv-cs.CV | 2023-10-12 |
907 | Detection and Mapping of Chestnut Using Deep Learning from High-Resolution UAV-Based RGB Imagery Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The semantic segmentation method based on high-resolution RGB images obtained by unmanned aerial vehicle (UAV) provides a cost-effective way to improve the accuracy of detection … |
Yifei Sun; Zhenbang Hao; Zhanbao Guo; Zhenhu Liu; Jiaxing Huang; | Remote. Sens. | 2023-10-12 |
908 | S4C: Self-Supervised Semantic Scene Completion with Neural Fields IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This process relies on special sensors and annotation by hand which are costly and do not scale well. To overcome this issue, our work presents the first self-supervised approach to SSC called S4C that does not rely on 3D ground truth data. |
Adrian Hayler; Felix Wimbauer; Dominik Muhle; Christian Rupprecht; Daniel Cremers; | arxiv-cs.CV | 2023-10-11 |
909 | A LiDAR Semantic Segmentation Framework for The Cooperative Vehicle-Infrastructure System Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: LiDAR semantic segmentation plays an important role in 3D scene understanding for autonomous driving. However, the performance based on the LiDAR equipped on a vehicle may be … |
Hongwei Liu; Zihao Gu; Chao Wang; Ping Wang; D. Vukobratović; | 2023 IEEE 98th Vehicular Technology Conference … | 2023-10-10 |
910 | Zero-Shot Open-Vocabulary Tracking with Large Pre-Trained Models Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we re-purpose an open-vocabulary detector, segmenter, and dense optical flow estimator, into a model that tracks and segments objects of any category in 2D videos. |
WEN-HSUAN CHU et. al. | arxiv-cs.CV | 2023-10-10 |
911 | Densely Connected Swin-UNet for Multiscale Information Aggregation in Medical Image Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Image semantic segmentation is a dense prediction task in computer vision that is dominated by deep learning techniques in recent years. UNet, which is a symmetric encoder-decoder … |
Ziyang Wang; Meiwen Su; Jian-Qing Zheng; Yang Liu; | 2023 IEEE International Conference on Image Processing … | 2023-10-08 |
912 | Geometry Aware Field-to-field Transformations for 3D Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a novel approach to perform 3D semantic segmentation solely from 2D supervision by leveraging Neural Radiance Fields (NeRFs). |
Dominik Hollidt; Clinton Wang; Polina Golland; Marc Pollefeys; | arxiv-cs.CV | 2023-10-08 |
913 | Towards Dynamic and Small Objects Refinement for Unsupervised Domain Adaptative Nighttime Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper proposes a novel UDA method that refines both label and feature levels for dynamic and small objects for nighttime semantic segmentation. |
Jingyi Pan; Sihang Li; Yucheng Chen; Jinjing Zhu; Lin Wang; | arxiv-cs.CV | 2023-10-07 |
914 | A Deeply Supervised Semantic Segmentation Method Based on GAN Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this study, we propose an improved semantic segmentation model that combines the strengths of adversarial learning with state-of-the-art semantic segmentation techniques. |
Wei Zhao; Qiyu Wei; Zeng Zeng; | arxiv-cs.CV | 2023-10-06 |
915 | DiffPrompter: Differentiable Implicit Visual Prompts for Semantic-Segmentation in Adverse Conditions Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We introduce DiffPrompter, a novel differentiable visual and latent prompting mechanism aimed at expanding the learning capabilities of existing adaptors in foundation models. |
SANKET KALWAR et. al. | arxiv-cs.CV | 2023-10-06 |
916 | Ablation Study to Clarify The Mechanism of Object Segmentation in Multi-Object Representation Learning Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Multi-object representation learning aims to represent complex real-world visual input using the composition of multiple objects. Representation learning methods have often used unsupervised learning to segment an input image into individual objects and encode these objects into each latent vector. |
Takayuki Komatsu; Yoshiyuki Ohmura; Yasuo Kuniyoshi; | arxiv-cs.CV | 2023-10-04 |
917 | Flooded Area Segmentation on Remote Sensing Image from Unmanned Aerial Vehicles (UAV) Using DeepLabV3 and EfficientNet-B4 Model Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Climate change caused by global warming results in increased rainfall and has the potential to cause flooding. Floods are natural disasters that often occur in Indonesia and can … |
Riskyana Dewi Intan Puspitasari; Fadhilah Qalbi Annisa; Danang Ariyanto; | 2023 International Conference on Computer, Control, … | 2023-10-04 |
918 | Improved Approach for Semantic Segmentation of MBRSC Aerial Imagery Based on Transfer Learning and Modified UNet Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Aerial imagery has emerged in numerous fields such as sustainable development, forestry, urban planning, agriculture, earth science and climate research. Extracting relevant … |
Khawla Ben Salah; Mohamed Othmani; Selma Saida; M. Kherallah; | 2023 International Conference on Cyberworlds (CW) | 2023-10-03 |
919 | TransRadar: Adaptive-Directional Transformer for Real-Time Multi-View Radar Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we propose a novel approach to the semantic segmentation of radar scenes using a multi-input fusion of radar data through a novel architecture and loss functions that are tailored to tackle the drawbacks of radar perception. |
Yahia Dalbah; Jean Lahoud; Hisham Cholakkal; | arxiv-cs.CV | 2023-10-03 |
920 | STARS: Zero-shot Sim-to-Real Transfer for Segmentation of Shipwrecks in Sonar Imagery Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we address the problem of sim-to-real transfer for object segmentation when there is no access to real examples of an object of interest during training, i.e. zero-shot sim-to-real transfer for segmentation. |
Advaith Venkatramanan Sethuraman; Katherine A. Skinner; | arxiv-cs.CV | 2023-10-02 |
921 | Semantically Enhanced Scene Captions with Physical and Weather Condition Changes Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Vision-Language models (VLMs), i.e., image-text pairs of CLIP, have boosted image-based Deep Learning (DL). Moreover, Visual-Question-Answer (VQA) tools and open-vocabulary … |
Hidetomo Sakaino; | 2023 IEEE/CVF International Conference on Computer Vision … | 2023-10-02 |
922 | Semantic Motif Segmentation of Archaeological Fresco Fragments Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Archaeological fragment processing is crucial to support the analysis of pictorial contents of broken artifacts. In this paper, we focus on the unexplored task of semantic … |
Aref Enayati; Luca Palmieri; S. Vascon; M. Pelillo; Sinem Aslan; | 2023 IEEE/CVF International Conference on Computer Vision … | 2023-10-02 |
923 | Large-scale Apple Orchard Mapping from Multi-source Data Using The Semantic Segmentation Model with Image- To- Image Translation and Transfer Learning Related Papers Related Patents Related Grants Related Venues Related Experts View |
TINGTING ZHANG et. al. | Comput. Electron. Agric. | 2023-10-01 |
924 | Lightweight Semantic Segmentation Network for Semantic Scene Understanding on Low-Compute Devices Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic scene understanding is beneficial for mobile robots. Semantic information obtained through onboard cameras can improve robots’ navigation performance. However, obtaining … |
H. Son; James Weiland; | 2023 IEEE/RSJ International Conference on Intelligent … | 2023-10-01 |
925 | Elastic Interaction Energy-Informed Real-Time Traffic Scene Perception Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, a simple and efficient topology-aware energy loss function-based network training strategy named EIEGSeg is proposed. |
Yaxin Feng; Yuan Lan; Luchan Zhang; Guoqing Liu; Yang Xiang; | arxiv-cs.CV | 2023-10-01 |
926 | CompUDA: Compositional Unsupervised Domain Adaptation for Semantic Segmentation Under Adverse Conditions Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In autonomous driving, performing robust semantic segmentation under adverse weather conditions is a long-standing challenge. Imperfect camera observations under adverse … |
Ziqiang Zhengl; Yingshu Chen; Binh-Son Hua; Sai-Kit Yeung; | 2023 IEEE/RSJ International Conference on Intelligent … | 2023-10-01 |
927 | LiDAR-SGMOS: Semantics-Guided Moving Object Segmentation with 3D LiDAR Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Most of the existing moving object segmentation (MOS) methods regard MOS as an independent task, in this paper, we associate the MOS task with semantic segmentation, and propose a … |
Shuo Gu; Suling Yao; Jian Yang; Chengzhong Xu; Hui Kong; | 2023 IEEE/RSJ International Conference on Intelligent … | 2023-10-01 |
928 | TROSD: A New RGB-D Dataset for Transparent and Reflective Object Segmentation in Practice IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Transparent and reflective objects are omnipresent in our daily life, but their unique visual and optical characteristics are notoriously challenging even for state-of-the-art … |
Tianyu Sun; Guodong Zhang; Wenming Yang; Jing-Hao Xue; Guijin Wang; | IEEE Transactions on Circuits and Systems for Video … | 2023-10-01 |
929 | Propagating Semantic Labels in Video Data Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This work presents a method for performing segmentation for objects in video. |
David Balaban; Justin Medich; Pranay Gosar; Justin Hart; | arxiv-cs.CV | 2023-10-01 |
930 | Polarimetric Synthetic Aperture Radar Image Semantic Segmentation Network with Lovász-Softmax Loss Optimization Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The deep learning technique has already been successfully applied in the field of microwave remote sensing. Especially, convolutional neural networks have demonstrated remarkable … |
Rui Guo; Xiaopeng Zhao; Guanzhong Zuo; Ying Wang; Yi Liang; | Remote. Sens. | 2023-10-01 |
931 | Improving 6D Object Pose Estimation Based on Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: The performance of 6D pose estimation, which is important for scene understanding, can be improved by more accurate object segmentation. RGB-D data including depth maps can … |
Fang Gao; Qiujun Li; Qingyi Sun; | 2023 IEEE International Conference on Systems, Man, and … | 2023-10-01 |
932 | Generalized Few-shot Semantic Segmentation for LiDAR Point Clouds Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of LiDAR point clouds can provide assistance for precise perception in autonomous driving, but traditional segmentation methods face challenges such as … |
Pengze Wu; Jilin Mei; Xijun Zhao; Yu Hu; | 2023 IEEE/RSJ International Conference on Intelligent … | 2023-10-01 |
933 | Few-Shot Segmentation and Semantic Segmentation for Underwater Imagery Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: This paper tackles image segmentation problems for underwater environments. First, we introduce a novel under-water animal-centric dataset with dense pixel-level annotations … |
IMRAN KABIR et. al. | 2023 IEEE/RSJ International Conference on Intelligent … | 2023-10-01 |
934 | Knowledge Distillation for Efficient Panoptic Semantic Segmentation: Applied to Agriculture Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Panoptic segmentation provides both holistic and detailed image parsing information at both the pixel and the instance level. However, the computational burdens restrict its … |
Maohui Li; Michael Halstead; Chris McCool; | 2023 IEEE/RSJ International Conference on Intelligent … | 2023-10-01 |
935 | An Easy Zero-shot Learning Combination: Texture Sensitive Semantic Segmentation IceHrNet and Advanced Style Transfer Learning Strategy Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We proposed an easy method of Zero-Shot semantic segmentation by using style transfer. |
ZHIYONG YANG et. al. | arxiv-cs.CV | 2023-09-30 |
936 | Dual-Augmented Transformer Network for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we explore the dual-augmented transformer network with self-regularization constraints for WSSS. |
Jingliang Deng; Zonghan Li; | arxiv-cs.CV | 2023-09-30 |
937 | SGNet: A Fast and Accurate Semantic Segmentation Network Based on Semantic Guidance Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we design a fast and accurate lightweight semantic segmentation method for mobile robots. Accurate semantic segmentation usually requires obtaining high-resolution … |
HENGYU LI et. al. | Advanced Robotics | 2023-09-29 |
938 | SegRCDB: Semantic Segmentation Via Formula-Driven Supervised Learning Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose the Segmentation Radial Contour DataBase (SegRCDB), which for the first time applies formula-driven supervised learning for semantic segmentation. |
RISA SHINODA et. al. | arxiv-cs.CV | 2023-09-29 |
939 | Perceptual Cue-guided Adaptive Image Downscaling for Enhanced Semantic Segmentation on Large Document Images Related Papers Related Patents Related Grants Related Venues Related Experts View |
Chulwoo Pack; Leen-Kiat Soh; Elizabeth M. Lorang; | Int. J. Document Anal. Recognit. | 2023-09-28 |
940 | COMNet: Co-Occurrent Matching for Weakly Supervised Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose a novel Co-Occurrent Matching Network (COMNet), which can promote the quality of the CAMs and enforce the network to pay attention to the entire parts of objects. |
Yukun Su; Jingliang Deng; Zonghan Li; | arxiv-cs.CV | 2023-09-28 |
941 | Model2Scene: Learning 3D Scene Representation Via Contrastive Language-CAD Models Pre-training Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we propose Model2Scene, a novel paradigm that learns free 3D scene representation from Computer-Aided Design (CAD) models and languages. |
RUNNAN CHEN et. al. | arxiv-cs.CV | 2023-09-28 |
942 | SATR: Zero-Shot Semantic Segmentation of 3D Shapes IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We explore the task of zero-shot semantic segmentation of 3D shapes by using large-scale off-the-shelf 2D im- age recognition models. |
Ahmed Abdelreheem; Ivan Skorokhodov; Maks Ovsjanikov; Peter Wonka; | iccv | 2023-09-27 |
943 | Class-incremental Continual Learning for Instance Segmentation with Image-level Weak Supervision Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a continual-learning method to segment object instances from image-level labels. |
YU-HSING HSIEH et. al. | iccv | 2023-09-27 |
944 | Segment Every Reference Object in Spatial and Temporal Spaces IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this work, we end the current fragmented situation and propose UniRef to unify the three reference-based object segmentation tasks with a single architecture. |
JIANNAN WU et. al. | iccv | 2023-09-27 |
945 | High Quality Entity Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Given the high-quality and -resolution nature of the dataset, we propose CropFormer which is designed to tackle the intractability of instance-level segmentation on high-resolution images. |
LU QI et. al. | iccv | 2023-09-27 |
946 | Coarse-to-Fine Amodal Segmentation with Shape Prior IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Amodal object segmentation is a challenging task that involves segmenting both visible and occluded parts of an object. In this paper, we propose a novel approach, called Coarse-to-Fine Segmentation (C2F-Seg), that addresses this problem by progressively modeling the amodal segmentation. |
JIANXIONG GAO et. al. | iccv | 2023-09-27 |
947 | CPCM: Contextual Point Cloud Modeling for Weakly-supervised Point Cloud Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose a simple yet effective Contextual Point Cloud Modeling (CPCM) method that consists of two parts: a region-wise masking (RegionMask) strategy and a contextual masked training (CMT) method. |
LIZHAO LIU et. al. | iccv | 2023-09-27 |
948 | Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper introduces the first two landmark pixel retrieval benchmarks. |
GUOYUAN AN et. al. | iccv | 2023-09-27 |
949 | Parametric Depth Based Feature Representation Learning for Object Detection and Segmentation in Bird’s-Eye View Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In contrast, we propose to use parametric depth distribution modeling for feature transformation. |
Jiayu Yang; Enze Xie; Miaomiao Liu; Jose M. Alvarez; | iccv | 2023-09-27 |
950 | Re:PolyWorld – A Graph Neural Network for Polygonal Scene Parsing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: The objective of this work was to overcome weaknesses and shortcomings of the original model, as well as introducing an improved polygonal representation to obtain a general-purpose method for polygon extraction in images. |
Stefano Zorzi; Friedrich Fraundorfer; | iccv | 2023-09-27 |
951 | CMDA: Cross-Modality Domain Adaptation for Nighttime Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To this end, we propose a novel unsupervised Cross-Modality Domain Adaptation (CMDA) framework to leverage multi-modality (Images and Events) information for nighttime semantic segmentation, with only labels on daytime images. |
RUIHAO XIA et. al. | iccv | 2023-09-27 |
952 | Stochastic Segmentation with Conditional Categorical Diffusion Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this context, stochastic semantic segmentation methods must learn to predict conditional distributions of labels given the image, but this is challenging due to the typically multimodal distributions, high-dimensional output spaces, and limited annotation data. To address these challenges, we propose a conditional categorical diffusion model (CCDM) for semantic segmentation based on Denoising Diffusion Probabilistic Models. |
LUKAS ZBINDEN et. al. | iccv | 2023-09-27 |
953 | Weakly Supervised Referring Image Segmentation with Intra-Chunk and Inter-Chunk Consistency Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present a weakly supervised learning method for RIS that only uses readily available image-text pairs. |
JUNGBEOM LEE et. al. | iccv | 2023-09-27 |
954 | SegGPT: Towards Segmenting Everything in Context IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We present SegGPT, a generalist model for segmenting everything in context. |
XINLONG WANG et. al. | iccv | 2023-09-27 |
955 | Exploring Open-Vocabulary Semantic Segmentation from CLIP Vision Encoder Distillation Only IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: However, existing approaches often rely on expensive human annotations as supervision for model training, limiting their scalability to large, unlabeled datasets. To address this challenge, we present ZeroSeg, a novel method that leverages the existing pretrained vision-language (VL) model (e.g. CLIP vision encoder) to train open-vocabulary zero-shot semantic segmentation models. |
JUN CHEN et. al. | iccv | 2023-09-27 |
956 | Learning Cross-Modal Affinity for Referring Video Object Segmentation Targeting Limited Samples Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: However, in more realistic scenarios, only minimal annotations are available for a new scene, which poses significant challenges to existing RVOS methods. With this in mind, we propose a simple yet effective model with a newly designed cross-modal affinity (CMA) module based on a Transformer architecture. |
Guanghui Li; Mingqi Gao; Heng Liu; Xiantong Zhen; Feng Zheng; | iccv | 2023-09-27 |
957 | Domain Generalization of 3D Semantic Segmentation in Autonomous Driving IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Despite its importance, domain generalization is relatively unexplored in the case of 3D autonomous driving semantic segmentation. To fill this gap, this paper presents the first benchmark for this application by testing state-of-the-art methods and discussing the difficulty of tackling Laser Imaging Detection and Ranging (LiDAR) domain shifts. |
Jules Sanchez; Jean-Emmanuel Deschaud; François Goulette; | iccv | 2023-09-27 |
958 | Channel Affinity Knowledge Distillation for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, convolutional neural networks have achieved significant success in computer vision tasks. However, the deployment of these algorithms remains challenging. … |
HUAKUN LI et. al. | 2023 IEEE 25th International Workshop on Multimedia Signal … | 2023-09-27 |
959 | Instance Neural Radiance Field IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents one of the first learning-based NeRF 3D instance segmentation pipelines, dubbed as Instance Neural Radiance Field, or Instance-NeRF. |
Yichen Liu; Benran Hu; Junkai Huang; Yu-Wing Tai; Chi-Keung Tang; | iccv | 2023-09-27 |
960 | 3D Segmentation of Humans in Point Clouds with Synthetic Data IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Few works have attempted to directly segment humans in cluttered 3D scenes, which is largely due to the lack of annotated training data of humans interacting with 3D scenes. We address this challenge and propose a framework for generating training data of synthetic humans interacting with real 3D scenes. |
AYÇA TAKMAZ et. al. | iccv | 2023-09-27 |
961 | MasQCLIP for Open-Vocabulary Universal Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present a new method for open-vocabulary universal image segmentation, which is capable of performing instance, semantic, and panoptic segmentation under a unified framework. |
Xin Xu; Tianyi Xiong; Zheng Ding; Zhuowen Tu; | iccv | 2023-09-27 |
962 | UniSeg: A Unified Multi-Modal LiDAR Segmentation Network and The OpenPCSeg Codebase IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we present a unified multi-modal LiDAR segmentation network, termed UniSeg, which leverages the information of RGB images and three views of the point cloud, and accomplishes semantic segmentation and panoptic segmentation simultaneously. |
YOUQUAN LIU et. al. | iccv | 2023-09-27 |
963 | Open-vocabulary Object Segmentation with Diffusion Models IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: The goal of this paper is to extract the visual-language correspondence from a pre-trained text-to-image diffusion model, in the form of segmentation map, i.e., simultaneously generating images and segmentation masks for the corresponding visual entities described in the text prompt. |
ZIYI LI et. al. | iccv | 2023-09-27 |
964 | Tracking Anything with Decoupled Video Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To ‘track anything’ without training on video data for every individual task, we develop a decoupled video segmentation approach (DEVA), composed of task-specific image-level segmentation and class/task-agnostic bi-directional temporal propagation. |
Ho Kei Cheng; Seoung Wug Oh; Brian Price; Alexander Schwing; Joon-Young Lee; | iccv | 2023-09-27 |
965 | LD-ZNet: A Latent Diffusion Approach for Text-Based Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This is because they have to synthesize intricate details about all objects in an image based on a text description. Therefore, we present a technique for segmenting real and AI-generated images using latent diffusion models (LDMs) trained on internet-scale datasets. |
Koutilya PNVR; Bharat Singh; Pallabi Ghosh; Behjat Siddiquie; David Jacobs; | iccv | 2023-09-27 |
966 | Boosting Semantic Segmentation from The Perspective of Explicit Class Embeddings Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we explore the mechanism of class embeddings and have an insight that more explicit and meaningful class embeddings can be generated based on class masks purposely. |
Yuhe Liu; Chuanjian Liu; Kai Han; Quan Tang; Zengchang Qin; | iccv | 2023-09-27 |
967 | Uni-3D: A Universal Model for Panoptic 3D Scene Reconstruction Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Inspired by recent advances in 2D vision that unify image segmentation and detection by Transformer-based models, we present Uni-3D, a holistic 3D scene parsing/reconstruction system for a single RGB image. |
Xiang Zhang; Zeyuan Chen; Fangyin Wei; Zhuowen Tu; | iccv | 2023-09-27 |
968 | Multi-Object Discovery By Low-Dimensional Object Motion Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: We propose to model pixel-wise geometry and object motion to remove ambiguity in reconstructing flow from a single image. |
Sadra Safadoust; Fatma Güney; | iccv | 2023-09-27 |
969 | Two-stage Coarse-to-fine Method for Pathological Images in Medical Decision-making Systems IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Artificial intelligence decision systems play an important supporting role in the field of medical information. Medical image analysis is an important part of decision systems and … |
Keke He; Jun Zhu; Limiao Li; Fangfang Gou; Jia Wu; | IET Image Process. | 2023-09-26 |
970 | Cross-City Matters: A Multimodal Remote Sensing Benchmark Dataset for Cross-City Semantic Segmentation Using High-Resolution Domain Adaptation Networks IF:6 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: Artificial intelligence (AI) approaches nowadays have gained remarkable success in single-modality-dominated remote sensing (RS) applications, especially with an emphasis on individual urban environments (e.g., single cities or regions). Yet these AI models tend to meet the performance bottleneck in the case studies across cities or regions, due to the lack of diverse RS information and cutting-edge solutions with high generalization ability. |
DANFENG HONG et. al. | arxiv-cs.CV | 2023-09-26 |
971 | Weakly Supervised Semantic Segmentation By Knowledge Graph Inference Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Furthermore, CNN-based local convolutions lack the ability to model the extensive inter-category dependencies. Therefore, this paper introduces a graph reasoning-based approach to enhance WSSS. |
Jia Zhang; Bo Peng; Xi Wu; | arxiv-cs.CV | 2023-09-25 |
972 | CLIP-DIY: CLIP Dense Inference Yields Open-Vocabulary Semantic Segmentation For-Free IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Several methods have proposed different modifications and learning schemes to produce dense output. |
Monika Wysoczańska; Michaël Ramamonjisoa; Tomasz Trzciński; Oriane Siméoni; | arxiv-cs.CV | 2023-09-25 |
973 | The Effect of Camera Data Degradation Factors on Panoptic Segmentation for Automated Driving Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Precise scene understanding based on perception sensors’ data is important for assisted and automated driving (AAD) functions, to enable accurate decision-making processes and … |
Yiting Wang; Haonan Zhao; Kurt Debattista; Valentina Donzella; | 2023 IEEE 26th International Conference on Intelligent … | 2023-09-24 |
974 | Scene Parsing Using Fully Convolutional Network for Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: When it comes to computer vision, scene parsing is a crucial part of semantic segmentation. It has a wide range of applications, including autonomous driving, robotics, gaming, … |
Nisar Ali; Ali Zeeshan Ijaz; Raja Hashim Ali; Zain Ul Abideen; Abdul Bais; | 2023 IEEE Canadian Conference on Electrical and Computer … | 2023-09-24 |
975 | A Lightweight RGB-T Fusion Network for Practical Semantic Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Semantic segmentation of RGB-T images is a complex task due to the challenges involved in fusing information from multi-modalities, which requires significant computational … |
Haoyuan Zhang; Zifeng Li; Zhenyu Wu; Danwei Wang; | 2023 IEEE 26th International Conference on Intelligent … | 2023-09-24 |
976 | OneSeg: Self-learning and One-shot Learning Based Single-slice Annotation for 3D Medical Image Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To significantly reduce annotation efforts while attaining competitive segmentation accuracy, we propose a self-learning and one-shot learning based framework for 3D medical image segmentation by annotating only one slice of each 3D image. |
Yixuan Wu; Bo Zheng; Jintai Chen; Danny Z. Chen; Jian Wu; | arxiv-cs.CV | 2023-09-24 |
977 | DCSeg-Net: Driver’s Cognition Based Semantic Segmentation By Multi-Level Feature Extraction Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Autonomous driving requires advanced technology to implement completely self-driving vehicles. Perceptual technologies, such as object detection, segmentation, and depth … |
Seungwoo Nham; Jinho Lee; Shunsuke Kamijo; | 2023 IEEE 26th International Conference on Intelligent … | 2023-09-24 |
978 | LOGICSEG: Parsing Visual Semantics with Neural Logic Learning and Reasoning IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This is in stark contrast to human cognition which abstracts visual perceptions at multiple levels and conducts symbolic reasoning with such structured abstraction. To fill these fundamental gaps, we devise LOGICSEG, a holistic visual semantic parser that integrates neural inductive learning and logic reasoning with both rich data and symbolic knowledge. |
Liulei Li; Wenguan Wang; Yi Yang; | arxiv-cs.CV | 2023-09-24 |
979 | Rewrite Caption Semantics: Bridging Semantic Gaps for Language-Supervised Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: Such semantic misalignment circulates in pre-training, leading to inferior zero-shot performance in dense predictions due to insufficient visual concepts captured in textual representations. To close such semantic gap, we propose Concept Curation (CoCu), a pipeline that leverages CLIP to compensate for the missing semantics. |
YUN XING et. al. | arxiv-cs.CV | 2023-09-23 |
980 | SCDA: A Style and Content Domain Adaptive Semantic Segmentation Method for Remote Sensing Images Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Due to the differences in imaging methods and acquisition areas, remote sensing datasets can exhibit significant variations in both image style and content. In addition, the … |
HONGFENG XIAO et. al. | Remote. Sens. | 2023-09-23 |
981 | ClusterFormer: Clustering As A Universal Visual Learner Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper presents CLUSTERFORMER, a universal vision model that is based on the CLUSTERing paradigm with TransFORMER. |
JAMES C. LIANG et. al. | arxiv-cs.CV | 2023-09-22 |
982 | PuzzleNN: A Neural Network for Image Segmentation Based on Clustering Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In order to create a system for Image Semantic Segmentation that works similarly to the human way of approaching visual problems, we decided to use divide-and-conquer to break an … |
Ada-Astrid Mocanu; Adrian Iftene; | 2023 International Conference on Innovations in Intelligent … | 2023-09-20 |
983 | MoDA: Leveraging Motion Priors from Videos for Advancing Unsupervised Domain Adaptation in Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we consider a more practical UDA setting where the target domain contains sequential frames of the unlabeled videos which are easy to collect in practice. |
Fei Pan; Xu Yin; Seokju Lee; Sungeui Yoon; In So Kweon; | arxiv-cs.CV | 2023-09-20 |
984 | Novel Approach For Scene Semantic Segmentation Using The Recurrent-Based UNET Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In this paper, we propose a novel approach for semantic segmentation based on Deep Learning methods for Autonomous Driving. Our main idea is to develop a fully convolutional … |
Ahmed Khlifi; Mohamed Othmani; M. Kherallah; | 2023 International Conference on Innovations in Intelligent … | 2023-09-20 |
985 | Intelligent Debris Mass Estimation Model for Autonomous Underwater Vehicle Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we use instance segmentation to calculate the area of individual objects within an image, we use YOLOV7 in Roboflow to generate a set of bounding boxes for each object in the image with a class label and a confidence score for every detection. |
Mohana Sri S; Swethaa S; Aouthithiye Barathwaj SR Y; Sai Ganesh CS; | arxiv-cs.CV | 2023-09-19 |
986 | CaveSeg: Deep Semantic Segmentation and Scene Parsing for Autonomous Underwater Cave Exploration Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we present CaveSeg – the first visual learning pipeline for semantic segmentation and scene parsing for AUV navigation inside underwater caves. |
A. ABDULLAH et. al. | arxiv-cs.RO | 2023-09-19 |
987 | DFormer: Rethinking RGBD Representation Learning for Semantic Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: We present DFormer, a novel RGB-D pretraining framework to learn transferable representations for RGB-D segmentation tasks. |
BOWEN YIN et. al. | arxiv-cs.CV | 2023-09-18 |
988 | Prompt Engineering in Medical Image Segmentation: An Overview of The Paradigm Shift Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Foundation AI models have emerged as powerful pre-trained models on a large scale, capable of seamlessly handling diverse tasks across multiple domains with minimal or no … |
Hazrat Ali; Mohammad Farhad Bulbul; Zubair Shah; | 2023 IEEE International Conference on Artificial … | 2023-09-16 |
989 | GCL: Gradient-Guided Contrastive Learning for Medical Image Segmentation with Multi-Perspective Meta Labels Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: In this paper, we tackle the issue of semantic contradiction in a gradient-guided manner using our proposed Gradient Mitigator method, which systematically unifies multi-perspective meta labels to enable a pre-trained model to attain a better high-level semantic recognition ability. |
YIXUAN WU et. al. | arxiv-cs.CV | 2023-09-16 |
990 | MA-SAM: Modality-agnostic SAM Adaptation for 3D Medical Image Segmentation IF:3 Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we introduce a modality-agnostic SAM adaptation framework, named as MA-SAM, that is applicable to various volumetric and video medical data. |
CHENG CHEN et. al. | arxiv-cs.CV | 2023-09-15 |
991 | Dynamic Weight HiLo Attention Network for Medical Image Multiple Organ Segmentation Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: In recent years, there has been a surge of research in the field of medical image segmentation using hybrid CNN‐Transformer network architectures. Most of these studies leverage … |
Yiyang Zhao; Jinjiang Li; Yepeng Liu; | International Journal of Imaging Systems and Technology | 2023-09-14 |
992 | JSMNet Improving Indoor Point Cloud Semantic and Instance Segmentation Through Self-Attention and Multiscale Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: To this end, we propose JSMNet, which combines a multi-layer network with a global feature self-attention module to jointly segment three-dimensional point cloud semantics and instances. |
Shuochen Xu; Zhenxin Zhang; | arxiv-cs.CV | 2023-09-14 |
993 | A Handheld LiDAR-Based Semantic Automatic Segmentation Method for Complex Railroad Line Model Reconstruction Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: To ensure efficient railroad operation and maintenance management, the accurate reconstruction of railroad BIM models is a crucial step. This paper proposes a workflow for … |
Junjie Chen; Qian Su; Yunbin Niu; Zongyu Zhang; Jinghao Liu; | Remote. Sens. | 2023-09-13 |
994 | Real-Time Semantic Segmentation: A Brief Survey and Comparative Study in Remote Sensing Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Real-time semantic segmentation of remote sensing imagery is a challenging task that requires a tradeoff between effectiveness and efficiency. It has many applications, including … |
Clifford Broni-Bediako; Junshi Xia; N. Yokoya; | IEEE Geoscience and Remote Sensing Magazine | 2023-09-12 |
995 | Real-Time Semantic Segmentation: A Brief Survey & Comparative Study in Remote Sensing Related Papers Related Patents Related Grants Related Venues Related Experts View Highlight: This paper begins with a summary of the fundamental compression methods for designing efficient deep neural networks and provides a brief but comprehensive survey, outlining the recent developments in real-time semantic segmentation of remote sensing imagery. We examine several seminal efficient deep learning methods, placing them in a taxonomy based on the network architecture design approach. |
Clifford Broni-Bediako; Junshi Xia; Naoto Yokoya; | arxiv-cs.CV | 2023-09-12 |
996 | Self-Correlation and Cross-Correlation Learning for Few-Shot Remote Sensing Image Semantic Segmentation Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: To further explore the self-correlation with the query image, we propose to adopt a classical spectral method to produce a class-agnostic segmentation mask based on the basic visual information of the image. |
LINHAN WANG et. al. | arxiv-cs.CV | 2023-09-11 |
997 | Panoptic Vision-Language Feature Fields Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this paper, we propose to the best of our knowledge the first algorithm for open-vocabulary panoptic segmentation in 3D scenes. |
Haoran Chen; Kenneth Blomqvist; Francesco Milano; Roland Siegwart; | arxiv-cs.CV | 2023-09-11 |
998 | Towards Content-based Pixel Retrieval in Revisited Oxford and Paris Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: This paper introduces the first two pixel retrieval benchmarks. |
GUOYUAN AN et. al. | arxiv-cs.CV | 2023-09-11 |
999 | Learning Semantic Segmentation with Query Points Supervision on Aerial Images Related Papers Related Patents Related Grants Related Venues Related Experts Related Code View Highlight: In this work, we present a weakly supervised learning algorithm to train semantic segmentation algorithms that only rely on query point annotations instead of full mask labels. |
Santiago Rivier; Carlos Hinojosa; Silvio Giancola; Bernard Ghanem; | arxiv-cs.CV | 2023-09-11 |
1000 | A Deep Learning Network for Individual Tree Segmentation in UAV Images with A Coupled CSPNet and Attention Mechanism IF:3 Summary Related Papers Related Patents Related Grants Related Venues Related Experts View Abstract: Accurate individual tree detection by unmanned aerial vehicles (UAVs) is a critical technique for smart forest management and serves as the foundation for evaluating ecological … |
LUJIN LV et. al. | Remote. Sens. | 2023-09-08 |