Related papers: Spatial-Semantic Collaborative Cropping for User G…

Semantic Image Cropping

Automatic image cropping techniques are commonly used to enhance the aesthetic quality of an image; they do it by detecting the most beautiful or the most salient parts of the image and removing the unwanted content to have a smaller image…

Computer Vision and Pattern Recognition · Computer Science 2021-07-16 Oriol Corcoll

U2Net: A General Framework with Spatial-Spectral-Integrated Double U-Net for Image Fusion

In image fusion tasks, images obtained from different sources exhibit distinct properties. Consequently, treating them uniformly with a single-branch network can lead to inadequate feature extraction. Additionally, numerous works have…

Image and Video Processing · Electrical Eng. & Systems 2023-10-03 Siran Peng , Chenhao Guo , Xiao Wu , Liang-Jian Deng

An Experience-based Direct Generation approach to Automatic Image Cropping

Automatic Image Cropping is a challenging task with many practical downstream applications. The task is often divided into sub-problems - generating cropping candidates, finding the visually important regions, and determining aesthetics to…

Computer Vision and Pattern Recognition · Computer Science 2023-01-02 Casper Christensen , Aneesh Vartakavi

Stacked U-Nets: A No-Frills Approach to Natural Image Segmentation

Many imaging tasks require global information about all pixels in an image. Conventional bottom-up classification networks globalize information by decreasing resolution; features are pooled and downsampled into a single output. But for…

Computer Vision and Pattern Recognition · Computer Science 2018-04-30 Sohil Shah , Pallabi Ghosh , Larry S Davis , Tom Goldstein

Dual Graph Convolutional Network for Semantic Segmentation

Exploiting long-range contextual information is key for pixel-wise prediction tasks such as semantic segmentation. In contrast to previous work that uses multi-scale feature fusion or dilated convolutions, we propose a novel…

Computer Vision and Pattern Recognition · Computer Science 2020-08-27 Li Zhang , Xiangtai Li , Anurag Arnab , Kuiyuan Yang , Yunhai Tong , Philip H. S. Torr

Semantics-enhanced Temporal Graph Networks for Content Popularity Prediction

The surging demand for high-definition video streaming services and large neural network models (e.g., Generative Pre-trained Transformer, GPT) implies a tremendous explosion of Internet traffic. To mitigate the traffic pressure,…

Artificial Intelligence · Computer Science 2023-03-15 Jianhang Zhu , Rongpeng Li , Xianfu Chen , Shiwen Mao , Jianjun Wu , Zhifeng Zhao

Sampled Image Tagging and Retrieval Methods on User Generated Content

Traditional image tagging and retrieval algorithms have limited value as a result of being trained with heavily curated datasets. These limitations are most evident when arbitrary search words are used that do not intersect with training…

Computer Vision and Pattern Recognition · Computer Science 2016-12-05 Karl Ni , Kyle Zaragoza , Charles Foster , Carmen Carrano , Barry Chen , Yonas Tesfaye , Alex Gude

On Quantifying Qualitative Geospatial Data: A Probabilistic Approach

Living in the era of data deluge, we have witnessed a web content explosion, largely due to the massive availability of User-Generated Content (UGC). In this work, we specifically consider the problem of geospatial information extraction…

Databases · Computer Science 2013-11-21 Georgios Skoumas , Dieter Pfoser , Anastasios Kyrillidis

Spatial Graph Convolutional Networks

Graph Convolutional Networks (GCNs) have recently become the primary choice for learning from graph-structured data, superseding hash fingerprints in representing chemical compounds. However, GCNs lack the ability to take into account the…

Machine Learning · Computer Science 2020-07-03 Tomasz Danel , Przemysław Spurek , Jacek Tabor , Marek Śmieja , Łukasz Struski , Agnieszka Słowik , Łukasz Maziarka

S2GS: Streaming Semantic Gaussian Splatting for Online Scene Understanding and Reconstruction

Existing offline feed-forward methods for joint scene understanding and reconstruction on long image streams often repeatedly perform global computation over an ever-growing set of past observations, causing runtime and GPU memory to…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Renhe Zhang , Yuyang Tan , Jingyu Gong , Zhizhong Zhang , Lizhuang Ma , Yuan Xie , Xin Tan

Repurposing Existing Deep Networks for Caption and Aesthetic-Guided Image Cropping

We propose a novel optimization framework that crops a given image based on user description and aesthetics. Unlike existing image cropping methods, where one typically trains a deep network to regress to crop parameters or cropping…

Computer Vision and Pattern Recognition · Computer Science 2022-01-10 Nora Horanyi , Kedi Xia , Kwang Moo Yi , Abhishake Kumar Bojja , Ales Leonardis , Hyung Jin Chang

Automatic Image Cropping for Visual Aesthetic Enhancement Using Deep Neural Networks and Cascaded Regression

Despite recent progress, computational visual aesthetic is still challenging. Image cropping, which refers to the removal of unwanted scene areas, is an important step to improve the aesthetic quality of an image. However, it is challenging…

Computer Vision and Pattern Recognition · Computer Science 2018-01-16 Guanjun Guo , Hanzi Wang , Chunhua Shen , Yan Yan , Hong-Yuan Mark Liao

S2RC-GCN: A Spatial-Spectral Reliable Contrastive Graph Convolutional Network for Complex Land Cover Classification Using Hyperspectral Images

Spatial correlations between different ground objects are an important feature of mining land cover research. Graph Convolutional Networks (GCNs) can effectively capture such spatial feature representations and have demonstrated promising…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Renxiang Guan , Zihao Li , Chujia Song , Guo Yu , Xianju Li , Ruyi Feng

Image Cropping with Composition and Saliency Aware Aesthetic Score Map

Aesthetic image cropping is a practical but challenging task which aims at finding the best crops with the highest aesthetic quality in an image. Recently, many deep learning methods have been proposed to address this problem, but they did…

Computer Vision and Pattern Recognition · Computer Science 2019-11-26 Yi Tu , Li Niu , Weijie Zhao , Dawei Cheng , Liqing Zhang

Image Captioning with Semantic Attention

Automatically generating a natural language description of an image has attracted interests recently both because of its importance in practical applications and because it connects two major artificial intelligence fields: computer vision…

Computer Vision and Pattern Recognition · Computer Science 2016-03-15 Quanzeng You , Hailin Jin , Zhaowen Wang , Chen Fang , Jiebo Luo

simNet: Stepwise Image-Topic Merging Network for Generating Detailed and Comprehensive Image Captions

The encode-decoder framework has shown recent success in image captioning. Visual attention, which is good at detailedness, and semantic attention, which is good at comprehensiveness, have been separately proposed to ground the caption on…

Computation and Language · Computer Science 2018-08-28 Fenglin Liu , Xuancheng Ren , Yuanxin Liu , Houfeng Wang , Xu Sun

Region2Vec: Community Detection on Spatial Networks Using Graph Embedding with Node Attributes and Spatial Interactions

Community Detection algorithms are used to detect densely connected components in complex networks and reveal underlying relationships among components. As a special type of networks, spatial networks are usually generated by the…

Social and Information Networks · Computer Science 2022-10-18 Yunlei Liang , Jiawei Zhu , Wen Ye , Song Gao

Semi-Global Shape-aware Network

Non-local operations are usually used to capture long-range dependencies via aggregating global context to each position recently. However, most of the methods cannot preserve object shapes since they only focus on feature similarity but…

Computer Vision and Pattern Recognition · Computer Science 2020-12-18 Pengju Zhang , Yihong Wu , Jiagang Zhu

Generative Semantic Communication for Joint Image Transmission and Segmentation

Semantic communication has emerged as a promising technology for enhancing communication efficiency. However, most existing research emphasizes single-task reconstruction, neglecting model adaptability and generalization across multi-task…

Information Theory · Computer Science 2025-04-01 Weiwen Yuan , Jinke Ren , Chongjie Wang , Ruichen Zhang , Jun Wei , Dong In Kim , Shuguang Cui

Superpixel Semantics Representation and Pre-training for Vision-Language Task

The key to integrating visual language tasks is to establish a good alignment strategy. Recently, visual semantic representation has achieved fine-grained visual understanding by dividing grids or image patches. However, the coarse-grained…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Siyu Zhang , Yeming Chen , Yaoru Sun , Fang Wang , Jun Yang , Lizhi Bai , Shangce Gao