Related papers: Distinct counting with a self-learning bitmap

Discrete Scale-invariant Metric Learning for Efficient Collaborative Filtering

Metric learning has attracted extensive interest for its ability to provide personalized recommendations based on the importance of observed user-item interactions. Current metric learning methods aim to push negative items away from the…

Information Retrieval · Computer Science 2025-12-24 Yan Zhang , Li Deng , Lixin Duan , Sami Azam

Monotonic Cardinality Estimation of Similarity Selection: A Deep Learning Approach

Due to the outstanding capability of capturing underlying data distributions, deep learning techniques have been recently utilized for a series of traditional database problems. In this paper, we investigate the possibilities of utilizing…

Databases · Computer Science 2021-09-27 Yaoshu Wang , Chuan Xiao , Jianbin Qin , Xin Cao , Yifang Sun , Wei Wang , Makoto Onizuka

Weakly supervised person search aims to jointly detect and match persons with only bounding box annotations. Existing approaches typically focus on improving the features by exploring relations of persons. However, scale variation problem…

Computer Vision and Pattern Recognition · Computer Science 2024-01-04 Benzhi Wang , Yang Yang , Jinlin Wu , Guo-jun Qi , Zhen Lei

Cardinality Estimation in a Virtualized Network Device Using Online Machine Learning

Cardinality estimation algorithms receive a stream of elements, with possible repetitions, and return the number of distinct elements in the stream. Such algorithms seek to minimize the required memory and CPU resource consumption at the…

Networking and Internet Architecture · Computer Science 2019-03-15 Reuven Cohen , Yuval Nezri

DISCO: accurate Discrete Scale Convolutions

Scale is often seen as a given, disturbing factor in many vision tasks. When doing so it is one of the factors why we need more data during learning. In recent work scale equivariance was added to convolutional neural networks. It was shown…

Computer Vision and Pattern Recognition · Computer Science 2021-11-02 Ivan Sosnovik , Artem Moskalev , Arnold Smeulders

Scale-Invariant Convolutional Neural Networks

Even though convolutional neural networks (CNN) has achieved near-human performance in various computer vision tasks, its ability to tolerate scale variations is limited. The popular practise is making the model bigger first, and then train…

Computer Vision and Pattern Recognition · Computer Science 2014-11-25 Yichong Xu , Tianjun Xiao , Jiaxing Zhang , Kuiyuan Yang , Zheng Zhang

A statistical analysis of probabilistic counting algorithms

This paper considers the problem of cardinality estimation in data stream applications. We present a statistical analysis of probabilistic counting algorithms, focusing on two techniques that use pseudo-random variates to form…

Computation · Statistics 2012-11-20 Peter Clifford , Ioana A. Cosma

QSketch: An Efficient Sketch for Weighted Cardinality Estimation in Streams

Estimating cardinality, i.e., the number of distinct elements, of a data stream is a fundamental problem in areas like databases, computer networks, and information retrieval. This study delves into a broader scenario where each element…

Databases · Computer Science 2024-06-28 Yiyan Qi , Rundong Li , Pinghui Wang , Yufang Sun , Rui Xing

A Recurrent Neural Network based Clustering Method for Binary Data Sets in Education

This paper studies an application of a recurrent neural network to clustering method for the S-P chart: a binary data set used widely in education. As the number of students increases, the S-P chart becomes hard to handle. In order to…

Machine Learning · Computer Science 2025-08-20 Mizuki Ohira , Toshimichi Saito

ByteCard: Enhancing ByteDance's Data Warehouse with Learned Cardinality Estimation

Cardinality estimation is a critical component and a longstanding challenge in modern data warehouses. ByteHouse, ByteDance's cloud-native engine for extensive data analysis in exabyte-scale environments, serves numerous internal…

Databases · Computer Science 2024-04-12 Yuxing Han , Haoyu Wang , Lixiang Chen , Yifeng Dong , Xing Chen , Benquan Yu , Chengcheng Yang , Weining Qian

DISCount: Counting in Large Image Collections with Detector-Based Importance Sampling

Many modern applications use computer vision to detect and count objects in massive image collections. However, when the detection task is very difficult or in the presence of domain shifts, the counts may be inaccurate even with…

Computer Vision and Pattern Recognition · Computer Science 2023-06-07 Gustavo Perez , Subhransu Maji , Daniel Sheldon

Scale-Net: Learning to Reduce Scale Differences for Large-Scale Invariant Image Matching

Most image matching methods perform poorly when encountering large scale changes in images. To solve this problem, firstly, we propose a scale-difference-aware image matching method (SDAIM) that reduces image scale differences before local…

Computer Vision and Pattern Recognition · Computer Science 2021-12-21 Yujie Fu , Yihong Wu

Self-Damaging Contrastive Learning

The recent breakthrough achieved by contrastive learning accelerates the pace for deploying unsupervised training on real-world data applications. However, unlabeled data in reality is commonly imbalanced and shows a long-tail distribution,…

Computer Vision and Pattern Recognition · Computer Science 2021-06-08 Ziyu Jiang , Tianlong Chen , Bobak Mortazavi , Zhangyang Wang

Utilizing Dynamic Properties of Sharing Bits and Registers to Estimate User Cardinalities over Time

Online monitoring user cardinalities (or degrees) in graph streams is fundamental for many applications. For example in a bipartite graph representing user-website visiting activities, user cardinalities (the number of distinct visited…

Data Structures and Algorithms · Computer Science 2018-11-27 Pinghui Wang , Peng Jia , Xiangliang Zhang , Jing Tao , Xiaohong Guan , Don Towsley

Random Binary Mappings for Kernel Learning and Efficient SVM

Support Vector Machines (SVMs) are powerful learners that have led to state-of-the-art results in various computer vision problems. SVMs suffer from various drawbacks in terms of selecting the right kernel, which depends on the image…

Computer Vision and Pattern Recognition · Computer Science 2014-03-31 Gemma Roig , Xavier Boix , Luc Van Gool

Scale-invariant unconstrained online learning

We consider a variant of online convex optimization in which both the instances (input vectors) and the comparator (weight vector) are unconstrained. We exploit a natural scale invariance symmetry in our unconstrained setting: the…

Machine Learning · Computer Science 2017-08-24 Wojciech Kotłowski

Private Synthetic Data for Multitask Learning and Marginal Queries

We provide a differentially private algorithm for producing synthetic data simultaneously useful for multiple tasks: marginal queries and multitask machine learning (ML). A key innovation in our algorithm is the ability to directly handle…

Machine Learning · Computer Science 2022-09-16 Giuseppe Vietri , Cedric Archambeau , Sergul Aydore , William Brown , Michael Kearns , Aaron Roth , Ankit Siva , Shuai Tang , Zhiwei Steven Wu

Differentiable Random Partition Models

Partitioning a set of elements into an unknown number of mutually exclusive subsets is essential in many machine learning problems. However, assigning elements, such as samples in a dataset or neurons in a network layer, to an unknown and…

Machine Learning · Computer Science 2023-11-10 Thomas M. Sutter , Alain Ryser , Joram Liebeskind , Julia E. Vogt

Interlayer and Intralayer Scale Aggregation for Scale-invariant Crowd Counting

Crowd counting is an important vision task, which faces challenges on continuous scale variation within a given scene and huge density shift both within and across images. These challenges are typically addressed using multi-column…

Computer Vision and Pattern Recognition · Computer Science 2020-05-26 Mingjie Wang , Hao Cai , Jun Zhou , Minglun Gong

Scalable Discrete Supervised Hash Learning with Asymmetric Matrix Factorization

Hashing method maps similar data to binary hashcodes with smaller hamming distance, and it has received a broad attention due to its low storage cost and fast retrieval speed. However, the existing limitations make the present algorithms…

Computer Vision and Pattern Recognition · Computer Science 2016-09-29 Shifeng Zhang , Jianmin Li , Jinma Guo , Bo Zhang