English
Related papers

Related papers: Algorithms for Massive Data -- Lecture Notes

200 papers

This lecture note is intended to prepare early-year master's and PhD students in data science or a related discipline with foundational ideas in machine learning. It starts with basic ideas in modern machine learning with classification as…

Machine Learning · Computer Science 2025-05-08 Kyunghyun Cho

These notes were compiled as lecture notes for a course developed and taught at the University of the Southern California. They should be accessible to a typical engineering graduate student with a strong background in Applied Mathematics.…

Machine Learning · Computer Science 2023-01-04 Deep Ray , Orazio Pinti , Assad A. Oberai

Many key problems in machine learning and data science are routinely modeled as optimization problems and solved via optimization algorithms. With the increase of the volume of data and the size and complexity of the statistical models used…

Optimization and Control · Mathematics 2020-08-28 Filip Hanzely

The problem of high-dimensional and large-scale representation of visual data is addressed from an unsupervised learning perspective. The emphasis is put on discrete representations, where the description length can be measured in bits and…

Machine Learning · Computer Science 2019-01-25 Sohrab Ferdowsi

This article considers "compressive learning," an approach to large-scale machine learning where datasets are massively compressed before learning (e.g., clustering, classification, or regression) is performed. In particular, a "sketch" is…

These are lecture notes based on the first part of a course on 'Mathematical Data Science', which I taught to final year BSc students in the UK in 2019-2020. Topics include: concentration of measure in high dimensions; Gaussian random…

Functional Analysis · Mathematics 2024-09-24 Sven-Ake Wegner

Neural compression is the application of neural networks and other machine learning methods to data compression. Recent advances in statistical machine learning have opened up new possibilities for data compression, allowing compression…

Machine Learning · Computer Science 2023-08-22 Yibo Yang , Stephan Mandt , Lucas Theis

We introduce an algorithm to reduce large data sets using so-called digital nets, which are well distributed point sets in the unit cube. These point sets together with weights, which depend on the data set, are used to represent the data.…

Numerical Analysis · Mathematics 2021-05-31 Josef Dick , Michael Feischl

This work consists of a study of a set of techniques and strategies related with algorithm's design, whose purpose is the resolution of problems on massive data sets, in an efficient way. This field is known as Algorithms for Big Data. In…

Data Structures and Algorithms · Computer Science 2017-08-29 Sergio García Prado

These lecture notes focus on the recent advancements in neural information retrieval, with particular emphasis on the systems and models exploiting transformer networks. These networks, originally proposed by Google in 2017, have seen a…

Information Retrieval · Computer Science 2022-09-13 Nicola Tonellotto

Algorithmic information theory roots the concept of information in computation rather than probability. These lecture notes were constructed in conjunction with the graduate course I taught at Universit\`a della Svizzera italiana in the…

Information Theory · Computer Science 2025-04-29 Charles Alexandre Bédard

This is a set of lecture notes used in a graduate topic class in applied mathematics called ``Quantum Algorithms for Scientific Computation'' at the Department of Mathematics, UC Berkeley during the fall semester of 2021. These lecture…

Quantum Physics · Physics 2022-01-21 Lin Lin

These lecture notes aim to provide a clear and comprehensive introduction to using open quantum system theory for quantum algorithms. The main arguments are Variational Quantum Algorithms, Quantum Error Correction, Dynamical Decoupling and…

Quantum Physics · Physics 2024-06-18 Matteo Carlesso

In general, large datasets enable deep learning models to perform with good accuracy and generalizability. However, massive high-fidelity simulation datasets (from molecular chemistry, astrophysics, computational fluid dynamics (CFD), etc.…

Machine Learning · Computer Science 2022-07-27 Wai Tong Chung , Ki Sung Jung , Jacqueline H. Chen , Matthias Ihme

This article reviews recent advances in convex optimization algorithms for Big Data, which aim to reduce the computational, storage, and communications bottlenecks. We provide an overview of this emerging field, describe contemporary…

Optimization and Control · Mathematics 2014-11-05 Volkan Cevher , Stephen Becker , Mark Schmidt

Big-data applications often involve a vast number of observations and features, creating new challenges for variable selection and parameter estimation. This paper presents a novel technique called ``slow kill,'' which utilizes nonconvex…

Machine Learning · Statistics 2023-05-04 Yiyuan She , Jianhui Shen , Adrian Barbu

As data volume grows extensively, data profiling helps to extract metadata of large-scale data. However, one kind of metadata, order statistics, is difficult to be computed because they are not mergeable or incremental. Thus, the limitation…

Data Structures and Algorithms · Computer Science 2020-06-29 Zhiwei Chen , Aoqian Zhang

Big data are data on a massive scale in terms of volume, intensity, and complexity that exceed the capacity of standard software tools. They present opportunities as well as challenges to statisticians. The role of computational…

Computation · Statistics 2018-06-13 Chun Wang , Ming-Hui Chen , Elizabeth Schifano , Jing Wu , Jun Yan

Deep learning have achieved promising results on a wide spectrum of AI applications. Larger datasets and models consistently yield better performance. However, we generally spend longer training time on more computation and communication.…

Machine Learning · Computer Science 2021-11-03 Xiaoxin He , Fuzhao Xue , Xiaozhe Ren , Yang You

An approach to the classification problem of machine learning, based on building local classification rules, is developed. The local rules are considered as projections of the global classification rules to the event we want to classify. A…

Machine Learning · Computer Science 2007-05-23 Vladislav Malyshkin , Ray Bakhramov , Andrey Gorodetsky
‹ Prev 1 2 3 10 Next ›