Related papers: Engineering A Large Language Model From Scratch

200 papers

With a focus on natural language processing (NLP) and the role of large language models (LLMs), we explore the intersection of machine learning, deep learning, and artificial intelligence. As artificial intelligence continues to…

In recent years, Natural Language Processing (NLP) models have achieved phenomenal success in linguistic and semantic tasks like text classification, machine translation, cognitive dialogue systems, information retrieval via Natural…

Computation and Language · Computer Science 2021-05-18 Sushant Singh, Ausif Mahmood

Natural language processing (NLP) enables the understanding and generation of meaningful human language, typically using a pre-trained complex architecture on a large dataset to learn the language and then fine-tune its weights to implement…

Computation and Language · Computer Science 2025-09-04 Yarden Tzach, Ronit D. Gross, Ella Koresh, Shalom Rosner, Or Shpringer, Tal Halevi, Ido Kanter

In recent years, advancements in natural language processing (NLP) have been fueled by deep learning techniques, particularly through the utilization of powerful computing resources like GPUs and TPUs. Models such as BERT and GPT-3, trained…

Computation and Language · Computer Science 2024-02-27 Shuning Huo, Yafei Xiang, Hanyi Yu, Mengran Zhu, Yulu Gong

Natural Language Processing (NLP) has witnessed a transformative leap with the advent of transformer-based architectures, which have significantly enhanced the ability of machines to understand and generate human-like text. This paper…

Computation and Language · Computer Science 2025-03-27 Tianhao Wu, Yu Wang, Ngoc Quach

Natural Language Processing (NLP) helps empower intelligent machines by enhancing a better understanding of the human language for linguistic-based human-computer communication. Recent developments in computational power and the advent of…

Computation and Language · Computer Science 2021-03-02 Amirsina Torfi, Rouzbeh A. Shirvani, Yaser Keneshloo, Nader Tavaf, Edward A. Fox

Natural Language Understanding (NLU) is a branch of Natural Language Processing (NLP) that uses intelligent computer software to understand texts that encode human knowledge. Recent years have witnessed notable progress across various NLU…

Computation and Language · Computer Science 2022-03-01 Xinliang Frederick Zhang

Natural language processing (NLP) can be done using either top-down (theory-driven) or bottom-up (data-driven) approaches, which we call mechanistic and phenomenological respectively. The approaches are frequently considered to stand in…

Computation and Language · Computer Science 2019-03-26 Simon Dobnik, John D. Kelleher

Transformer-based Language Models have become ubiquitous in Natural Language Processing (NLP) due to their impressive performance on various tasks. However, expensive training as well as inference remains a significant impediment to their…

Machine Learning · Computer Science 2024-06-06 Amit Dhurandhar, Tejaswini Pedapati, Ronny Luss, Soham Dan, Aurelie Lozano, Payel Das, Georgios Kollias

Deep Learning methods employ multiple processing layers to learn hierarchical representations of data. They have already been deployed in a humongous number of applications and have produced state-of-the-art results. Recently with the growth…

Computation and Language · Computer Science 2018-08-15 Sarvesh Patil

Natural Language Processing (NLP) has become one of the leading application areas in the current Artificial Intelligence boom. Transfer learning has enabled large deep learning neural networks trained on the language modeling task to vastly…

Computation and Language · Computer Science 2022-06-16 Csaba Veres

The design of complex engineering systems is an often long and articulated process that highly relies on engineers' expertise and professional judgment. As such, the typical pitfalls of activities involving the human factor often manifest…

Computation and Language · Computer Science 2022-11-22 Shaohong Zhong, Andrea Scarinci, Alice Cicirello

We propose a unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including: part-of-speech tagging, chunking, named entity recognition, and semantic role labeling. This…

Machine Learning · Computer Science 2011-03-03 Ronan Collobert, Jason Weston, Leon Bottou, Michael Karlen, Koray Kavukcuoglu, Pavel Kuksa

Boosted by deep learning, natural language processing (NLP) techniques have recently seen spectacular progress, mainly fueled by breakthroughs both in representation learning with word embeddings (e.g. word2vec) as well as novel…

Networking and Internet Architecture · Computer Science 2022-07-26 Zied Ben Houidi, Dario Rossi

Recent advancements in Large Language Models (LLMs), particularly those built on Transformer architectures, have significantly broadened the scope of natural language processing (NLP) applications, transcending their initial use in chatbot…

Computation and Language · Computer Science 2024-05-29 Chen Wang, Jin Zhao, Jiaqi Gong

Artificial intelligence (AI) has achieved human-level performance in specialized tasks such as Go, image recognition, and protein folding, raising the prospect of an AI singularity, where machines not only match but surpass human reasoning.…

Fluid Dynamics · Physics 2025-12-30 Zhongxin Yang, Yuanwei Bin, Yipeng Shi, Xiang I. A. Yang

Large language models (LLMs) have revolutionized natural language processing (NLP) by excelling at understanding and generating human-like text. However, their widespread deployment can be prohibitively expensive. SortedNet is a recent…

Computation and Language · Computer Science 2024-02-12 Parsa Kavehzadeh, Mojtaba Valipour, Marzieh Tahaei, Ali Ghodsi, Boxing Chen, Mehdi Rezagholizadeh

Transformer is a state-of-the-art model in the field of natural language processing (NLP). Current NLP models primarily increase the number of transformers to improve processing performance. However, this technique requires a lot of…

Computation and Language · Computer Science 2023-10-18 Woohyeon Moon, Taeyoung Kim, Bumgeun Park, Dongsoo Har

The performance of Neural Network (NN)-based language models is steadily improving due to the emergence of new architectures, which are able to learn different natural language characteristics. This paper presents a novel framework, which…

Computation and Language · Computer Science 2017-08-24 Youssef Oualil, Dietrich Klakow

Natural Language Processing (NLP) is a key technique for developing Medical Artificial Intelligence (AI) systems that leverage Electronic Health Record (EHR) data to build diagnostic and prognostic models. NLP enables the conversion of…
