Related papers: Engineering A Large Language Model From Scratch

Deep Learning and Machine Learning -- Natural Language Processing: From Theory to Application

With a focus on natural language processing (NLP) and the role of large language models (LLMs), we explore the intersection of machine learning, deep learning, and artificial intelligence. As artificial intelligence continues to…

Computation and Language · Computer Science 2025-11-19 Keyu Chen , Cheng Fei , Ziqian Bi , Junyu Liu , Benji Peng , Sen Zhang , Xuanhe Pan , Jiawei Xu , Jinlang Wang , Caitlyn Heqi Yin , Yichao Zhang , Pohsun Feng , Yizhu Wen , Tianyang Wang , Ming Li , Jintao Ren , Qian Niu , Silin Chen , Weiche Hsieh , Lawrence K. Q. Yan , Chia Xin Liang , Han Xu , Hong-Ming Tseng , Xinyuan Song , Zekun Jiang , Ming Liu

The NLP Cookbook: Modern Recipes for Transformer based Deep Learning Architectures

In recent years, Natural Language Processing (NLP) models have achieved phenomenal success in linguistic and semantic tasks like text classification, machine translation, cognitive dialogue systems, information retrieval via Natural…

Computation and Language · Computer Science 2021-05-18 Sushant Singh , Ausif Mahmood

Learning Mechanism Underlying NLP Pre-Training and Fine-Tuning

Natural language processing (NLP) enables the understanding and generation of meaningful human language, typically using a pre-trained complex architecture on a large dataset to learn the language and next fine-tune its weights to implement…

Computation and Language · Computer Science 2025-09-04 Yarden Tzach , Ronit D. Gross , Ella Koresh , Shalom Rosner , Or Shpringer , Tal Halevi , Ido Kanter

Deep Learning Approaches for Improving Question Answering Systems in Hepatocellular Carcinoma Research

In recent years, advancements in natural language processing (NLP) have been fueled by deep learning techniques, particularly through the utilization of powerful computing resources like GPUs and TPUs. Models such as BERT and GPT-3, trained…

Computation and Language · Computer Science 2024-02-27 Shuning Huo , Yafei Xiang , Hanyi Yu , Mengran Zhu , Yulu Gong

Advancements in Natural Language Processing: Exploring Transformer-Based Architectures for Text Understanding

Natural Language Processing (NLP) has witnessed a transformative leap with the advent of transformer-based architectures, which have significantly enhanced the ability of machines to understand and generate human-like text. This paper…

Computation and Language · Computer Science 2025-03-27 Tianhao Wu , Yu Wang , Ngoc Quach

Natural Language Processing Advancements By Deep Learning: A Survey

Natural Language Processing (NLP) helps empower intelligent machines by enhancing a better understanding of the human language for linguistic-based human-computer communication. Recent developments in computational power and the advent of…

Computation and Language · Computer Science 2021-03-02 Amirsina Torfi , Rouzbeh A. Shirvani , Yaser Keneshloo , Nader Tavaf , Edward A. Fox

Towards More Robust Natural Language Understanding

Natural Language Understanding (NLU) is a branch of Natural Language Processing (NLP) that uses intelligent computer software to understand texts that encode human knowledge. Recent years have witnessed notable progress across various NLU…

Computation and Language · Computer Science 2022-03-01 Xinliang Frederick Zhang

Modular Mechanistic Networks: On Bridging Mechanistic and Phenomenological Models with Deep Neural Networks in Natural Language Processing

Natural language processing (NLP) can be done using either top-down (theory driven) and bottom-up (data driven) approaches, which we call mechanistic and phenomenological respectively. The approaches are frequently considered to stand in…

Computation and Language · Computer Science 2019-03-26 Simon Dobnik , John D. Kelleher

NeuroPrune: A Neuro-inspired Topological Sparse Training Algorithm for Large Language Models

Transformer-based Language Models have become ubiquitous in Natural Language Processing (NLP) due to their impressive performance on various tasks. However, expensive training as well as inference remains a significant impediment to their…

Machine Learning · Computer Science 2024-06-06 Amit Dhurandhar , Tejaswini Pedapati , Ronny Luss , Soham Dan , Aurelie Lozano , Payel Das , Georgios Kollias

Deep Learning Based Natural Language Processing for End to End Speech Translation

Deep Learning methods employ multiple processing layers to learn hierarchial representations of data. They have already been deployed in a humongous number of applications and have produced state-of-the-art results. Recently with the growth…

Computation and Language · Computer Science 2018-08-15 Sarvesh Patil

Large Language Models are not Models of Natural Language: they are Corpus Models

Natural Language Processing (NLP) has become one of the leading application areas in the current Artificial Intelligence boom. Transfer learning has enabled large deep learning neural networks trained on the language modeling task to vastly…

Computation and Language · Computer Science 2022-06-16 Csaba Veres

Natural Language Processing for Systems Engineering: Automatic Generation of Systems Modelling Language Diagrams

The design of complex engineering systems is an often long and articulated process that highly relies on engineers' expertise and professional judgment. As such, the typical pitfalls of activities involving the human factor often manifest…

Computation and Language · Computer Science 2022-11-22 Shaohong Zhong , Andrea Scarinci , Alice Cicirello

Natural Language Processing (almost) from Scratch

We propose a unified neural network architecture and learning algorithm that can be applied to various natural language processing tasks including: part-of-speech tagging, chunking, named entity recognition, and semantic role labeling. This…

Machine Learning · Computer Science 2011-03-03 Ronan Collobert , Jason Weston , Leon Bottou , Michael Karlen , Koray Kavukcuoglu , Pavel Kuksa

Neural language models for network configuration: Opportunities and reality check

Boosted by deep learning, natural language processing (NLP) techniques have recently seen spectacular progress, mainly fueled by breakthroughs both in representation learning with word embeddings (e.g. word2vec) as well as novel…

Networking and Internet Architecture · Computer Science 2022-07-26 Zied Ben Houidi , Dario Rossi

A Survey on Large Language Models from Concept to Implementation

Recent advancements in Large Language Models (LLMs), particularly those built on Transformer architectures, have significantly broadened the scope of natural language processing (NLP) applications, transcending their initial use in chatbot…

Computation and Language · Computer Science 2024-05-29 Chen Wang , Jin Zhao , Jiaqi Gong

Large Language Model Driven Development of Turbulence Models

Artificial intelligence (AI) has achieved human-level performance in specialized tasks such as Go, image recognition, and protein folding, raising the prospect of an AI singularity-where machines not only match but surpass human reasoning.…

Fluid Dynamics · Physics 2025-12-30 Zhongxin Yang , Yuanwei Bin , Yipeng Shi , Xiang I. A. Yang

Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference

Large language models (LLMs) have revolutionized natural language processing (NLP) by excelling at understanding and generating human-like text. However, their widespread deployment can be prohibitively expensive. SortedNet is a recent…

Computation and Language · Computer Science 2024-02-12 Parsa Kavehzadeh , Mojtaba Valipour , Marzieh Tahaei , Ali Ghodsi , Boxing Chen , Mehdi Rezagholizadeh

Enhanced Transformer Architecture for Natural Language Processing

Transformer is a state-of-the-art model in the field of natural language processing (NLP). Current NLP models primarily increase the number of transformers to improve processing performance. However, this technique requires a lot of…

Computation and Language · Computer Science 2023-10-18 Woohyeon Moon , Taeyoung Kim , Bumgeun Park , Dongsoo Har

A Neural Network Approach for Mixing Language Models

The performance of Neural Network (NN)-based language models is steadily improving due to the emergence of new architectures, which are able to learn different natural language characteristics. This paper presents a novel framework, which…

Computation and Language · Computer Science 2017-08-24 Youssef Oualil , Dietrich Klakow

An Introduction to Natural Language Processing Techniques and Framework for Clinical Implementation in Radiation Oncology

Natural Language Processing (NLP) is a key technique for developing Medical Artificial Intelligence (AI) systems that leverage Electronic Health Record (EHR) data to build diagnostic and prognostic models. NLP enables the conversion of…

Computation and Language · Computer Science 2023-11-09 Reza Khanmohammadi , Mohammad M. Ghassemi , Kyle Verdecchia , Ahmed I. Ghanem , Luo Bing , Indrin J. Chetty , Hassan Bagher-Ebadian , Farzan Siddiqui , Mohamed Elshaikh , Benjamin Movsas , Kundan Thind