Related papers: WWW Spiders: an introduction

A Brief History of Web Crawlers

Web crawlers visit internet applications, collect data, and learn about new web pages from visited pages. Web crawlers have a long and interesting history. Early web crawlers collected statistics about the web. In addition to collecting…

Information Retrieval · Computer Science 2014-05-06 Seyed M. Mirtaheri, Mustafa Emre Dinçktürk, Salman Hooshmand, Gregor V. Bochmann, Guy-Vincent Jourdan, Iosif Viorel Onut

A Survey on Information Retrieval, Text Categorization, and Web Crawling

This paper is a survey discussing Information Retrieval concepts, methods, and applications. It goes deep into the document and query modelling involved in IR systems, in addition to pre-processing operations such as removing stop words and…

Information Retrieval · Computer Science 2012-12-11 Youssef Bassil

A Comparative Study of Hidden Web Crawlers

A large amount of data on the WWW remains inaccessible to crawlers of Web search engines because it can only be exposed on demand as users fill out and submit forms. The Hidden web refers to the collection of Web data which can be accessed…

Information Retrieval · Computer Science 2014-07-23 Sonali Gupta, Komal Kumar Bhatia

Exploiting Locality in Searching the Web

Published experiments on spidering the Web suggest that, given training data in the form of a (relatively small) subgraph of the Web containing a subset of a selected class of target pages, it is possible to conduct a directed search and…

Information Retrieval · Computer Science 2012-12-12 Joel Young, Thomas L. Dean

Intelligent Web Agent for Search Engines

In this paper we review studies of the growth of the Internet and technologies that are useful for information search and retrieval on the Web. Search engines retrieve information efficiently. We collected data on the Internet from…

Information Retrieval · Computer Science 2013-10-18 Avinash N Bhute, B. B. Meshram

CRATOR: a Dark Web Crawler

Dark web crawling is a complex process that involves specific methodologies and techniques to navigate the Tor network and extract data from hidden services. This study proposes a general dark web crawler designed to extract pages handling…

Cryptography and Security · Computer Science 2024-05-13 Daniel De Pascale, Giuseppe Cascavilla, Damian A. Tamburri, Willem-Jan Van Den Heuvel

Design of Automatically Adaptable Web Wrappers

Nowadays, the huge amount of information distributed through the Web motivates studying techniques to be adopted in order to extract relevant data in an efficient and reliable way. Both academia and enterprises developed several approaches…

Artificial Intelligence · Computer Science 2013-06-06 Emilio Ferrara, Robert Baumgartner

Comparative analysis of various web crawler algorithms

This presentation focuses on the importance of web crawling and page ranking algorithms in dealing with the massive amount of data present on the World Wide Web. As the web continues to grow exponentially, efficient search and retrieval…

Information Retrieval · Computer Science 2023-06-22 Nithin T K, Chandana S, Barani G, Chavva Dharani, M S Karishma

Penerapan teknik web scraping pada mesin pencari artikel ilmiah

Search engines are a combination of hardware and computer software supplied by a particular company through the website which has been determined. Search engines collect information from the web through bots or web crawlers that crawl the…

Information Retrieval · Computer Science 2014-10-22 Ahmad Josi, Leon Andretti Abdillah, Suryayusra

Software systems through complex networks science: Review, analysis and applications

Complex software systems are among the most sophisticated human-made systems, yet only little is known about the actual structure of 'good' software. We here study different software systems developed in Java from the perspective of network…

Social and Information Networks · Computer Science 2013-05-24 Lovro Šubelj, Marko Bajec

Complexity: An Introduction

This article summarises a Web-book on "Complexity" that was developed to introduce undergraduate students to interesting complex systems in the biological, physical and social sciences, and the common tools, principles and concepts used for…

Physics Education · Physics 2007-05-23 Rajesh R. Parwani

Design and development of a software system for swarm intelligence based research studies

This paper introduces a software system including widely-used Swarm Intelligence algorithms or approaches to be used for the related scientific research studies associated with the subject area. The programmatic infrastructure of the system…

Artificial Intelligence · Computer Science 2017-04-05 Utku Kose

The network approach: basic concepts and algorithms

What is a complex network? How do we characterize complex networks? Which systems can be studied from a network approach? In this text, we motivate the use of complex networks to study and understand a broad panoply of systems, ranging from…

Physics and Society · Physics 2007-11-27 Pedro G. Lind

A Study on Web Application Vulnerabilities to find an optimal Security Architecture

Over the past three decades, computers have managed to make their way into a majority of households. Due to this enormous transition, the surge in the internet's popularity was inevitable. Just like everything else, whatever has a pro also…

Cryptography and Security · Computer Science 2024-09-02 C. Amuthadevi, Sparsh Srivastava, Raghav Khatoria, Varun Sangwan

Spider networks

In this investigation we study a family of networks, called spiders, which covers a range of networks going from chains to complete graphs. These spiders are characterized by three parameters: the number of nodes in the core, the number of…

Combinatorics · Mathematics 2024-10-15 Leo Egghe, Li Li, Ronald Rousseau

Tourism networks and computer networks

The body of knowledge accumulated in recent years on the structure and the dynamics of complex networks has offered useful insights on the behaviour of many natural and artificial complex systems. The analysis of some of these, namely those…

Physics and Society · Physics 2008-01-16 Rodolfo Baggio

Web Engineering

Web Engineering is the application of systematic, disciplined and quantifiable approaches to development, operation, and maintenance of Web-based applications. It is both a pro-active approach and a growing collection of theoretical and…

Software Engineering · Computer Science 2007-05-23 Yogesh Deshpande, San Murugesan, Athula Ginige, Steve Hansen, Daniel Schwabe, Martin Gaedke, Bebo White

Robin: A Web Security Tool

Thanks to the advance of technology, all kinds of applications are becoming more complete and capable of performing complex tasks that save much of our time. But to perform these tasks, applications require that some personal information…

Cryptography and Security · Computer Science 2020-07-15 Guilherme Girotto, Avelino Francisco Zorzo

Analyzing Web Services Networks: a WS-NEXT Application

Web services represent a system with a huge number of units and many various and complex interactions. Complex networks as a tool for modelling and analyzing natural environments seem to be well adapted to such a complex system. To describe…

Software Engineering · Computer Science 2013-05-02 Chantal Cherifi, Jean-François Santucci

Service Wrapper: a system for converting web data into web services

Web services are widely used in many areas via callable APIs; however, data are not always available in this way. We always need to get some data from web pages whose structure is not in order. Many developers use web data extraction…

Databases · Computer Science 2019-10-18 Naibo Wang, Zhiling Luo, Xiya Lyu, Zitong Yang, Jianwei Yin