A review on web information extraction and hidden predictive information from large databases


연구 분야: Databases



학회: Multimedia Tools and Applications


초록

Extracting the relevant data from Internet sources is a difficult task due to an increased amount of data. Therefore, an effective and flexible information extraction mechanism is established to transform the website pages into program-friendly structures like a relational database. So far, several types of research have been conducted on web information extraction using various methods. In recent years, retrieving information from website links has received more attention due to the increasing growth of Internet facilities. However, gaining better outcomes in the information extraction process is a critical issue faced by several existing methods. The current web information extraction survey presents only some techniques used to retrieve information and does not present exact limitations. In addition, there is a lack of verification of existing techniques used to retrieve hidden forecast information in various databases. In order to recommend suitable techniques for web information retrieval, identifying the limitations of existing methods is more important. Therefore, this review article covers varied techniques used to extract web and hidden prediction information from various databases and mentions the pros and cons of such methods in 2017–2024. To enhance the effectiveness of this article, it briefly describes the challenges of retrieving information from the web, applications of web information extraction, and useful future recommendations. Based on this review article, the most efficient techniques suitable for the web information extraction process can be exhibited for future use.


Author Profile
Dhanraj Jadhav

Om Parkash Jogender Singh University Churu Rajasthan 331303 India

India
Author Profile
Jaibir Singh

Department of Computer Science and Engineering Om Parkash Jogender Singh University Churu Rajasthan 331303 India

Andorra

📄 논문 정보

발행 연도 2025년
인용수 0
출판 국가 Andorra, India
사이트 Springer
좋아요 수 0

연관 논문 목록 (365건)