The hidden Web, XML and the Semantic Web: Scientific data management perspectives

Fabian M. Suchanek, Aparna S. Varde, Richi Nayak, Pierre Senellart

Research output: Chapter in Book/Report/Conference proceedingConference contribution

8 Citations (Scopus)

Abstract

The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wealth of data in semi-structured form, accessible through Web forms and Web services. These services, as well as numerous other applications on the Web, commonly use XML, the eXtensible Markup Language. XML has become the lingua franca of the Internet that allows customized markups to be defined for specific domains. On top of XML, the Semantic Web grows as a common structured data source. In this work, we first explain each of these developments in detail. Using real-world examples from scientific domains of great interest today, we then demonstrate how these new developments can assist the managing, harvesting, and organization of data on the Web. On the way, we also illustrate the current research avenues in these domains. We believe that this effort would help bridge multiple database tracks, thereby attracting researchers with a view to extend database technology.

Original languageEnglish
Title of host publicationAdvances in Database Technology - EDBT 2011
Subtitle of host publication14th International Conference on Extending Database Technology, Proceedings
Pages534-537
Number of pages4
DOIs
StatePublished - 18 Apr 2011
Event14th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 2011 - Uppsala, Sweden
Duration: 22 Mar 201124 Mar 2011

Publication series

NameACM International Conference Proceeding Series

Other

Other14th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 2011
CountrySweden
CityUppsala
Period22/03/1124/03/11

Fingerprint

Semantic Web
XML
World Wide Web
Information management
Internet
HTML
Web services
Websites

Keywords

  • Deep web
  • Domain-specific markup languages
  • Hidden Web
  • Multidisciplinary work
  • Scientific data
  • Semantic web
  • XML

Cite this

Suchanek, F. M., Varde, A. S., Nayak, R., & Senellart, P. (2011). The hidden Web, XML and the Semantic Web: Scientific data management perspectives. In Advances in Database Technology - EDBT 2011: 14th International Conference on Extending Database Technology, Proceedings (pp. 534-537). (ACM International Conference Proceeding Series). https://doi.org/10.1145/1951365.1951433
Suchanek, Fabian M. ; Varde, Aparna S. ; Nayak, Richi ; Senellart, Pierre. / The hidden Web, XML and the Semantic Web : Scientific data management perspectives. Advances in Database Technology - EDBT 2011: 14th International Conference on Extending Database Technology, Proceedings. 2011. pp. 534-537 (ACM International Conference Proceeding Series).
@inproceedings{d2912b33865645f8a9af322941c4283a,
title = "The hidden Web, XML and the Semantic Web: Scientific data management perspectives",
abstract = "The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wealth of data in semi-structured form, accessible through Web forms and Web services. These services, as well as numerous other applications on the Web, commonly use XML, the eXtensible Markup Language. XML has become the lingua franca of the Internet that allows customized markups to be defined for specific domains. On top of XML, the Semantic Web grows as a common structured data source. In this work, we first explain each of these developments in detail. Using real-world examples from scientific domains of great interest today, we then demonstrate how these new developments can assist the managing, harvesting, and organization of data on the Web. On the way, we also illustrate the current research avenues in these domains. We believe that this effort would help bridge multiple database tracks, thereby attracting researchers with a view to extend database technology.",
keywords = "Deep web, Domain-specific markup languages, Hidden Web, Multidisciplinary work, Scientific data, Semantic web, XML",
author = "Suchanek, {Fabian M.} and Varde, {Aparna S.} and Richi Nayak and Pierre Senellart",
year = "2011",
month = "4",
day = "18",
doi = "10.1145/1951365.1951433",
language = "English",
isbn = "9781450305280",
series = "ACM International Conference Proceeding Series",
pages = "534--537",
booktitle = "Advances in Database Technology - EDBT 2011",

}

Suchanek, FM, Varde, AS, Nayak, R & Senellart, P 2011, The hidden Web, XML and the Semantic Web: Scientific data management perspectives. in Advances in Database Technology - EDBT 2011: 14th International Conference on Extending Database Technology, Proceedings. ACM International Conference Proceeding Series, pp. 534-537, 14th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 2011, Uppsala, Sweden, 22/03/11. https://doi.org/10.1145/1951365.1951433

The hidden Web, XML and the Semantic Web : Scientific data management perspectives. / Suchanek, Fabian M.; Varde, Aparna S.; Nayak, Richi; Senellart, Pierre.

Advances in Database Technology - EDBT 2011: 14th International Conference on Extending Database Technology, Proceedings. 2011. p. 534-537 (ACM International Conference Proceeding Series).

Research output: Chapter in Book/Report/Conference proceedingConference contribution

TY - GEN

T1 - The hidden Web, XML and the Semantic Web

T2 - Scientific data management perspectives

AU - Suchanek, Fabian M.

AU - Varde, Aparna S.

AU - Nayak, Richi

AU - Senellart, Pierre

PY - 2011/4/18

Y1 - 2011/4/18

N2 - The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wealth of data in semi-structured form, accessible through Web forms and Web services. These services, as well as numerous other applications on the Web, commonly use XML, the eXtensible Markup Language. XML has become the lingua franca of the Internet that allows customized markups to be defined for specific domains. On top of XML, the Semantic Web grows as a common structured data source. In this work, we first explain each of these developments in detail. Using real-world examples from scientific domains of great interest today, we then demonstrate how these new developments can assist the managing, harvesting, and organization of data on the Web. On the way, we also illustrate the current research avenues in these domains. We believe that this effort would help bridge multiple database tracks, thereby attracting researchers with a view to extend database technology.

AB - The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wealth of data in semi-structured form, accessible through Web forms and Web services. These services, as well as numerous other applications on the Web, commonly use XML, the eXtensible Markup Language. XML has become the lingua franca of the Internet that allows customized markups to be defined for specific domains. On top of XML, the Semantic Web grows as a common structured data source. In this work, we first explain each of these developments in detail. Using real-world examples from scientific domains of great interest today, we then demonstrate how these new developments can assist the managing, harvesting, and organization of data on the Web. On the way, we also illustrate the current research avenues in these domains. We believe that this effort would help bridge multiple database tracks, thereby attracting researchers with a view to extend database technology.

KW - Deep web

KW - Domain-specific markup languages

KW - Hidden Web

KW - Multidisciplinary work

KW - Scientific data

KW - Semantic web

KW - XML

UR - http://www.scopus.com/inward/record.url?scp=79953901961&partnerID=8YFLogxK

U2 - 10.1145/1951365.1951433

DO - 10.1145/1951365.1951433

M3 - Conference contribution

AN - SCOPUS:79953901961

SN - 9781450305280

T3 - ACM International Conference Proceeding Series

SP - 534

EP - 537

BT - Advances in Database Technology - EDBT 2011

ER -

Suchanek FM, Varde AS, Nayak R, Senellart P. The hidden Web, XML and the Semantic Web: Scientific data management perspectives. In Advances in Database Technology - EDBT 2011: 14th International Conference on Extending Database Technology, Proceedings. 2011. p. 534-537. (ACM International Conference Proceeding Series). https://doi.org/10.1145/1951365.1951433