TY - GEN
T1 - The hidden Web, XML and the Semantic Web
T2 - 14th International Conference on Extending Database Technology: Advances in Database Technology, EDBT 2011
AU - Suchanek, Fabian M.
AU - Varde, Aparna S.
AU - Nayak, Richi
AU - Senellart, Pierre
PY - 2011
Y1 - 2011
N2 - The World Wide Web no longer consists just of HTML pages. Our work sheds light on a number of trends on the Internet that go beyond simple Web pages. The hidden Web provides a wealth of data in semi-structured form, accessible through Web forms and Web services. These services, as well as numerous other applications on the Web, commonly use XML, the eXtensible Markup Language. XML has become the lingua franca of the Internet that allows customized markups to be defined for specific domains. On top of XML, the Semantic Web grows as a common structured data source. In this work, we first explain each of these developments in detail. Using real-world examples from scientific domains of great interest today, we then demonstrate how these new developments can assist the managing, harvesting, and organization of data on the Web. On the way, we also illustrate the current research avenues in these domains. We believe that this effort would help bridge multiple database tracks, thereby attracting researchers with a view to extend database technology.
KW - Deep web
KW - Domain-specific markup languages
KW - Hidden Web
KW - Multidisciplinary work
KW - Scientific data
KW - Semantic web
KW - XML
UR - http://www.scopus.com/inward/record.url?scp=79953901961&partnerID=8YFLogxK
U2 - 10.1145/1951365.1951433
DO - 10.1145/1951365.1951433
M3 - Conference contribution
AN - SCOPUS:79953901961
SN - 9781450305280
T3 - ACM International Conference Proceeding Series
SP - 534
EP - 537
BT - Advances in Database Technology - EDBT 2011
PB - Association for Computing Machinery
Y2 - 22 March 2011 through 24 March 2011
ER -