You are here

Hybrid Keyword Search Across Peer-to-Peer Federated Data

Title: Hybrid Keyword Search Across Peer-to-Peer Federated Data.
34 views
8 downloads
Name(s): Kim, Jungkee, author
Riccardi, Gregory, professor co-directing dissertation
Fox, Geoffrey C., professor co-directing dissertation
Dennis, Lawrence, outside committee member
Erlebacher, Gordon, committee member
Whalley, David, committee member
Department of Computer Science, degree granting department
Florida State University, degree granting institution
Type of Resource: text
Genre: Text
Issuance: monographic
Date Issued: 2005
Publisher: Florida State University
Place of Publication: Tallahassee, Florida
Physical Form: computer
online resource
Extent: 1 online resource
Language(s): English
Abstract/Description: The Internet provides a general communication environment for distributed resource sharing. XML has become a key technology for information representation and exchange on the Internet, increasing the opportunity for integration of the various data formats. The World Wide Web (WWW) is the example par excellence of a document-based distributed system on the Internet. As the size of the Web has increased, various problems with looking up a resource location on the Internet have emerged. Web search engines provide clues for resource location, but they have no semantic schema and often produce meaningless keyword search results. The Semantic Web suggests an alternative solution for the semantic problem on the Web. It provides multiple relation links with directed labeled graphs, and machines like Web crawlers can understand the relationship between different resources. But due to the need for sophisticated domain description and lack of unified definitions, many Web pages are not part of the Semantic Web. Meanwhile, recent public attention to peer-to-peer (P2P) networks has stimulated research on overlay P2P networks on top of the Internet. Those studies open possibilities for another form of distributed resource sharing on the Internet. In this dissertation we describe the design of a hybrid search that combines metadata search with a traditional keyword search over unstructured context data. This hybrid search paradigm provides the inquirer additional options to narrow the search with some semantic aspects through the XML metadata query. We tackle the scalability limitations of a single-machine implementation by adopting a distributed architecture. This scalable hybrid search provides a total query result from the collection of individual inquiries against independent data fragments distributed in a computer cluster. We demonstrate our architecture extends the scalability of a native XML query limited in a single machine and improves the performance of queries. Finally we generalize our hybrid architecture to more scalable searches over a P2P overlay network. This generalization may give an intermediate search paradigm on the Internet---providing semantic value through XML metadata that are simpler than those of the Semantic Web.
Identifier: FSU_migr_etd-3052 (IID)
Submitted Note: A Dissertation submitted to the Department of Computer Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy.
Degree Awarded: Spring Semester, 2005.
Date of Defense: April 6, 2005.
Keywords: Keyword Search, Data Integration, Peer-To-Peer, Information Retrieval
Bibliography Note: Includes bibliographical references.
Advisory Committee: Gregory Riccardi, Professor Co-Directing Dissertation; Geoffrey C. Fox, Professor Co-Directing Dissertation; Lawrence Dennis, Outside Committee Member; Gordon Erlebacher, Committee Member; David Whalley, Committee Member.
Subject(s): Computer science
Persistent Link to This Record: http://purl.flvc.org/fsu/fd/FSU_migr_etd-3052
Owner Institution: FSU

Choose the citation style.
Kim, J. (2005). Hybrid Keyword Search Across Peer-to-Peer Federated Data. Retrieved from http://purl.flvc.org/fsu/fd/FSU_migr_etd-3052