Some of the material in is restricted to members of the community. By logging in, you may be able to gain additional access to certain collections or items. If you have questions about access or logging in, please use the form on the Contact Page.
Yuan, J. (2014). Indexing, Searching, and Mining Large-Scale Visualdata via Structured Vector Quantization. Retrieved from http://purl.flvc.org/fsu/fd/FSU_migr_etd-9121
This dissertation is centered on indexing, searching, and mining methods for large-scale and high-dimensional visual data. While the processing to such data has been widely acknowledged to be difficult, the problem becomes more serious when we encounter "big data'', which has shifted the focus of many problems in computational science. There are urgent requirements of new approaches to processing the huge collections of visual information, e.g., images/videos on the Internet. The study first investigates difficulties of similarity search in high-dimensional spaces, and presents a new model of local intrinsic dimensionality that better fits to similarity search problems, e.g., the nearest neighbor search. Then it turns the focus to discussions of the advantages and the problems when various structured vector quantization applying to the large-scale visual data processing. While many structured vector quantization models can be found, this study is focused to three families of them, including product quantization (PQ), residual quantization (RQ), and tree-structured vector quantization (TSVQ). The main contributions of this work can be seen in following pipelines: 1) Two novel methods have been proposed to tackle a problem that exists in RQ for decades, and they have been used to improve residual k-means trees for scalable clustering, and to optimize two most advanced ANN search systems; 2) a new inverted index has been proposed for fast approximate nearest neighbor (ANN) search; 3) a systematic framework has been proposed for repetition mining in long video streams; 4) a tree embedding PQ model has been proposed to improve PQ codes quality for ANN search. The experimental results have shown the proposed methods are substantially better than the existing solutions in terms of trade-off among speeds, memory usage, and accuracy.
ANN search, Image Retrieval, Inverted index, K-Means, Vector quantization, Video mining
Date of Defense
July 15, 2014.
Submitted Note
A Dissertation submitted to the Department of Computer Science in partial fulfillment of the requirements for the degree of Doctor of Philosophy.
Bibliography Note
Includes bibliographical references.
Advisory Committee
Xiuwen Liu, Professor Directing Dissertation; Mark Sussman, University Representative; Piyush Kumar, Committee Member; Ashok Srinivasan, Committee Member; Peixiang Zhao, Committee Member.
Publisher
Florida State University
Identifier
FSU_migr_etd-9121
Use and Reproduction
This Item is protected by copyright and/or related rights. You are free to use this Item in any way that is permitted by the copyright and related rights legislation that applies to your use. For other uses you need to obtain permission from the rights-holder(s). The copyright in theses and dissertations completed at Florida State University is held by the students who author them.
Yuan, J. (2014). Indexing, Searching, and Mining Large-Scale Visualdata via Structured Vector Quantization. Retrieved from http://purl.flvc.org/fsu/fd/FSU_migr_etd-9121