Scholarly Commons at Miami University

    Concepts and Effectiveness of the Cover Coefficient Based Clustering Methodology for Text Databases

    Date
    1987-12-01
    Author
    Can, Fazli
    Ozkarahan, Esen
    Abstract
    An algorithm for document clustering is introduced. The algorithm is based on the Cover Coefficient (CC) concept, which provides a means of estimating the number of clusters within a document database. The CC concept is also used to identify cluster seeds, to form clusters around those seeds, and to calculate Term Discrimination Values (TDVs) and Document Significance Values (DSVs). TDVs and DSVs are used to optimize document descriptions. The CC concept also relates indexing and clustering analytically. Experimental results indicate that clustering performance, measured as the percentage of useful information accessed (precision), is forty percent higher than that of random assignment of documents to clusters, with an accompanying reduction in search space. The experiments validated the indexing-clustering relationships and showed improvements in retrieval precision when TDV and DSV optimizations are used.
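The abstract does not state the formulas behind the CC concept. As a rough illustration only, the sketch below assumes the definition commonly given in the authors' later published descriptions of the method: for a document-by-term matrix D, the cover coefficient is c_ij = alpha_i * sum_k(d_ik * beta_k * d_jk), where alpha_i and beta_k are the reciprocals of the row and column sums, and the estimated number of clusters is the sum of the diagonal entries c_ii. The function names and the toy matrix are hypothetical and are not taken from the report.

```python
def cover_coefficient_matrix(D):
    """Return the m x m matrix C for a document-by-term matrix D.

    Assumed definition (not stated in the abstract):
    C[i][j] = (1 / row_sum_i) * sum_k(D[i][k] * D[j][k] / col_sum_k)
    """
    m, n = len(D), len(D[0])
    row_sums = [sum(row) for row in D]
    col_sums = [sum(D[i][k] for i in range(m)) for k in range(n)]
    if 0 in row_sums or 0 in col_sums:
        raise ValueError("every document and every term must have a nonzero weight")
    C = [[0.0] * m for _ in range(m)]
    for i in range(m):
        for j in range(m):
            shared = sum(D[i][k] * D[j][k] / col_sums[k] for k in range(n))
            C[i][j] = shared / row_sums[i]
    return C


def estimate_cluster_count(D):
    """Estimate the number of clusters as the sum of the diagonal of C."""
    C = cover_coefficient_matrix(D)
    return sum(C[i][i] for i in range(len(D)))


# Toy matrix (hypothetical): documents 0 and 1 share terms 0 and 1,
# documents 2 and 3 share term 2, and the two groups share nothing.
D = [
    [1, 1, 0],
    [1, 1, 0],
    [0, 0, 1],
    [0, 0, 1],
]
print(estimate_cluster_count(D))  # prints 2.0
```

Under this assumed definition each row of C sums to 1, so a diagonal entry near 1 marks a document that shares little with the others, and the toy matrix with two disjoint groups yields an estimate of two clusters.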
    URI

    http://hdl.handle.net/2374.MIA/246
    Collections
    • Computer Science and Software Engineering Technical Reports
