In this paper we describe work relating to classification of Web documents using a graph-based model instead of the traditional vector-based model for document representation. We compare the classification accuracy of the vector model approach using the k-nearest neighbor (k-NN) algorithm to a novel approach which allows the use of graphs for document representation in the k-NN algorithm. The proposed method is evaluated on three different Web document collections using the leave-one-out approach for measuring classification accuracy. The results show that the graph-based k-NN approach can outperform traditional vector-based k-NN methods in terms of both accuracy and execution time.


    Zugriff

    Zugriff prüfen

    Verfügbarkeit in meiner Bibliothek prüfen

    Bestellung bei Subito €


    Exportieren, teilen und zitieren



    Titel :

    Classification of Web documents using a graph model


    Beteiligte:
    Schenker, A. (Autor:in) / Last, M. (Autor:in) / Bunke, H. (Autor:in) / Kandel, A. (Autor:in)


    Erscheinungsdatum :

    2003-01-01


    Format / Umfang :

    264185 byte




    Medientyp :

    Aufsatz (Konferenz)


    Format :

    Elektronische Ressource


    Sprache :

    Englisch



    Classification of Web Documents Using a Graph Model

    Schenker, A. / Last, M. / Bunke, H. et al. | British Library Conference Proceedings | 2003


    K-Graph: Knowledgeable Graph for Text Documents

    Mittal Varsha / Gangodkar Durgaprasad / Pant Bhaskar | DOAJ | 2021

    Freier Zugriff

    Classification and Postprocessing of Documents Using an Error-Correcting Parser

    Bunke, H. / Liviero, R. | British Library Conference Proceedings | 1995


    Classification and Functional Decomposition of Business Documents

    Taylor, S. L. / Lipshutz, M. / Nilson, R. W. | British Library Conference Proceedings | 1995


    An Adaptive System for Automatic Invoice-Documents Classification

    Alippi, C. / Pessina, F. / Roveri, M. | British Library Conference Proceedings | 2005