Search
 
Home| Join Our Mailing List| New Reviews| New Titles
Editor's Choice| Bestsellers| Textbooks| Book Series| Study Guides| E-Catalogues
  COMPUTER SCIENCE
  Artificial Intelligence
Database/ Information
Sciences

Decision Sciences
Digital Security
Fuzzy Logic
Machine Vision/ Pattern
Recognition

Neural Networks/ Networking
Parallel Processing/
Supercomputing

Software Engineering
Theoretical Computer Science
General
New Titles
May Bestsellers
Editor's Choice
Nobel Lectures
Textbooks
Recent Reviews
Book Series
Related Journals
  • International Journal of Semantic Computing (IJSC)
  • International Journal of Information Acquisition (IJIA)
  • Journal of Information & Knowledge Management (JIKM)
  • Computer Science Journals
  • New Mathematics and Natural Computation (NMNC)
  • Request for related catalogues
     
      PRODUCTS
      Journals
    eBooks
    Journals Archives
    eProceedings
     
      RESOURCES
      For Librarians
    For Authors
    For Booksellers
    For Translation Rights About Us
    Contact Us
    How to Order News
    Inspection Copy
     

    GRAPH-THEORETIC TECHNIQUES FOR WEB CONTENT MINING

    by Adam Schenker (University of South Florida, USA) , Abraham Kandel (University of South Florida, USA) , Horst Bunke (University of Bern, Switzerland) , & Mark Last (Ben-Gurion University of the Negev, Israel)

    Table of Contents (57k)
    Preface (55k)
    Chapter 1: Introduction to Web Mining (442k)

    This book describes exciting new opportunities for utilizing robust graph representations of data with common machine learning algorithms. Graphs can model additional information which is often not present in commonly used data representations, such as vectors. Through the use of graph distance — a relatively new approach for determining graph similarity — the authors show how well-known algorithms, such as k-means clustering and k-nearest neighbors classification, can be easily extended to work with graphs instead of vectors. This allows for the utilization of additional information found in graph representations, while at the same time employing well-known, proven algorithms.

    To demonstrate and investigate these novel techniques, the authors have selected the domain of web content mining, which involves the clustering and classification of web documents based on their textual substance. Several methods of representing web document content by graphs are introduced; an interesting feature of these representations is that they allow for a polynomial time distance computation, something which is typically an NP-complete problem when using graphs. Experimental results are reported for both clustering and classification in three web document collections using a variety of graph representations, distance measures, and algorithm parameters.

    In addition, this book describes several other related topics, many of which provide excellent starting points for researchers and students interested in exploring this new area of machine learning further. These topics include creating graph-based multiple classifier ensembles through random node selection and visualization of graph-based data using multidimensional scaling.

     
    Contents:
    • Introduction to Web Mining
    • Graph Similarity Techniques
    • Graph Models for Web Documents
    • Graph-Based Clustering
    • Graph-Based Classification
    • The Graph Hierarchy Construction Algorithm for Web Search Clustering
     
    Readership: Researchers and graduate students who are interested in computer science, specifically machine learning. Also of interest to researchers in academia or industry in disciplines such as information science or information technology who are interested in text and web documents.
     
     
    248pp    Pub. date: May 2005  
    ISBN:   978-981-256-339-2
    981-256-339-3
       US$116 / £66

     


     

    Imperial College Press  |  Global Publishing  |  Asia-Pacific Biotech News  |  Innovation Magazine
    Labcreations Co  |  Meeting Matters  |  National Academies Press

    Copyright © 2009 World Scientific Publishing Co. All rights reserved.
    Updated on 3 July 2009