TEL-8 Query Interfaces
Abstract
This dataset contains the original query interfaces, together with their manually extracted query capabilities, of 447 deep Web sources from 8 representative domains. The domains form 3 groups whose initials, "TEL", give the dataset its name:
the Travel group (Airfares, Hotels, and Car Rentals); the Entertainment group (Books, Movies, and Music Records); and
the Living group (Jobs and Automobiles).
For each source, this dataset archives its root homepages and query-interface
pages. In addition, it includes the manually extracted query capability for each interface.
Documentation
The documentation describes how this dataset was created and
how to use it.
Data files
Note that the zipped file uncompresses properly on Linux. On Windows, however, uncompressing it may cause errors because some file names in the archive use a format that Windows does not support.
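One possible workaround on Windows is to extract the archive with a short script that rewrites problematic file names as it goes. The following is a minimal sketch, not part of the dataset's own tooling. It assumes the errors come from characters that Windows forbids in file names (such as ? or :), which is not confirmed here; the function names and the replacement character are hypothetical choices for illustration.

```python
import re
import zipfile
from pathlib import Path

# Characters Windows does not allow in file names. ASSUMPTION: these are what
# triggers the extraction errors; the dataset page does not say which ones do.
_WINDOWS_FORBIDDEN = re.compile(r'[<>:"|?*\\]')


def sanitize_member_name(name: str) -> str:
    """Replace Windows-forbidden characters in each path component with '_'."""
    # Drop empty, '.' and '..' components so nothing escapes the target folder.
    parts = [p for p in name.split("/") if p not in ("", ".", "..")]
    # Windows also rejects names ending in a space or a dot.
    return "/".join(_WINDOWS_FORBIDDEN.sub("_", p).rstrip(" .") or "_" for p in parts)


def extract_sanitized(archive, dest_dir) -> list[str]:
    """Extract every file in `archive` under `dest_dir` using sanitized names."""
    dest = Path(dest_dir)
    written = []
    with zipfile.ZipFile(archive) as zf:
        for info in zf.infolist():
            if info.is_dir():
                continue
            safe = sanitize_member_name(info.filename)
            if not safe:
                continue
            target = dest / safe
            target.parent.mkdir(parents=True, exist_ok=True)
            target.write_bytes(zf.read(info))
            written.append(safe)
    return written
```

Calling extract_sanitized with the path of the downloaded zip file and an output folder returns the list of names actually written. Two archive members that map to the same sanitized name would overwrite each other, so it may be worth comparing the number of returned names against the number of distinct names.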
Tasks Using This Dataset
Sources
Original Owners
Kevin Chen-Chuan Chang, Bin He, Chengkai Li, and Zhen Zhang
Computer Science Department
University of Illinois at Urbana-Champaign
binhe[at]uiuc.edu
Date Created: May 2003