IE for Semi-structured Document: Supervised Approach
1. Machine-learning based Semi-structured IE Chia-Hui Chang Department of Computer Science & Information Engineering National Central University [email_address]
2.
3.
4.
5.
6. WIEN N. Kushmerick, D. S. Weld, R. Doorenbos, University of Washington, 1997 http://www.cs.ucd.ie/staff/nick/
For 15 minutes CONALD talk, skip slides 5, 9, 12, 13, 17-19, 21 For 25 minutes AIII talk, use all
How many Web sites have these problems? 9 out of the 30 sites surveyed by Kushmerick in 1997 Semistructured data (see e.g., Buneman PODS-97) Web CGI software becoming sophisticated The percentage will increase quickly, need a more powerful wrapper representation