Proceedings of 2010 Cross-Strait Conference on Information Science and Technology (CSCIST 2010 E-BOOK)

E-mail Password New User

Proceedings of 2010 Cross-Strait Conference on Information Science and Technology (CSCIST 2010 E-BOOK)

Qinhuangdao,China,7.9-7.13，2010

ISBN: 978-1-935068-15-0 Scientific Research Publishing, USA

E-Book 840pp Pub. Date: July 2010

Category: Computer Science & Communications

Price: $120

Title:	Study on Text Information Extraction Model and Algorithm of HTML Documents
Source:	Proceedings of 2010 Cross-Strait Conference on Information Science and Technology (CSCIST 2010 E-BOOK) (pp 399-403)
Author(s):	Chunyan Li, Tangshan Teachers College Haiyang Jiang, Tangshan Teachers College
Abstract:	This article improves the automatic data extraction method of Web information based on HTML. This method can extract structure data from non-structure information on the Web. What this article shows is as follows:Firstly, the method of EXALG system is analyzed and its problems are found out.Secondly, the improved EXALG system is provided Thirdly, the privilege of preciseness and completeness of the new system is examined by data resource and experiment results of the author of EXALG.

Proceedings by Subjects

Books by Subjects

Resources

Products

Contact us

	book@scirp.org
	+86 18163351462(WhatsApp)
	1243940697

	Book Publishing WeChat

Follow SCIRP

Free SCIRP Newsletters

Add your e-mail address to receive free newsletters from SCIRP.

Copyright © 2006-2024 Scientific Research Publishing Inc. All Rights Reserved.

Top