作者:David Mertz
出版日期:June 06, 2003
出版社:Addison Wesley
页数:544
ISBN:0-321-11254-7
文件格式:CHM
Text Processing in Pythonis an example-driven, hands-on tutorial that carefully teachesprogrammers how to accomplish numerous text processing tasks using thePython language. Filled with concrete examples, this book providesefficient and effective solutions to specific text processing problemsand practical strategies for dealing with all types of text processingchallenges.
Text Processing in Pythonbegins with an introduction to text processing and contains a quickPython tutorial to get you up to speed. It then delves into essentialtext processing subject areas, including string operations, regularexpressions, parsers and state machines, and Internet tools andtechniques. Appendixes cover such important topics as data compressionand Unicode. A comprehensive index and plentiful cross-referencingoffer easy access to available information. In addition, exercisesthroughout the book provide readers with further opportunity to honetheir skills either on their own or in the classroom. A companion Website (http://gnosis.cx/TPiP) contains source code and examples from thebook.
Here is some of what you will find in thie book:
When do I use formal parsers to process structured and semi-structured data? Page 257
How do I work with full text indexing? Page 199
What patterns in text can be expressed using regular expressions? Page 204
How do I find a URL or an email address in text? Page 228
How do I process a report with a concrete state machine? Page 274
How do I parse, create, and manipulate internet formats? Page 345
How do I handle lossless and lossy compression? Page 454
How do I find codepoints in Unicode? Page 465