In DetailFor programmers, working with text is not about reading their newspaper on a break; it's about taking textual data in one form and doing something to it. Extract, decrypt, parse, restructure — these are just some of the text tasks that can occupy much of a programmer's life. If this is your life, this book will make it better — a practical guide on how to do what you want with textual data in Python.
Python 2.6 Text Processing Beginner's Guide is the easiest way to learn how to manipulate text with Python. Packed with examples, it will teach you text processing techniques and give you the skills to work with the most popular Python libraries for transforming text from one form to another.
The book gets you going with a quick look at some data formats, and installing the supporting libraries and components so that you're ready to get started. You move on to extracting text from a collection of sources and handling it using Python's built-in string functions and regular expressions. You look into processing structured text documents such as XML and HTML, JSON, and CSV. Then you progress to generating documents and creating templates. Finally you look at ways to enhance text output via a collection of third-party packages such as Nucular, PyParsing, NLTK, and Mako.
Learn text processing techniques and work with the most popular Python libraries for transforming text from one form to another
ApproachThis book is part of the Beginner's Guide series. Each chapter covers the steps for various tasks to process data followed by brief explanation of what is happening in each task. The explanation is followed by a few questions on the topic under discussion that will serve as a refresher course for you.
Who this book is forThis book is for people who have text in one format, and need it in another, as quickly as possible. You don't need any experience with text processing, but you will need some basic knowledge of Python.