Materials
Book Code and Data
You can download the zipped archive of all the book materials from here (last updated: 2015-01-27). You can also clone the book materials directory from our GitHub repository.
- Chapter 1: Introduction
- Part 1: A Primer on Web and Data Technologies
- Chapter 2: HTML
- Chapter 3: XML and JSON
- Chapter 4: XPath
- Chapter 5: HTTP
- Chapter 6: AJAX
- Chapter 7: SQL and Relational Databases
- Chapter 8: Regular Expressions and String Functions
- Part 2: A Practical Toolbox for Web Scraping and Text Mining
- Chapter 9: Scraping the Web
- Chapter 10: Statistical Text Processing
- Chapter 11: Managing Data Projects
- Part 3: A Bag of Case Studies
- Chapter 12: Collaboration Networks in the U.S. Senate
- Chapter 13: Parsing Information from Semi-Structured Documents
- Chapter 14: Predicting the 2014 Academy Awards using Twitter
- Chapter 15: Mapping the Geographic Distribution of Names
- Chapter 16: Gathering Data on Mobile Phones
- Chapter 17: Analyzing Sentiments of Product Reviews