updating ch 7 ordering

mattmakai · mattmakai · commit eba6ce5a0f20 · 2014-08-13T20:06:41.000-04:00
diff --git a/feeds/all.atom.xml b/feeds/all.atom.xml
@@ -1,2 +1,2 @@
 <?xml version="1.0" encoding="utf-8"?>
-<feed xmlns="http://www.w3.org/2005/Atom"><title>Matt Makai</title><link href="http://www.fullstackpython.com/" rel="alternate"></link><link href="http://www.fullstackpython.com/feeds/all.atom.xml" rel="self"></link><id>http://www.fullstackpython.com/</id><updated>2014-08-13T09:27:55Z</updated></feed>
+<feed xmlns="http://www.w3.org/2005/Atom"><title>Matt Makai</title><link href="http://www.fullstackpython.com/" rel="alternate"></link><link href="http://www.fullstackpython.com/feeds/all.atom.xml" rel="self"></link><id>http://www.fullstackpython.com/</id><updated>2014-08-13T20:06:30Z</updated></feed>
diff --git a/source/content/pages/07-performance/0701-static-content.markdown b/source/content/pages/07-performance/0701-static-content.markdown
@@ -1,7 +1,7 @@
 title: Static Content
 category: page
 slug: static-content
-sort-order: 071
+sort-order: 0701
 choice1url: /caching.html
 choice1icon: fa-repeat
 choice1text: How do I cache repeated operations to improve performance?
diff --git a/source/content/pages/07-performance/0702-caching.markdown b/source/content/pages/07-performance/0702-caching.markdown
@@ -0,0 +1,70 @@
+title: Caching
+category: page
+slug: caching
+sort-order: 0702
+choice1url: /task-queues.html
+choice1icon: fa-tasks
+choice1text: How do I run Python outside the HTTP request-response cycle?
+choice2url: /web-analytics.html
+choice2icon: fa-dashboard
+choice2text: What can I learn about my users through web analytics?
+choice3url: /web-application-security.html
+choice3icon: fa-lock fa-inverse
+choice3text: What should I know about security to protect my app?
+choice4url: /configuration-management.html
+choice4icon: fa-gears fa-inverse
+choice4text: How do I automate the server configuration that I set up?
+
+
+# Caching
+Caching can reduce the load on servers by storing the results of common 
+operations and serving the precomputed answers to clients. 
+
+For example, instead of retrieving data from database tables that rarely 
+change, you can store the values in memory. Retrieving values from an 
+in-memory location is far faster than retrieving them from a database (which
+stores them on a persistent disk like a hard drive). When the underlying 
+values change, the system can invalidate the cache and re-retrieve the updated
+values for future requests.
+
+A cache can be created for multiple layers of the stack. 
+
+
+## Caching backends
+* [memcached](http://memcached.org/) is a common in-memory caching system.
+
+* [Redis](http://redis.io/) is a key-value in-memory data store that can
+  easily be configured for caching with libraries such as 
+  [django-redis-cache](https://github.com/sebleier/django-redis-cache).
+
+
+## Caching resources
+* [Caching: Varnish or Nginx?](https://bjornjohansen.no/caching-varnish-or-nginx)
+  reviews some considerations, such as SSL and SPDY support, when choosing
+  between Nginx and Varnish as a reverse proxy.
+
+* [Caching is Hard, Draw me a Picture](http://bizcoder.com/caching-is-hard-draw-me-a-picture)
+  has diagrams of how web request caching layers work. The post is relevant
+  reading even though the author is describing his Microsoft code as the 
+  impetus for writing the content.
+
+
+## Caching learning checklist
+<i class="fa fa-check-square-o"></i>
+Analyze your web application for the slowest parts. It's likely there are
+complex database queries that can be precomputed and stored in an in-memory
+data store.
+
+<i class="fa fa-check-square-o"></i>
+Leverage your existing in-memory data store already used for session data
+to cache the results of those complex database queries. 
+A [task queue](/task-queues.html) can often be used to precompute the results 
+on a regular basis and save them in the data store.
+
+<i class="fa fa-check-square-o"></i>
+Incorporate a cache invalidation scheme so the precomputed results remain 
+accurate when served up to the user.
+
+
+
+### What do you want to learn now that your app is responding faster?
diff --git a/source/content/pages/07-performance/0703-task-queues.markdown b/source/content/pages/07-performance/0703-task-queues.markdown
@@ -0,0 +1,164 @@
+title: Task Queues
+category: page
+slug: task-queues
+sort-order: 0703
+choice1url: /logging.html
+choice1icon: fa-align-left fa-inverse
+choice1text: How do I monitor my app and its task queues with logging?
+choice2url: /web-analytics.html
+choice2icon: fa-dashboard
+choice2text: How can I learn more about the users of my application? 
+choice3url: /monitoring.html
+choice3icon: fa-bar-chart-o fa-inverse
+choice3text: What tools exist for monitoring a live web application?
+choice4url:
+choice4icon:
+choice4text:
+
+
+# Task queues
+Task queues manage background work that must be executed outside the usual
+HTTP request-response cycle.
+
+
+## Why are task queues necessary?
+Tasks are handled asynchronously either because they are not initiated by 
+an HTTP request or because they are long-running jobs that would dramatically
+reduce the performance of an HTTP response.
+
+For example, a web application could poll the GitHub API every 10 minutes to
+collect the names of the top 100 starred repositories. A task queue would
+handle invoking code to call the GitHub API, process the results and store them
+in a persistent database for later use.
+
+Another example is when a database query would take too long during the HTTP
+request-response cycle. The query could be performed in the background on a
+fixed interval with the results stored in the database. When an
+HTTP request comes in that needs those results, a query would simply fetch the
+precalculated result instead of re-executing the longer query.
+This precalculation scenario is a form of [caching](/caching.html) enabled 
+by task queues.
+
+Other types of jobs for task queues include
+
+* spreading out large numbers of independent database inserts over time 
+  instead of inserting everything at once
+
+* aggregating collected data values on a fixed interval, such as every
+  15 minutes
+
+* scheduling periodic jobs such as batch processes
+
+
+## Task queue projects
+The de facto standard Python task queue is Celery. The other task queue 
+projects that arise tend to come from the perspective that Celery is overly
+complicated for simple use cases. My recommendation is to put the effort into
+Celery's reasonable learning curve as it is worth the time it takes to 
+understand how to use the project.
+
+* The [Celery](http://www.celeryproject.org/) distributed task queue is the
+  most commonly used Python library for handling asynchronous tasks and 
+  scheduling.
+
+* [RQ (Redis Queue)](http://python-rq.org/) is a simple Python
+  library for queueing jobs and processing them in the background with workers.
+  RQ is backed by Redis and is designed to have a low barrier to entry.
+  The [intro post](http://nvie.com/posts/introducing-rq/) contains information
+  on design decisions and how to use RQ.
+
+* [Taskmaster](https://github.com/dcramer/taskmaster) is a lightweight, simple
+  distributed queue for handling large volumes of one-off tasks. 
+
+
+## Hosted message and task queue services
+Third-party task queue services aim to solve the complexity issues that arise
+when scaling out a large deployment of distributed task queues.
+
+* [Iron.io](http://www.iron.io/) is a distributed messaging service platform 
+  that works with many types of task queues such as Celery. It is also built
+  to work with other IaaS and PaaS environments such as Amazon Web Services
+  and Heroku.
+
+* [Amazon Simple Queue Service (SQS)](http://aws.amazon.com/sqs/) is a
+  set of five APIs for creating, sending, receiving, modifying and deleting
+  messages.
+
+* [CloudAMQP](http://www.cloudamqp.com/) is at its core managed servers with
+  RabbitMQ installed and configured. This service is an option if you are 
+  using RabbitMQ and do not want to maintain RabbitMQ installations on your 
+  own servers.
+
+
+## Task queue resources
+* [Getting Started Scheduling Tasks with Celery](http://www.caktusgroup.com/blog/2014/06/23/scheduling-tasks-celery/)
+  is a detailed walkthrough for setting up Celery with Django (although
+  Celery can also be used without a problem with other frameworks).
+
+* [Distributing work without Celery](http://justcramer.com/2012/05/04/distributing-work-without-celery/)
+  provides a scenario in which Celery and RabbitMQ are not the right tool
+  for scheduling asynchronous jobs.
+
+* [Evaluating persistent, replicated message queues](http://www.warski.org/blog/2014/07/evaluating-persistent-replicated-message-queues/)
+  is a detailed comparison of Amazon SQS, MongoDB, RabbitMQ, HornetQ and
+  Kafka's designs and performance.
+
+* [Queues.io](http://queues.io/) is a collection of task queue systems with
+  short summaries for each one. The task queues are not all compatible with
+  Python but ones that work with it are tagged with the "Python" keyword.
+
+* [Why Task Queues](http://www.slideshare.net/bryanhelmig/task-queues-comorichweb-12962619) 
+  is a presentation on what task queues are and why they are needed. 
+
+* [How to use Celery with RabbitMQ](https://www.digitalocean.com/community/articles/how-to-use-celery-with-rabbitmq-to-queue-tasks-on-an-ubuntu-vps)
+  is a detailed walkthrough for using these tools on an Ubuntu VPS.
+
+* Heroku has a clear walkthrough for using 
+  [RQ for background tasks](https://devcenter.heroku.com/articles/python-rq).
+
+* [Introducing Celery for Python+Django](http://www.linuxforu.com/2013/12/introducing-celery-pythondjango/) 
+  provides an introduction to the Celery task queue.
+
+* [Celery - Best Practices](https://denibertovic.com/posts/celery-best-practices/)
+  explains things you should not do with Celery and shows some underused 
+  features for making task queues easier to work with.
+
+* The "Django in Production" series by 
+  [Rob Golding](https://twitter.com/robgolding63) contains a post 
+  specifically on [Background Tasks](http://www.robgolding.com/blog/2011/11/27/django-in-production-part-2---background-tasks/).
+
+* [Asynchronous Processing in Web Applications Part One](http://blog.thecodepath.com/2012/11/15/asynchronous-processing-in-web-applications-part-1-a-database-is-not-a-queue/) 
+  and [Part Two](http://blog.thecodepath.com/2013/01/06/asynchronous-processing-in-web-applications-part-2-developers-need-to-understand-message-queues/)
+  are great reads for understanding what a task queue is and why you
+  shouldn't use your database as one.
+
+* [A 4 Minute Intro to Celery](https://www.youtube.com/watch?v=68QWZU_gCDA) is
+  a short introductory task queue screencast.
+
+
+## Task queue learning checklist
+<i class="fa fa-check-square-o"></i> 
+Pick a slow function in your project that is called during an HTTP request.
+
+<i class="fa fa-check-square-o"></i> 
+Determine if you can precompute the results on a fixed interval instead of
+during the HTTP request. If so, create a separate function you can call
+from elsewhere then store the precomputed value in the database.
+
+<i class="fa fa-check-square-o"></i> 
+Read the Celery documentation and the links in the resources section above
+to understand how the project works.
+
+<i class="fa fa-check-square-o"></i> 
+Install a message broker such as RabbitMQ or Redis and then add Celery to your 
+project. Configure Celery to work with the installed message broker.
+
+<i class="fa fa-check-square-o"></i> 
+Use Celery to invoke the function from step one on a regular basis.
+
+<i class="fa fa-check-square-o"></i>
+Have the HTTP request function use the precomputed value instead of the 
+slow running code it originally relied upon.
+ 
+
+### What's next after task queues?
