|
| 1 | +title: Data analysis |
| 2 | +category: page |
| 3 | +slug: data-analysis |
| 4 | +sortorder: 0917 |
| 5 | +toc: False |
| 6 | +sidebartitle: Data analysis |
| 7 | +meta: Data analysis is a broad set of activities that involves cleaning, processing, transforming and understanding a data collection. |
| 8 | + |
| 9 | + |
| 10 | +# Data analysis |
| 11 | +Data analysis involves a broad set of activities to clean, process and |
| 12 | +transform a data collection to learn from it. Python is commonly used |
| 13 | +as a programming language to perform data analysis because many tools, |
| 14 | +such as [Jupyter Notebook](/jupyter-notebook.html), |
| 15 | +[pandas](/pandas.html) and [Bokeh](/bokeh.html), are written in Python |
| 16 | +and can be quickly applied rather than coding your own data analysis |
| 17 | +libraries from scratch. |
| 18 | + |
| 19 | + |
| 20 | +### Data analysis resources |
| 21 | +* The following series on data exploration uses Python as the |
| 22 | + implementation language while walking through various stages of |
| 23 | + how to analyze a data set. |
| 24 | + |
| 25 | + * [Part 1](http://blog.districtdatalabs.com/data-exploration-with-python-1) |
| 26 | + gives insight into how you should think about data and clarify |
| 27 | + what you are looking to learn. |
| 28 | + * [Part 2](http://blog.districtdatalabs.com/data-exploration-with-python-2) |
| 29 | + explains categorization and transforming a data set into one that |
| 30 | + is easier to analyze. |
| 31 | + * [Part 3](http://blog.districtdatalabs.com/data-exploration-with-python-3) |
| 32 | + shows how to visualize the results of your data exploration. |
| 33 | + |
| 34 | +* [The Python Data Science Handbook](https://jakevdp.github.io/PythonDataScienceHandbook/) |
| 35 | + is available to read for free online, although I also recommend |
| 36 | + buying the book as it is a great resource for learning the topic. |
| 37 | + |
| 38 | +* [PyData TV](https://www.youtube.com/user/PyDataTV) contains all the |
| 39 | + videos from the PyData conference series. The conference talks are |
| 40 | + often given by professional data scientists and the developers who |
| 41 | + write these analysis libraries, so there is a wealth of information |
| 42 | + not necessarily captured anywhere else. |
| 43 | + |
0 commit comments