Jun Zhu
The purpose of this project is to demonstrate the collection and cleaning of source data and produce a tidy data set that can be used for subsequent analysis. A full description of the data used in this project can be found at The UCI Machine Learning Repository
The source data for this project can be found here.
This script is assumed to be ruined at default R Programming work directory in R Studio.
You should create one R script called run_analysis.R that does the following:
- Load activity labels and features;
- Extracts only the data on the mean and standard deviation for each measurement;
- Load the datasets from "train" and "test" folder;
- merge datasets and add labels;
- turn activities & subjects into factors;
- produce tidy file named, "tidy.txt”.
You can find additional information about the variables, data and transformations in the CodeBook.MD file.