Merged
62 changes: 31 additions & 31 deletions BigDataandCloud.md
@@ -1,43 +1,28 @@
##Big Data and Cloud

1. nova
OpenStack Nova provides a cloud computing fabric controller, supporting a wide variety of virtualization technologies, including KVM, Xen, LXC, VMware, and more.
Project Source: https://github.com/openstack/nova
Project Homepage: http://www.openstack.org/
1. aws-cli
Universal Command Line Interface for Amazon Web Services
Project Source: https://github.com/aws/aws-cli

1. boto
Boto is a Python package that provides interfaces to Amazon Web Services.
Project Source: https://github.com/boto/boto
Project Documentation: http://docs.pythonboto.org/en/latest/

1. aws-cli
Universal Command Line Interface for Amazon Web Services
Project Source: https://github.com/aws/aws-cli

1. mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services.
Project Source: https://github.com/Yelp/mrjob
Project Documentation: https://pythonhosted.org/mrjob/

1. s3cmd
Command line tool for managing Amazon S3 and CloudFront services.
Project Source: https://github.com/s3tools/s3cmd
Project Homepage: http://s3tools.org/s3cmd

1. dumbo
Python module that allows one to easily write and run Hadoop programs.
Project Source: https://github.com/klbostee/dumbo
Project Homepage: http://klbostee.github.io/dumbo/

1. dpark
DPark is a Python clone of Spark, a MapReduce-like computing framework supporting iterative computation.
Project Source: https://github.com/douban/dpark
Project Documentation (Chinese): https://github.com/jackfengji/test_pro/wiki

1. Hue
Hue is an open source Web UI for doing big data with Hadoop.
Project Source: https://github.com/cloudera/hue
Project Homepage: http://gethue.com/
1. dumbo
Python module that allows one to easily write and run Hadoop programs.
Project Source: https://github.com/klbostee/dumbo
Project Homepage: http://klbostee.github.io/dumbo/

1. Fabulous
Deploy Django apps to Amazon EC2 with ONE command.
Project Source: https://github.com/gcollazo/Fabulous
Project Homepage: http://elweb.co/

1. falcon
Falcon is a high-performance Python framework for building cloud APIs and web app backends.
@@ -54,17 +39,32 @@ a gae proxy
Project Source: https://github.com/goagent/goagent
Project Homepage: https://code.google.com/p/goagent/

1. Hue
Hue is an open source Web UI for doing big data with Hadoop.
Project Source: https://github.com/cloudera/hue
Project Homepage: http://gethue.com/

1. libcloud
Apache Libcloud is a Python library which hides differences between different cloud provider APIs and allows you to manage different cloud resources through a unified and easy to use API.
Project Source: https://github.com/apache/libcloud
Project Documentation: http://libcloud.apache.org/

1. Fabulous
Deploy Django apps to Amazon EC2 with ONE command.
Project Source: https://github.com/gcollazo/Fabulous
Project Homepage: http://elweb.co/
1. mrjob
Run MapReduce jobs on Hadoop or Amazon Web Services.
Project Source: https://github.com/Yelp/mrjob
Project Documentation: https://pythonhosted.org/mrjob/

1. nova
OpenStack Nova provides a cloud computing fabric controller, supporting a wide variety of virtualization technologies, including KVM, Xen, LXC, VMware, and more.
Project Source: https://github.com/openstack/nova
Project Homepage: http://www.openstack.org/

1. pyes
pyes is a pythonic way to use ElasticSearch since 2010.
Project Source: https://github.com/aparo/pyes
Project Documentation: https://pypi.python.org/pypi/pyes/

1. s3cmd
Command line tool for managing Amazon S3 and CloudFront services.
Project Source: https://github.com/s3tools/s3cmd
Project Homepage: http://s3tools.org/s3cmd