BigData.Resources History

Added lines 11-12:

Data.World https://data.world/
Added lines 11-12:

Data for Everyone https://www.crowdflower.com/data-for-everyone/
May 18, 2016, at 11:36 AM EST by 64.106.39.101 -
Added line 13:
May 18, 2016, at 11:36 AM EST by 64.106.39.101 -
Added line 12:
Crisis Text Line http://www.fastcompany.com/3056936/crisis-text-line-is-opening-its-treasure-trove-of-data-to-researchers
Added lines 11-14:

Yahoo datasets http://yahoolabs.tumblr.com/post/137281912191/yahoo-releases-the-largest-ever-machine-learning

Kaggle datasets https://www.kaggle.com/datasets
Added lines 11-14:

Movie sentiment analysis http://ai.stanford.edu/~amaas/data/sentiment/

Public data http://enigma.io/
Added lines 11-20:

80 million tiny images http://horatio.cs.nyu.edu/mit/tiny/data/

Google public data http://www.google.com/publicdata/directory

Stanford Large Network Dataset Collection http://snap.stanford.edu/data/index.html

Wikipedia database https://en.wikipedia.org/wiki/Wikipedia:Database_download

OpenStreetMap http://planet.openstreetmap.org/
Changed lines 8-12 from:
!! Guidelines for final report & poster

[[http://www.cs.unm.edu/~estrada/teaching/trilce/index.php?n=ML.GuidelinesForFinalDelivery| Final report and poster]]


to:

Changed lines 34-39 from:
Book-Crossing Dataset: http://www2.informatik.uni-freiburg.de/~cziegler/BX/ (for recommender systems)
to:
Book-Crossing Dataset: http://www2.informatik.uni-freiburg.de/~cziegler/BX/ (for recommender systems)


!! Guidelines for final report & poster

[[http://www.cs.unm.edu/~estrada/teaching/trilce/index.php?n=ML.GuidelinesForFinalDelivery| Final report and poster]]
Added line 6:
* [[X11 port-forwarding in Galles]]
Changed lines 3-5 from:
[[Get your account in CARC]]
[[Run Hadoop in Galles]]
[[Run Mahout in Galles]]
to:
* [[Get your account in CARC]]
* [[Run Hadoop in Galles]]
* [[Run Mahout in Galles]]
Changed lines 1-7 from:
!! Guidelines fro final report & poster
to:
!! How to

[[Get your account in CARC]]
[[Run Hadoop in Galles]]
[[Run Mahout in Galles]]

!! Guidelines for final report & poster
Added lines 1-5:
!! Guidelines fro final report & poster

[[http://www.cs.unm.edu/~estrada/teaching/trilce/index.php?n=ML.GuidelinesForFinalDelivery| Final report and poster]]

Changed line 25 from:
 Book-Crossing Dataset: http://www2.informatik.uni-freiburg.de/~cziegler/BX/ (for recommender systems)
to:
Book-Crossing Dataset: http://www2.informatik.uni-freiburg.de/~cziegler/BX/ (for recommender systems)
Changed lines 23-25 from:
Amazon public datasets: http://aws.amazon.com/public-data-sets/
to:
Amazon public datasets: http://aws.amazon.com/public-data-sets/

 Book-Crossing Dataset: http://www2.informatik.uni-freiburg.de/~cziegler/BX/ (for recommender systems)
Changed lines 19-23 from:
Wireless networks: http://crawdad.cs.dartmouth.edu/about.php
to:
Wireless networks: http://crawdad.cs.dartmouth.edu/about.php

Friendster Social Network Dataset: https://archive.org/details/friendster-dataset-201107

Amazon public datasets: http://aws.amazon.com/public-data-sets/
July 31, 2014, at 01:40 PM EST by 64.106.39.101 -
Added lines 1-19:
!! Datasets

Newsgroups http://qwone.com/~jason/20Newsgroups/

Social network analysis http://www.growmeme.com/snas

Data.GOV http://catalog.data.gov/dataset

Webscope http://webscope.sandbox.yahoo.com/catalog.php

Gene NCBI http://www.ncbi.nlm.nih.gov/gene

DrugBank (open data drug & drug target database) http://www.drugbank.ca/databases

UCI http://archive.ics.uci.edu/ml/datasets.html

Random hacks of kindness http://www.rhok.org/problems

Wireless networks: http://crawdad.cs.dartmouth.edu/about.php