Resources

Public Data Sets

Public Data Sets on AWS provides a centralized repository of public data sets that can be seamlessly integrated into AWS cloud-based applications. AWS is hosting the public data sets at no charge for the community, and like all AWS services, users pay only for the compute and storage they use for their own applications. Learn more about Public Data Sets on AWS and visit the Public Data Sets forum.

Jay Flatley (CEO of Illumina) human genome data set.

Complete genome sequence data for three Yoruba individuals from Ibadan, Nigeria

The Sloan Digital Sky Survey is the most ambitious astronomical survey ever undertaken.