• Log in
Anwen  Share and Create
  • Book
  • Movies
  • Music
  • SF
  • Goodlink
  • Asks
  • Eyeopen
  • Create

some dataset in Kaggle stored in IPFS/Filecoin

Sharer: 柏舟 November 11, 2020 at 2:06 pm

already stored in Filecoin Network by us

done

  • https://www.kaggle.com/aishwr/coco2017 Data files © Original Authors 19GB
  • https://www.kaggle.com/awsaf49/coco-2017-dataset CC0: Public Domain
  • https://www.kaggle.com/omeret/not-safe-for-work GPL 2 19GB
  • https://www.kaggle.com/fedorshakhovskiy/magic-kids openimages280 17G
  • https://www.kaggle.com/imsparsh/musicnet-dataset CC0: Public Domain
  • https://www.kaggle.com/timoboz/clevr-dataset CC BY 4.0
  • https://www.kaggle.com/arnaud58/flickrfaceshq-dataset-ffhq CC0: Public Domain 19GB
  • https://www.kaggle.com/prasoonkottarathil/face-mask-lite-dataset CC BY-SA 4.0 23GB
  • https://www.kaggle.com/mathurinache/social-iq GPL 2 30GB
  • https://www.kaggle.com/tapakah68/audio-dataset CC BY-ND 24GB score=9.4
  • https://www.kaggle.com/sivaprasads/fashion-dataset. CC0 19G

done2

  • https://www.kaggle.com/mozillaorg/common-voice CC0 12GB
  • https://www.kaggle.com/starktony45/image-dataset CC0 13GB
  • https://www.kaggle.com/peterhu/speech_data CC0 15GB
  • https://www.kaggle.com/tunguz/xview2-challenge-dataset-tier-3-data CC BY-NC-SA 4.0 17GB

ing

  • https://www.kaggle.com/usharengaraju/pandaset-dataset CC0: Public Domain. 31GB score=8.0
  • https://www.kaggle.com/zfturbo/audioset CC BY-SA 29GB score=7.6
  • https://www.kaggle.com/leighplt/glove-reddit-comments Apache2 24G score=10
  • https://www.kaggle.com/reddit/reddit-comments-may-2015 api 20G
  • https://www.kaggle.com/warmth/wmt18 23G
  • https://www.kaggle.com/vepnar/nft-art-dataset Original Authors 32G
  • https://www.kaggle.com/jkkphys/english-wikipedia-articles-20170820-sqlite 20G CC BY-SA 3.0
  • https://www.kaggle.com/xhlulu/huggingface-bert 24G
  • https://www.kaggle.com/imsparsh/fma-free-music-archive-small-medium CC0: Public Domain 32GB
  • https://www.kaggle.com/georgemac510/top-100-crypto-dataset none 19G score=4.4
  • https://www.kaggle.com/tunguz/1-million-fake-faces CC BY-NC 17G
  • https://www.kaggle.com/sabermalek/iranian-traditional-music CC BY 4.0. 16GB
  • https://www.kaggle.com/kenshoresearch/kensho-derived-wikimedia-data cc-by-sa 8G
  • https://www.kaggle.com/hsankesara/flickr-image-dataset CC0 8G
  • https://www.kaggle.com/paulrohan2020/huge-books-in-plain-text-for-train-language-models 2GB CC0
  • https://www.kaggle.com/ikarus777/best-artworks-of-all-time 2GB CC BY-NC-SA 4.0
  • https://www.kaggle.com/jacksoncrow/wikipedia-multimodal-dataset-of-good-articles CC0 2G
  • https://www.kaggle.com/alvations/old-newspapers CC0 2G
  • https://www.kaggle.com/dorianlazar/medium-articles-dataset 1GB CC0

todo

  • https://www.kaggle.com/carlfm01/120h-spanish-speech CC0 13GB score=8.2
  • https://www.kaggle.com/brkurzawa/original-150-pokemon-image-search-results GPL2 9GB score=8.8
  • https://www.kaggle.com/bryanpark/the-world-english-bible-speech-dataset cc-by-nc-sa 10G
  • https://www.kaggle.com/chrisfilo/fruit-recognition cc-by 8G
  • https://www.kaggle.com/bryanpark/chinese-single-speaker-speech-dataset 2G CC0
  • https://www.kaggle.com/raynardj/zh-wenyanwen-wikisource 2GB CC
  • https://www.kaggle.com/hsankesara/flickr-image-dataset 4GB CC0
  • https://www.kaggle.com/facebook/fatsttext-common-crawl 4G CC0
  • https://www.kaggle.com/crawford/emnist CC0 1G

todo

  • https://www.kaggle.com/landlord/handwriting-recognition CC0
  • https://www.kaggle.com/sabermalek/iranian-traditional-music cc-by
  • https://www.kaggle.com/evgeniumakov/images4k
  • https://www.kaggle.com/wanghaohan/imagenetsketch
  • https://www.kaggle.com/vic006/beginner
  • https://www.kaggle.com/google/tinyquickdraw 11G
  • https://www.kaggle.com/skylord/coronawhy 13G
  • https://www.kaggle.com/abhishek/gpt2-pytorch 10G
  • https://www.kaggle.com/yelp-dataset/yelp-dataset 4G
  • https://www.kaggle.com/kmader/food41 5G
  • https://www.kaggle.com/mittalshubham/spoken-languages 14G
  • https://www.kaggle.com/ashirwadsangwan/imdb-dataset 1G
  • https://www.kaggle.com/vaibhao/fashiondatacolor-images 13GB
  • https://www.kaggle.com/paramaggarwal/fashion-product-images-dataset 15GB
  • https://www.kaggle.com/crawford/qureai-headct 38GB

ref

  • https://www.kaggle.com/datasets


Tips: Until now, everytime you want to store your article, we will help you store it in Filecoin network. In the future, you can store it in Filecoin network using your own filecoin.


Support author:
Author's Filecoin address:
Or you can use Likecoin to support author:
tags:dataset

0 0

2012-2018 Anwen All of our posts are default licensed under CC BY 4.0 About Help Changelog Telegram
Today Quote: 极权统治的实质就是消除一切自发的政治生活,把社会中的人分裂成一个个的原子,其目的在于使每个人只能孤立地面对整个制度,从而使人感到形单影只,而且往往茫然若失,敢怒不敢言。 -- 米奇尼克