some dataset in Kaggle stored in IPFS/Filecoin
already stored in Filecoin Network by us
done
- https://www.kaggle.com/aishwr/coco2017 Data files © Original Authors 19GB
- https://www.kaggle.com/awsaf49/coco-2017-dataset CC0: Public Domain
- https://www.kaggle.com/omeret/not-safe-for-work GPL 2 19GB
- https://www.kaggle.com/fedorshakhovskiy/magic-kids openimages280 17G
- https://www.kaggle.com/imsparsh/musicnet-dataset CC0: Public Domain
- https://www.kaggle.com/timoboz/clevr-dataset CC BY 4.0
- https://www.kaggle.com/arnaud58/flickrfaceshq-dataset-ffhq CC0: Public Domain 19GB
- https://www.kaggle.com/prasoonkottarathil/face-mask-lite-dataset CC BY-SA 4.0 23GB
- https://www.kaggle.com/mathurinache/social-iq GPL 2 30GB
- https://www.kaggle.com/tapakah68/audio-dataset CC BY-ND 24GB score=9.4
- https://www.kaggle.com/sivaprasads/fashion-dataset. CC0 19G
done2
- https://www.kaggle.com/mozillaorg/common-voice CC0 12GB
- https://www.kaggle.com/starktony45/image-dataset CC0 13GB
- https://www.kaggle.com/peterhu/speech_data CC0 15GB
- https://www.kaggle.com/tunguz/xview2-challenge-dataset-tier-3-data CC BY-NC-SA 4.0 17GB
ing
- https://www.kaggle.com/usharengaraju/pandaset-dataset CC0: Public Domain. 31GB score=8.0
- https://www.kaggle.com/zfturbo/audioset CC BY-SA 29GB score=7.6
- https://www.kaggle.com/leighplt/glove-reddit-comments Apache2 24G score=10
- https://www.kaggle.com/reddit/reddit-comments-may-2015 api 20G
- https://www.kaggle.com/warmth/wmt18 23G
- https://www.kaggle.com/vepnar/nft-art-dataset Original Authors 32G
- https://www.kaggle.com/jkkphys/english-wikipedia-articles-20170820-sqlite 20G CC BY-SA 3.0
- https://www.kaggle.com/xhlulu/huggingface-bert 24G
- https://www.kaggle.com/imsparsh/fma-free-music-archive-small-medium CC0: Public Domain 32GB
- https://www.kaggle.com/georgemac510/top-100-crypto-dataset none 19G score=4.4
- https://www.kaggle.com/tunguz/1-million-fake-faces CC BY-NC 17G
- https://www.kaggle.com/sabermalek/iranian-traditional-music CC BY 4.0. 16GB
- https://www.kaggle.com/kenshoresearch/kensho-derived-wikimedia-data cc-by-sa 8G
- https://www.kaggle.com/hsankesara/flickr-image-dataset CC0 8G
- https://www.kaggle.com/paulrohan2020/huge-books-in-plain-text-for-train-language-models 2GB CC0
- https://www.kaggle.com/ikarus777/best-artworks-of-all-time 2GB CC BY-NC-SA 4.0
- https://www.kaggle.com/jacksoncrow/wikipedia-multimodal-dataset-of-good-articles CC0 2G
- https://www.kaggle.com/alvations/old-newspapers CC0 2G
- https://www.kaggle.com/dorianlazar/medium-articles-dataset 1GB CC0
todo
- https://www.kaggle.com/carlfm01/120h-spanish-speech CC0 13GB score=8.2
- https://www.kaggle.com/brkurzawa/original-150-pokemon-image-search-results GPL2 9GB score=8.8
- https://www.kaggle.com/bryanpark/the-world-english-bible-speech-dataset cc-by-nc-sa 10G
- https://www.kaggle.com/chrisfilo/fruit-recognition cc-by 8G
- https://www.kaggle.com/bryanpark/chinese-single-speaker-speech-dataset 2G CC0
- https://www.kaggle.com/raynardj/zh-wenyanwen-wikisource 2GB CC
- https://www.kaggle.com/hsankesara/flickr-image-dataset 4GB CC0
- https://www.kaggle.com/facebook/fatsttext-common-crawl 4G CC0
- https://www.kaggle.com/crawford/emnist CC0 1G
todo
- https://www.kaggle.com/landlord/handwriting-recognition CC0
- https://www.kaggle.com/sabermalek/iranian-traditional-music cc-by
- https://www.kaggle.com/evgeniumakov/images4k
- https://www.kaggle.com/wanghaohan/imagenetsketch
- https://www.kaggle.com/vic006/beginner
- https://www.kaggle.com/google/tinyquickdraw 11G
- https://www.kaggle.com/skylord/coronawhy 13G
- https://www.kaggle.com/abhishek/gpt2-pytorch 10G
- https://www.kaggle.com/yelp-dataset/yelp-dataset 4G
- https://www.kaggle.com/kmader/food41 5G
- https://www.kaggle.com/mittalshubham/spoken-languages 14G
- https://www.kaggle.com/ashirwadsangwan/imdb-dataset 1G
- https://www.kaggle.com/vaibhao/fashiondatacolor-images 13GB
- https://www.kaggle.com/paramaggarwal/fashion-product-images-dataset 15GB
- https://www.kaggle.com/crawford/qureai-headct 38GB
ref
Tips: Until now, everytime you want to store your article, we will help you store it in Filecoin network. In the future, you can store it in Filecoin network using your own filecoin.
Support author:
Author's Filecoin address:
Or you can use Likecoin to support author: