
Some Kaggle datasets stored in IPFS/Filecoin

Shared by 柏舟 on November 11, 2020 at 2:06 pm

already stored in the Filecoin network

  • https://www.kaggle.com/mozillaorg/common-voice 12 GB, CC0, archive_20201110T055143Z_5793_9812_common_voice.zip
  • https://www.kaggle.com/starktony45/image-dataset 13 GB, CC0, archive_20201110T060233Z_cocotrain2014.zip
  • https://www.kaggle.com/peterhu/speech_data 15 GB, CC0, archive_20201110T060817Z_chinese_speech_data.zip
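The post does not say how these archives were produced or uploaded. As a rough sketch of one possible workflow, the Python below turns a dataset URL from the list into the two shell commands one might run: download the archive with the Kaggle CLI, then add it to a local IPFS node. The helper names, the `downloads` directory and the `<dataset-name>.zip` file name are assumptions for illustration, not the author's actual process, and making a Filecoin storage deal is left out because it depends on the client in use.

```python
import shlex
from typing import List
from urllib.parse import urlparse


def dataset_slug(url: str) -> str:
    """Return the 'owner/name' slug from a Kaggle dataset URL."""
    parts = [p for p in urlparse(url).path.split("/") if p]
    # Newer Kaggle URLs carry a leading "datasets" path segment; drop it.
    if parts and parts[0] == "datasets":
        parts = parts[1:]
    if len(parts) < 2:
        raise ValueError(f"not a dataset URL: {url}")
    return f"{parts[0]}/{parts[1]}"


def build_commands(url: str, workdir: str = "downloads") -> List[List[str]]:
    """Build (but do not run) the download and IPFS-add commands."""
    slug = dataset_slug(url)
    # Assumption: the Kaggle CLI saves the archive as <dataset-name>.zip.
    archive = f"{workdir}/{slug.split('/')[1]}.zip"
    return [
        ["kaggle", "datasets", "download", "-d", slug, "-p", workdir],
        # -Q makes `ipfs add` print only the final content identifier.
        ["ipfs", "add", "-Q", archive],
    ]


if __name__ == "__main__":
    for cmd in build_commands("https://www.kaggle.com/mozillaorg/common-voice"):
        print(shlex.join(cmd))
```

The commands are only printed, so they can be reviewed before anything is downloaded; running them requires a configured Kaggle API token and a running IPFS node.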

in progress

  • https://www.kaggle.com/aishwr/coco2017 19 GB, © Original Authors
  • https://www.kaggle.com/omeret/not-safe-for-work 19 GB, GPL 2
  • https://www.kaggle.com/fedorshakhovskiy/magic-kids 17 GB, openimages280
  • https://www.kaggle.com/kenshoresearch/kensho-derived-wikimedia-data 8 GB, CC BY-SA
  • https://www.kaggle.com/hsankesara/flickr-image-dataset 8 GB, CC0
  • https://www.kaggle.com/paulrohan2020/huge-books-in-plain-text-for-train-language-models 2 GB, CC0
  • https://www.kaggle.com/ikarus777/best-artworks-of-all-time 2 GB, CC BY-NC-SA 4.0
  • https://www.kaggle.com/jacksoncrow/wikipedia-multimodal-dataset-of-good-articles 2 GB, CC0
  • https://www.kaggle.com/alvations/old-newspapers 2 GB, CC0
  • https://www.kaggle.com/dorianlazar/medium-articles-dataset 1 GB, CC0

todo

  • https://www.kaggle.com/reddit/reddit-comments-may-2015 20 GB, api
  • https://www.kaggle.com/warmth/wmt18 23 GB
  • https://www.kaggle.com/bryanpark/the-world-english-bible-speech-dataset 10 GB, CC BY-NC-SA
  • https://www.kaggle.com/chrisfilo/fruit-recognition 8 GB, CC BY
  • https://www.kaggle.com/bryanpark/chinese-single-speaker-speech-dataset 2 GB, CC0
  • https://www.kaggle.com/raynardj/zh-wenyanwen-wikisource 2 GB, CC
  • https://www.kaggle.com/hsankesara/flickr-image-dataset 4 GB, CC0
  • https://www.kaggle.com/facebook/fatsttext-common-crawl 4 GB, CC0
  • https://www.kaggle.com/crawford/emnist 1 GB, CC0

todo

  • https://www.kaggle.com/jkkphys/english-wikipedia-articles-20170820-sqlite 20 GB, CC BY-SA 3.0
  • https://www.kaggle.com/landlord/handwriting-recognition CC0
  • https://www.kaggle.com/sabermalek/iranian-traditional-music CC BY
  • https://www.kaggle.com/mathurinache/social-iq GPL 2
  • https://www.kaggle.com/evgeniumakov/images4k
  • https://www.kaggle.com/wanghaohan/imagenetsketch
  • https://www.kaggle.com/vic006/beginner
  • https://www.kaggle.com/xhlulu/huggingface-bert 24 GB
  • https://www.kaggle.com/google/tinyquickdraw 11 GB
  • https://www.kaggle.com/skylord/coronawhy 13 GB
  • https://www.kaggle.com/abhishek/gpt2-pytorch 10 GB
  • https://www.kaggle.com/yelp-dataset/yelp-dataset 4 GB
  • https://www.kaggle.com/kmader/food41 5 GB
  • https://www.kaggle.com/mittalshubham/spoken-languages 14 GB
  • https://www.kaggle.com/ashirwadsangwan/imdb-dataset 1 GB

ref

  • https://www.kaggle.com/datasets


tags:dataset


2012-2018 Anwen. All posts are licensed under CC BY 4.0 by default.