• Log in
Anwen  Share and Create
  • Book
  • Movies
  • Music
  • SF
  • Goodlink
  • Asks
  • Eyeopen
  • Create

Tatoeba Translation Challenge Data (Storing in IPFS/Filecoin)

Sharer: 柏舟 September 19, 2020 at 8:38 am

A new challenge set for machine translation covering over 500 languages and thousands of language pairs.

License

These data are released under this licensing scheme:

CC-BY-NC-SA CC-BY-NC-SA 4.0 license

Tips: If you provide a dataset link in the article, we will help store it in filecoin network, for example:

https://object.pouta.csc.fi/Tatoeba-Challenge/eng-fra.tar

  • Size: 19484078080 bytes or 18.15 GiB
  • lotus client retrieve --miner=t020742 QmYAVeayHyZ6VmD1g5ef8tUN4xJJFkDszbMwMmMHQ8h8zc eng-fra.tar
  • lotus client retrieve --miner=t03275 QmYAVeayHyZ6VmD1g5ef8tUN4xJJFkDszbMwMmMHQ8h8zc eng-fra.tar
  • lotus client retrieve --miner=t05317 QmYAVeayHyZ6VmD1g5ef8tUN4xJJFkDszbMwMmMHQ8h8zc eng-fra.tar

  • Size: 19484078080 bytes or 18.15 GiB

  • lotus client retrieve --miner=t020742 QmYAVeayHyZ6VmD1g5ef8tUN4xJJFkDszbMwMmMHQ8h8zc https://object.pouta.csc.fi/Tatoeba-Challenge/eng-fra.tar

  • lotus client retrieve --miner=t03275 QmYAVeayHyZ6VmD1g5ef8tUN4xJJFkDszbMwMmMHQ8h8zc https://object.pouta.csc.fi/Tatoeba-Challenge/eng-fra.tar

  • lotus client retrieve --miner=t05317 QmYAVeayHyZ6VmD1g5ef8tUN4xJJFkDszbMwMmMHQ8h8zc https://object.pouta.csc.fi/Tatoeba-Challenge/eng-fra.tar

https://object.pouta.csc.fi/Tatoeba-Challenge/eng-spa.tar

  • Size: 13768458240 bytes or 12.82 GiB
  • lotus client retrieve --miner=t016563 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm eng-spa.tar
  • lotus client retrieve --miner=t03339 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm eng-spa.tar
  • lotus client retrieve --miner=t05317 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm eng-spa.tar
  • lotus client retrieve --miner=t03275 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm eng-spa.tar
  • lotus client retrieve --miner=t01272 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm eng-spa.tar

  • Size: 13768458240 bytes or 12.82 GiB

  • lotus client retrieve --miner=t016563 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm https://object.pouta.csc.fi/Tatoeba-Challenge/eng-spa.tar

  • lotus client retrieve --miner=t03339 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm https://object.pouta.csc.fi/Tatoeba-Challenge/eng-spa.tar

  • lotus client retrieve --miner=t05317 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm https://object.pouta.csc.fi/Tatoeba-Challenge/eng-spa.tar

  • lotus client retrieve --miner=t03275 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm https://object.pouta.csc.fi/Tatoeba-Challenge/eng-spa.tar

  • lotus client retrieve --miner=t01272 QmP8DLCKrdLQ71qhxsyjGgCbzxUwccPyWRfzUtd9WE9FDm https://object.pouta.csc.fi/Tatoeba-Challenge/eng-spa.tar

https://object.pouta.csc.fi/Tatoeba-Challenge/eng-por.tar

  • Size: 7254394880 bytes or 6.76 GiB
  • lotus client retrieve --miner=t01782 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N eng-por.tar
  • lotus client retrieve --miner=t016563 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N eng-por.tar
  • lotus client retrieve --miner=t01272 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N eng-por.tar
  • lotus client retrieve --miner=t03339 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N eng-por.tar
  • lotus client retrieve --miner=t020742 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N eng-por.tar

  • Size: 7254394880 bytes or 6.76 GiB

  • lotus client retrieve --miner=t01782 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N https://object.pouta.csc.fi/Tatoeba-Challenge/eng-por.tar

  • lotus client retrieve --miner=t016563 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N https://object.pouta.csc.fi/Tatoeba-Challenge/eng-por.tar

  • lotus client retrieve --miner=t01272 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N https://object.pouta.csc.fi/Tatoeba-Challenge/eng-por.tar

  • lotus client retrieve --miner=t03339 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N https://object.pouta.csc.fi/Tatoeba-Challenge/eng-por.tar

  • lotus client retrieve --miner=t020742 QmPUBRhE3UJUxLhb1dHhCvMo1W82DcKHrjx3GXoYKTgA5N https://object.pouta.csc.fi/Tatoeba-Challenge/eng-por.tar

https://object.pouta.csc.fi/Tatoeba-Challenge/eng-rus.tar

  • Size: 5759713280 bytes or 5.36 GiB
  • lotus client retrieve --miner=t03275 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU eng-rus.tar
  • lotus client retrieve --miner=t05317 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU eng-rus.tar
  • lotus client retrieve --miner=t03223 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU eng-rus.tar
  • lotus client retrieve --miner=t02622 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU eng-rus.tar
  • lotus client retrieve --miner=t01782 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU eng-rus.tar
  • lotus client retrieve --miner=t03339 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU eng-rus.tar

  • Size: 5759713280 bytes or 5.36 GiB

  • lotus client retrieve --miner=t03275 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU https://object.pouta.csc.fi/Tatoeba-Challenge/eng-rus.tar

  • lotus client retrieve --miner=t05317 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU https://object.pouta.csc.fi/Tatoeba-Challenge/eng-rus.tar

  • lotus client retrieve --miner=t03223 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU https://object.pouta.csc.fi/Tatoeba-Challenge/eng-rus.tar

  • lotus client retrieve --miner=t02622 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU https://object.pouta.csc.fi/Tatoeba-Challenge/eng-rus.tar

  • lotus client retrieve --miner=t01782 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU https://object.pouta.csc.fi/Tatoeba-Challenge/eng-rus.tar

  • lotus client retrieve --miner=t03339 QmZPnW6UM6DWrrBbn1wE5H2xsi5qcXuXHQ1GJpRsYAGiQU https://object.pouta.csc.fi/Tatoeba-Challenge/eng-rus.tar

https://object.pouta.csc.fi/Tatoeba-Challenge/eng-zho.tar

  • Size: 3668695040 bytes or 3.42 GiB
  • lotus client retrieve --miner=t07998 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH eng-zho.tar
  • lotus client retrieve --miner=t08403 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH eng-zho.tar
  • lotus client retrieve --miner=t02387 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH eng-zho.tar
  • lotus client retrieve --miner=t016563 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH eng-zho.tar
  • lotus client retrieve --miner=t019437 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH eng-zho.tar

  • Size: 3668695040 bytes or 3.42 GiB

  • lotus client retrieve --miner=t07998 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH https://object.pouta.csc.fi/Tatoeba-Challenge/eng-zho.tar

  • lotus client retrieve --miner=t08403 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH https://object.pouta.csc.fi/Tatoeba-Challenge/eng-zho.tar

  • lotus client retrieve --miner=t02387 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH https://object.pouta.csc.fi/Tatoeba-Challenge/eng-zho.tar

  • lotus client retrieve --miner=t016563 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH https://object.pouta.csc.fi/Tatoeba-Challenge/eng-zho.tar

  • lotus client retrieve --miner=t019437 QmNxRjDdjAbXYaV9xp2yJC16eEpv81GuXcfo22pkdkUcjH https://object.pouta.csc.fi/Tatoeba-Challenge/eng-zho.tar

Reference: - https://github.com/Helsinki-NLP/Tatoeba-Challenge/blob/master/Data.md



Tips: Until now, everytime you want to store your article, we will help you store it in Filecoin network. In the future, you can store it in Filecoin network using your own filecoin.

You can retrieve the markdown file of this article by running:
`lotus client retrieve --miner=t016594 bafk2bzacebe35woynofjg7cw4h32xh7vniafbsc22kp5xkrgumsx7dbnxqn7o 73855_bafk2b_t016594.md`
`lotus client retrieve --miner=t014394 bafk2bzaceanhi7hzmxfkridmared2k3pperj3r3ydes34g3mlvl5gut7x4i2g 73855_bafk2b_t014394.md`
`lotus client retrieve --miner=t014394 bafk2bzacebieehop4e3qjsfhapvhewi4plmqtijsknyiou475qi632v542zgq 73855_bafk2b_t014394.md`
`lotus client retrieve --miner=t08371 bafk2bzacecwdqz3samkep3gnqp55eaqsnyxwrlnq3mnkot4nqiy3op4jg6ifc 73855_bafk2b_t08371.md`
`lotus client retrieve --miner=t07998 bafk2bzacecwdqz3samkep3gnqp55eaqsnyxwrlnq3mnkot4nqiy3op4jg6ifc 73855_bafk2b_t07998.md`

Support author:
Author's Filecoin address:
Or you can use Likecoin to support author:
tags:dataset

0 0

2012-2018 Anwen All of our posts are default licensed under CC BY 4.0 About Help Changelog Telegram
Today Quote: 许多有名的作家,都是每天早上安排3-4小时的写作,一天的其余时间进行散步、通信、午睡和其他智力要求较低的活动。 --《早晨写作》