diff --git a/ChID/OpenDataLab___ChID/raw/ChID.tar.gz b/ChID.tar.gz
similarity index 98%
rename from ChID/OpenDataLab___ChID/raw/ChID.tar.gz
rename to ChID.tar.gz
index 90a4dcc..45ed690 100644
Binary files a/ChID/OpenDataLab___ChID/raw/ChID.tar.gz and b/ChID.tar.gz differ
diff --git a/ChID/OpenDataLab___ChID/README.md b/ChID/OpenDataLab___ChID/README.md
deleted file mode 100644
index d0096fa..0000000
--- a/ChID/OpenDataLab___ChID/README.md
+++ /dev/null
@@ -1,14 +0,0 @@
- ## Introduction
- ChID is a large-scale Chinese idiom dataset for cloze tests. ChID contains 581K passages and 729K blanks, covering multiple domains. In ChID, the idioms in each passage are replaced with blank symbols. For each blank, a list of candidate idioms, including the golden idiom, is provided as choices.
- ## Class Definitions
- null
- ## Citation
- ```
-@article{zheng2019chid,
- title={ChID: A large-scale Chinese IDiom dataset for cloze test},
- author={Zheng, Chujie and Huang, Minlie and Sun, Aixin},
- journal={arXiv preprint arXiv:1906.01265},
- year={2019}
-}
-```
-
\ No newline at end of file
diff --git a/ChID/OpenDataLab___ChID/metafile.yaml b/ChID/OpenDataLab___ChID/metafile.yaml
deleted file mode 100644
index 6128fe4..0000000
--- a/ChID/OpenDataLab___ChID/metafile.yaml
+++ /dev/null
@@ -1,20 +0,0 @@
-displayName: ChID(Chinese IDiom dataset)
-labelTypes:
-- Chinese Corpus
-license:
-- Apache 2.0
-mediaTypes:
-- Text
-paperUrl: https://arxiv.org/pdf/1906.01265v3.pdf
-publishDate: "2019-01-01"
-publishUrl: https://github.com/chujiezheng/ChID-Dataset
-publisher:
-- Nanyang Technological University
-- Tsinghua University
-- Beijing National Research Center for Information Science and Technology
-tags: []
-taskTypes:
-- Machine Reading Comprehension
-- Reading Comprehension
-- Language Modelling
-
\ No newline at end of file