Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

download this collection #12

Open
bbei-z opened this issue Jun 16, 2023 · 3 comments
Open

download this collection #12

bbei-z opened this issue Jun 16, 2023 · 3 comments

Comments

@bbei-z
Copy link

bbei-z commented Jun 16, 2023

hi, when I download this collection of qrecc, it always returns an error of 503, so I want to know the size of the collection-paragraph that is splited collections into little. If it is not big enough, can you share it with us?

@wickcode
Copy link

wickcode commented Oct 2, 2024

were you able to resolve the issue? When you follow do you get 54M passages as mentioned?
@RavitejaAnantha @tuzhucheng

@tuzhucheng
Copy link
Contributor

Sorry about the late reply. You can find a pre-built collection of passages here on AWS S3: aws s3 ls s3://mt-qrecc/collection-paragraph/.

@hankcs
Copy link

hankcs commented Dec 19, 2024

@tuzhucheng Access Denied when ls your S3, could you confirm?

BTW, the raw web pages can be downloaded from Zenodo (passages.zip).

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants