This repository has been archived by the owner on Jan 9, 2023. It is now read-only.

Export more than the first 1000 issues #42

Open
RyanCavanaugh opened this issue May 19, 2021 · 5 comments
Assignees
Labels
bug Something isn't working

Comments

@RyanCavanaugh

I ran this on microsoft/TypeScript and was surprised to see only a 1.6 MB file produced, since I know we have much more content than that. The generated JSON includes only 1,000 issues, so it misses the other ~95% of the issues that have been filed in our repo.

@Chocrates
Contributor

Thanks for the report, @RyanCavanaugh! That seems bad! I will dig into this and see what is going on.

@Chocrates Chocrates added the bug Something isn't working label May 19, 2021
@Chocrates
Contributor

I am able to repro this:
./bin/run search:issues --token <token> --owner microsoft --repo typescript --format JSON --since=2020-06-01 > typescript.json

@Chocrates Chocrates self-assigned this May 19, 2021
@Chocrates
Contributor

Looks like the GraphQL query gets to page 10 and then can't find the next page (hasNextPage is false). Debugging this to see what is happening.
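
The stopping behavior described here can be reproduced with a small simulation. This is only a sketch, not the tool's code: `fake_search_page` is a hypothetical stand-in for the real search query, and the page size of 100 and the 1,000-result ceiling are assumptions inferred from the numbers in this thread (10 pages, 1,000 issues).

```python
from typing import Optional, Tuple

PAGE_SIZE = 100      # assumed page size (10 pages -> 1,000 issues)
RESULT_CAP = 1_000   # assumed ceiling on how many results search will serve


def fake_search_page(total_matches: int, cursor: Optional[int]) -> dict:
    """Hypothetical stand-in for one page of a search query."""
    start = cursor or 0
    reachable = min(total_matches, RESULT_CAP)
    end = min(start + PAGE_SIZE, reachable)
    return {
        "nodes": list(range(start, end)),
        "pageInfo": {"hasNextPage": end < reachable, "endCursor": end},
    }


def export_via_search(total_matches: int) -> Tuple[int, int]:
    """Follow hasNextPage until it is false; return (pages, issues) fetched."""
    pages, issues, cursor = 0, 0, None
    while True:
        page = fake_search_page(total_matches, cursor)
        pages += 1
        issues += len(page["nodes"])
        if not page["pageInfo"]["hasNextPage"]:
            return pages, issues
        cursor = page["pageInfo"]["endCursor"]


# Even with 20,000 matching issues, the loop ends after page 10.
print(export_via_search(total_matches=20_000))  # (10, 1000)
```

Under these assumptions the loop exits cleanly rather than failing, which is why the export looks successful while silently dropping everything past the ceiling.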

@RyanCavanaugh
Author

RyanCavanaugh commented May 19, 2021

The Search API is documented to only return the first 1,000 results.

I've written a similar tool for bulk export and AFAIK the only alternative is to go through the Issues graph instead of the Search graph.
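
A minimal sketch of that alternative, paging through a repository's `issues` connection instead of `search`. The query uses the `repository`, `issues`, `pageInfo`, and `nodes` fields of GitHub's GraphQL API; the selected issue fields, the name `fetch_all_issues`, and the injected `run_query` callable are illustrative assumptions and not code from this tool. `run_query` is assumed to send the query and variables to the GraphQL endpoint with a token and return the `data` object of the response.

```python
from typing import Callable, Dict, Iterator, Optional

# Pages through the repository's issues connection, 100 issues at a time.
ISSUES_QUERY = """
query($owner: String!, $name: String!, $cursor: String) {
  repository(owner: $owner, name: $name) {
    issues(first: 100, after: $cursor) {
      pageInfo { hasNextPage endCursor }
      nodes { number title createdAt }
    }
  }
}
"""

# Hypothetical transport: (query, variables) -> the response's "data" object.
RunQuery = Callable[[str, Dict[str, Optional[str]]], dict]


def fetch_all_issues(run_query: RunQuery, owner: str, name: str) -> Iterator[dict]:
    """Yield every issue node, following endCursor until hasNextPage is false."""
    cursor: Optional[str] = None
    while True:
        variables = {"owner": owner, "name": name, "cursor": cursor}
        connection = run_query(ISSUES_QUERY, variables)["repository"]["issues"]
        yield from connection["nodes"]
        if not connection["pageInfo"]["hasNextPage"]:
            return
        cursor = connection["pageInfo"]["endCursor"]
```

Passing the transport in as a parameter keeps the paging logic independent of any particular HTTP client, so it can be exercised with a stub before pointing it at a large repository.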

@Chocrates
Contributor

I opened #45 to discuss how we want to solve this issue, since it will require a bit of re-architecting.


2 participants