Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[PRE REVIEW]: BenchmarkDataNLP.jl: Synthetic Data Generation for NLP Benchmarking #7730

Open
editorialbot opened this issue Jan 27, 2025 · 7 comments
Labels
Julia pre-review TeX Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning

Comments

@editorialbot
Copy link
Collaborator

Submitting author: @mantzaris (Alexander V. Mantzaris)
Repository: https://github.com/mantzaris/BenchmarkDataNLP.jl
Branch with paper.md (empty if default branch): main
Version: v.1.0.0
Editor: Pending
Reviewers: Pending
Managing EiC: Chris Vernon

Status

status

Status badge code:

HTML: <a href="https://joss.theoj.org/papers/f2b3e3efb0fd234665e864d64551e5d8"><img src="https://joss.theoj.org/papers/f2b3e3efb0fd234665e864d64551e5d8/status.svg"></a>
Markdown: [![status](https://joss.theoj.org/papers/f2b3e3efb0fd234665e864d64551e5d8/status.svg)](https://joss.theoj.org/papers/f2b3e3efb0fd234665e864d64551e5d8)

Author instructions

Thanks for submitting your paper to JOSS @mantzaris. Currently, there isn't a JOSS editor assigned to your paper.

@mantzaris if you have any suggestions for potential reviewers then please mention them here in this thread (without tagging them with an @). You can search the list of people that have already agreed to review and may be suitable for this submission.

Editor instructions

The JOSS submission bot @editorialbot is here to help you find and assign reviewers and start the main review. To find out what @editorialbot can do for you type:

@editorialbot commands
@editorialbot editorialbot added pre-review Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning labels Jan 27, 2025
@editorialbot
Copy link
Collaborator Author

Hello human, I'm @editorialbot, a robot that can help you with some common editorial tasks.

For a list of things I can do to help you, just type:

@editorialbot commands

For example, to regenerate the paper pdf after making changes in the paper's md or bib files, type:

@editorialbot generate pdf

@editorialbot
Copy link
Collaborator Author

Reference check summary (note 'MISSING' DOIs are suggestions that need verification):

✅ OK DOIs

- None

🟡 SKIP DOIs

- No DOI given, and none found for title: Applying natural language processing techniques to...
- No DOI given, and none found for title: Generalized context-free grammars

❌ MISSING DOIs

- 10.1016/j.tcs.2016.05.030 may be a valid DOI for title: Survey: Finite-state technology in natural languag...
- 10.46298/arima.1956 may be a valid DOI for title: A survey of RDF storage approaches
- 10.1109/hpec58863.2023.10363447 may be a valid DOI for title: From words to watts: Benchmarking the energy costs...

❌ INVALID DOIs

- None

@editorialbot
Copy link
Collaborator Author

Software report:

github.com/AlDanial/cloc v 1.98  T=0.02 s (729.9 files/s, 112956.9 lines/s)
-------------------------------------------------------------------------------
Language                     files          blank        comment           code
-------------------------------------------------------------------------------
Julia                            8            498            434           1391
Markdown                         3             46              0            102
YAML                             2              7              0             63
TeX                              1             10              0             38
TOML                             2              4              0             17
Text                             1              5              0             16
-------------------------------------------------------------------------------
SUM:                            17            570            434           1627
-------------------------------------------------------------------------------

Commit count by author:

    39	mantzaris
     4	a.v.mantzaris
     2	fan

@editorialbot
Copy link
Collaborator Author

Paper file info:

📄 Wordcount for paper.md is 706

✅ The paper includes a Statement of need section

@editorialbot
Copy link
Collaborator Author

License info:

✅ License found: MIT License (Valid open source OSI approved license)

@editorialbot
Copy link
Collaborator Author

👉📄 Download article proof 📄 View article proof on GitHub 📄 👈

@editorialbot
Copy link
Collaborator Author

Five most similar historical JOSS papers:

Jury: A Comprehensive Evaluation Toolkit
Submitting author: @devrimcavusoglu
Handling editor: @crvernon (Active)
Reviewers: @evamaxfield, @KennethEnevoldsen
Similarity score: 0.7208

WordTokenizers.jl: Basic tools for tokenizing natural language in Julia
Submitting author: @oxinabox
Handling editor: @will-rowe (Retired)
Reviewers: @leios, @ninjin
Similarity score: 0.7150

DataDepsGenerators.jl: making reusing data easy by automatically generating DataDeps.jl registration code
Submitting author: @oxinabox
Handling editor: @arfon (Active)
Reviewers: @ninjin
Similarity score: 0.7019

Lerche: Generating data file processors in Julia from EBNF grammars
Submitting author: @jamesrhester
Handling editor: @sbenthall (Active)
Reviewers: @ziotom78, @eschnett
Similarity score: 0.7012

SyntheticEddyMethod.jl: A Julia package for the creation of inlet flow conditions for LES
Submitting author: @carlodev
Handling editor: @philipcardiff (Active)
Reviewers: @atzberg, @akshaysridhar
Similarity score: 0.6999

⚠️ Note to editors: If these papers look like they might be a good match, click through to the review issue for that paper and invite one or more of the authors before considering asking the reviewers of these papers to review again for JOSS.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Julia pre-review TeX Track: 5 (DSAIS) Data Science, Artificial Intelligence, and Machine Learning
Projects
None yet
Development

No branches or pull requests

1 participant