TASK-6347 Fix normalization issue in ClinVar #693
… clinical variants, #TASK-6347 On branch TASK-6347 Changes to be committed: modified: cellbase-lib/src/main/java/org/opencb/cellbase/lib/builders/clinical/variant/ClinicalIndexer.java
…nd Gwas, files to download; and update Cosmic version, #TASK-6347 On branch TASK-6347 Changes to be committed: modified: cellbase-core/src/main/resources/configuration.yml modified: cellbase-lib/src/main/java/org/opencb/cellbase/lib/builders/clinical/variant/CosmicIndexer.java modified: cellbase-lib/src/main/java/org/opencb/cellbase/lib/download/ClinicalDownloadManager.java
…on of the GWAS Catalog, #TASK-6347 On branch TASK-6347 Changes to be committed: modified: cellbase-core/src/main/resources/configuration.yml
…sion, #TASK-6347 On branch TASK-6347 Changes to be committed: modified: cellbase-lib/src/main/java/org/opencb/cellbase/lib/EtlCommons.java
On branch TASK-6347 Changes to be committed: modified: cellbase-lib/src/main/java/org/opencb/cellbase/lib/builders/clinical/variant/ClinVarIndexer.java
…file; and fix some sonnar issues, #TASK-6347 On branch TASK-6347 Changes to be committed: modified: cellbase-lib/src/main/java/org/opencb/cellbase/lib/builders/clinical/variant/GwasIndexer.java
try {
    accession = publicSet.getReferenceClinVarAssertion().getClinVarAccession().getAcc();
} catch (Exception e) {
    logger.warn("Error getting accession\n" + StringUtils.join(e.getStackTrace(), "\n"));
Better to let the logger print the stack trace:
logger.warn("Error getting accession", e);
and you can also indicate what will happen with the accession:
logger.warn("Error getting accession. Ignore error and leave accession as null.", e);
This applies to all other fields read in this method.
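Since the same try/catch would repeat for every field, one option is a small helper that wraps each getter chain. The following is only a sketch: `SafeFieldReader` and `readOrNull` are hypothetical names, and `java.util.logging` stands in for the project's logger so the example runs without extra dependencies. With SLF4J the equivalent call would be `logger.warn(message, e)`.

```java
import java.util.function.Supplier;
import java.util.logging.Level;
import java.util.logging.Logger;

public class SafeFieldReader {
    private static final Logger LOGGER = Logger.getLogger(SafeFieldReader.class.getName());

    // Hypothetical helper: evaluates the getter chain and returns null if any step throws.
    public static <T> T readOrNull(String fieldName, Supplier<T> getter) {
        try {
            return getter.get();
        } catch (Exception e) {
            // Pass the exception itself so the logging framework formats the stack trace,
            // and state in the message what happens to the field.
            LOGGER.log(Level.WARNING,
                    "Error getting " + fieldName + ". Ignore error and leave " + fieldName + " as null.", e);
            return null;
        }
    }

    public static void main(String[] args) {
        String missing = null;
        // The lambda throws a NullPointerException, which readOrNull logs and swallows.
        String accession = readOrNull("accession", () -> missing.trim());
        System.out.println(accession);
    }
}
```

Each field read in the method could then become a single call, for example `readOrNull("accession", () -> publicSet.getReferenceClinVarAssertion().getClinVarAccession().getAcc())`.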
while ((line = inputReader.readLine()) != null) {
    ++lineCounter;
    if (!line.isEmpty()) {
        processedGwasLines++;
You might want to use an org.opencb.commons.ProgressLogger here:
ProgressLogger progressLogger = new ProgressLogger("Lines parsed").setBatchSize(10000);
...
if (!line.isEmpty()) {
    progressLogger.increment(1);
}
This is just a comment, no action is required (i.e. it's optional)
Hmm, that's useful, but I can't find a method to get the current number of parsed lines. I need this information to include in the log messages when parsing fails.
That is very inconvenient... there is no way of obtaining that.
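One way to reconcile the two needs is to keep the local line counter for error messages and use the progress reporter only for periodic output. This is a sketch under stated assumptions: `LineCountingParser`, the `Progress` interface, and `parseLine` are hypothetical stand-ins, not the OpenCB `ProgressLogger` or the actual GwasIndexer code.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;
import java.io.UncheckedIOException;

public class LineCountingParser {
    // Hypothetical stand-in for a progress reporter that exposes no getter for its count.
    public interface Progress {
        void increment(long delta);
    }

    // Returns the number of lines parsed successfully.
    public static long parse(BufferedReader reader, Progress progress) {
        long lineCounter = 0;   // every line read, used in error messages
        long processed = 0;     // non-empty lines that parsed without error
        try {
            String line;
            while ((line = reader.readLine()) != null) {
                ++lineCounter;
                if (line.isEmpty()) {
                    continue;
                }
                try {
                    parseLine(line);
                    processed++;
                    progress.increment(1);
                } catch (IllegalArgumentException e) {
                    // The local counter supplies the line number the reporter cannot.
                    System.err.println("Skipping line " + lineCounter + ": " + e.getMessage());
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return processed;
    }

    // Hypothetical parser: requires at least two tab-separated fields.
    static void parseLine(String line) {
        if (line.split("\t").length < 2) {
            throw new IllegalArgumentException("expected at least 2 tab-separated fields");
        }
    }

    public static void main(String[] args) {
        BufferedReader reader = new BufferedReader(new StringReader("a\tb\n\nbad\nc\td\n"));
        long parsed = parse(reader, delta -> { });
        System.out.println("Parsed lines: " + parsed);
    }
}
```

The cost is a duplicated count, but the parser no longer depends on what the progress reporter chooses to expose.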
cellbase-lib/src/main/java/org/opencb/cellbase/lib/builders/clinical/variant/GwasIndexer.java
On branch TASK-6347 Changes to be committed: modified: cellbase-lib/src/main/java/org/opencb/cellbase/lib/builders/clinical/variant/ClinVarIndexer.java
TASK-6347 normalization issue in ClinVar