Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Problem with hypenated inventor name search #374

Open
pgcharetteUdeS opened this issue Jan 11, 2025 · 0 comments
Open

Problem with hypenated inventor name search #374

pgcharetteUdeS opened this issue Jan 11, 2025 · 0 comments

Comments

@pgcharetteUdeS
Copy link

pgcharetteUdeS commented Jan 11, 2025

pybliometrics version: 4.1

Code to reproduce the bug:

query_str = 'in=("Pratte" prox/distance<3 "Jean-François")'
results = Inpadoc.objects.filter(cql_query=query_str).to_pandas()
print(f"Search done, {len(results)} results!")

Expected behavior:

This search should find at least one result, but none are found. The problem is caused by the hyphenated name.

The search finds one result by dropping the second name after the hyphen, by changing the query to: query_str = 'in=("Pratte" prox/distance<3 "Jean")'

Currently, espacenet has two "smart search" web interfaces:

  1. "Classic" (i.e. old): https://worldwide.espacenet.com/?locale=en_EP
  2. "New": https://worldwide.espacenet.com/

If you try the above search query string including the hyphen with the "old" interface, the search finds nothing, whereas it correctly finds one document with the "new" interface. The espacenet documentation says to replace hypens with space, but this makes no difference to the results.

The code above behaves like the "old" espacenet interface, i.e. the search finds no results. Is this a bug in Inpadoc.objects.filter() or is there a workaround or procedure to access the "new" interface via Inpadoc.objects.filter() to support hyphens?

Thanks!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant