Provide Dataverse Statistics to OpenAIRE #9

Open
sourcedump opened this issue Apr 12, 2021 · 2 comments

Comments

@sourcedump

sourcedump commented Apr 12, 2021

Hello!
We are interested in providing view and download statistics to the OpenAIRE content dashboard ( https://openaire.github.io/usage-statistics-guidelines/ ).
The Generic Matomo Tracker script doesn't seem to be sufficient for that. Has anyone built a solution for this?
I've also opened a twin issue here: IQSS/dataverse#7782

Thank you!

@dimitrispie
Collaborator

Hi,
Can you please further elaborate on the above?

@juancorr

Hi @dimitrispie,
Sorry for the delay in responding. We are now picking this thread back up.

I have a provider ready in OpenAIRE for testing and have installed the Generic Matomo Tracker. The problem we found is that our log is not in a standard Apache log format, but in one created to be compatible with Make Data Count.

Our fields are:
#Fields: event_time ¬ client_ip ¬ session_cookie_id ¬ user_cookie_id ¬ user_id ¬ request_url ¬ identifier ¬ filename ¬ size ¬ user-agent ¬ title ¬ publisher ¬ publisher_id ¬ authors ¬ publication_date ¬ version ¬ other_id ¬ target_url ¬ publication_year

I have replaced the tabs with " ¬ " in the log lines shown here.
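A minimal sketch of how one such line could be split into named fields, assuming the real log is tab-separated and every line carries all 19 fields from the header above. The names `FIELDS` and `parse_line` are hypothetical and are not part of the Generic Matomo Tracker or of Dataverse.

```python
# Field names copied from the "#Fields:" header of the log.
FIELDS = [
    "event_time", "client_ip", "session_cookie_id", "user_cookie_id",
    "user_id", "request_url", "identifier", "filename", "size",
    "user-agent", "title", "publisher", "publisher_id", "authors",
    "publication_date", "version", "other_id", "target_url",
    "publication_year",
]


def parse_line(line, delimiter="\t"):
    """Split one log line into a dict keyed by field name.

    Assumption: the delimiter never occurs inside a field value.
    """
    values = [v.strip() for v in line.rstrip("\n").split(delimiter)]
    if len(values) != len(FIELDS):
        raise ValueError(f"expected {len(FIELDS)} fields, got {len(values)}")
    return dict(zip(FIELDS, values))
```

For the excerpts pasted in this comment, passing `delimiter="¬"` should split them the same way, since surrounding spaces are stripped from each value.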

Downloads start with "/api/access/datafile/" in the target_url field:

2022-06-01T11:06:12+0200 ¬ 165.225.194.191 ¬ e816be5b4ff1fb016c7f9a421ee3 ¬ - ¬ :guest ¬ /api/access/datafile/3490 ¬ doi:10.21950/QSEEM9 ¬ file://169e3f9658a-c79edf5526d6 ¬ 5320526530 ¬ Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/102.0.0.0 Safari/537.36 ¬ Informante 1 (Igreja, Cabreiro). Plantas ¬ grid ¬ tbd ¬ Álvarez Pérez, Xosé Afonso (coord.) ¬ 2022-03-08T09:01:29Z ¬ 1 ¬ - ¬ /api/access/datafile/3490 ¬ 2022
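The download rule above could be sketched as follows. This is only an illustration: `DOWNLOAD_PREFIX` and `datafile_id` are hypothetical names, and reading the path segment after the prefix as the file identifier is an assumption based on the example line.

```python
DOWNLOAD_PREFIX = "/api/access/datafile/"


def datafile_id(target_url):
    """Return the segment after the download prefix, or None for non-downloads."""
    if not target_url.startswith(DOWNLOAD_PREFIX):
        return None
    rest = target_url[len(DOWNLOAD_PREFIX):]
    # Drop any query string and any further path segments.
    segment = rest.split("?", 1)[0].split("/", 1)[0]
    return segment or None
```

Applied to the target_url of the line above, `datafile_id("/api/access/datafile/3490")` returns `"3490"`.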

All lines in the log without "/api/access/datafile/" in the target_url field are views of a dataset information page. The page can be the main dataset information page, a previous dataset version, or the information page of a file.

Here is an example:
2022-06-01T01:01:03+0200 ¬ 203.250.102.140 ¬ c59e8a1483d06b92dd21b6f563c7 ¬ - ¬ :guest ¬ /dataset.xhtml?persistentId=doi:10.21950/TRCQ6U ¬ doi:10.21950/TRCQ6U ¬ - ¬ - ¬ Java/1.8.0_181 ¬ Juan Manuel Plata "Manecas" (La Alamedilla). La casa. El hombre ¬ grid ¬ tbd ¬ Álvarez Pérez, Xosé Afonso (coord.) ¬ 2022-03-04T09:31:10Z ¬ 1 ¬ - ¬ /dataset.xhtml?persistentId=doi:10.21950/TRCQ6U ¬ 2022
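Putting both rules together, a line could be labelled as a download or a view from its target_url alone. The sketch below assumes the prefix rule described above is the only distinction needed; `classify` and `persistent_id` are hypothetical helper names, and only standard-library calls are used.

```python
from urllib.parse import parse_qs, urlsplit

DOWNLOAD_PREFIX = "/api/access/datafile/"


def classify(target_url):
    """Label a target_url as 'download' or 'view' using the prefix rule."""
    return "download" if target_url.startswith(DOWNLOAD_PREFIX) else "view"


def persistent_id(target_url):
    """Return the persistentId query parameter of a view URL, if present."""
    query = urlsplit(target_url).query
    values = parse_qs(query).get("persistentId")
    return values[0] if values else None
```

For the example above, `classify` gives `"view"` and `persistent_id` gives `"doi:10.21950/TRCQ6U"`, which matches the identifier field of the same line.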

Could the Generic Matomo Tracker be configured to read this log?

Thank you!
