Tutorial about how to structure data in a relational database system #283

niconoe · 2021-11-16T13:23:14Z

I've had the chance over the last few years to collaborate with biodiversity professionals, and from my IT-oriented perspective there's often a lack of knowledge about how to efficiently organise / structure data in a relational database system. To be more precise: there's generally enough knowledge to make something "that works", but a better-structured database would be much more future-proof (fewer data errors, easier reuse of the data in other contexts such as data publication, web portals, ...).

That tutorial would cover neither SQL nor the technicalities of a given database engine (SQLite, PostgreSQL, ...), but would rather help answer questions such as:

- should a given piece of information be placed in a new field or in a new table?
- how should I link tables X, Y and Z so they can be queried to answer a wide range of questions?
- which constraints can I configure early, when I create a database, so that human errors (e.g. typos when entering data) are detected as early as possible (and the database doesn't get messier as it gets bigger and more heavily used)?
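
To give a rough idea of the kind of answers I have in mind, here is a minimal sketch using Python's built-in `sqlite3` module. The `species` and `observation` tables and their columns are hypothetical, made up purely for illustration; the tutorial itself would stay engine-agnostic. Species names live in their own table (a new table rather than a free-text field), observations link to them through a foreign key, and a `CHECK` constraint rejects impossible counts.

```python
import sqlite3

# Hypothetical two-table layout: one row per species, many observations per species.
conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite only enforces foreign keys when asked to

conn.executescript("""
CREATE TABLE species (
    id              INTEGER PRIMARY KEY,
    scientific_name TEXT NOT NULL UNIQUE      -- each name is stored exactly once
);
CREATE TABLE observation (
    id               INTEGER PRIMARY KEY,
    species_id       INTEGER NOT NULL REFERENCES species(id),
    individual_count INTEGER NOT NULL CHECK (individual_count > 0),
    observed_on      TEXT NOT NULL
);
""")

INSERT_OBSERVATION = (
    "INSERT INTO observation (species_id, individual_count, observed_on) "
    "VALUES (?, ?, ?)"
)

def try_insert(conn, sql, params):
    """Return True if the row is accepted, False if a constraint rejects it."""
    try:
        conn.execute(sql, params)
        return True
    except sqlite3.IntegrityError as exc:
        print(f"rejected {params}: {exc}")
        return False

conn.execute("INSERT INTO species (id, scientific_name) VALUES (1, 'Vulpes vulpes')")

try_insert(conn, INSERT_OBSERVATION, (1, 2, "2021-11-16"))   # valid row
try_insert(conn, INSERT_OBSERVATION, (99, 1, "2021-11-16"))  # species 99 does not exist
try_insert(conn, INSERT_OBSERVATION, (1, -3, "2021-11-16"))  # negative count
```

Because the species name is stored once and only referenced by its id, a typo in a name cannot creep into individual observations, and the two invalid rows are refused at entry time instead of surfacing later during analysis.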

Is there any demand for this from scientists, or is it just me?

If so, I'd be happy to help contribute to a tutorial (but I think it could be a pretty large task, so I'd like to get an idea of the interest first).

florisvdh · 2021-11-17T09:25:54Z

Hi Nico, this sounds great, and indeed seems very useful to many scientists (INBO and non-INBO) - thanks so much for your enthusiasm!

IMHO it would be interesting for you to connect with @fredericpiesschaert and @gertvanspaendonk to see which material they already have in that direction (it may be in Dutch, and SQL Server oriented, but still honouring the ideas that you describe). Ideally they could participate in this effort.

Moreover, this has relationships with issue #10 and the associated (still open) PR #140 (designing databases), for which further collaboration with @ThierryO was suggested in order to make it more technology-agnostic, similar to what you suggest.

@niconoe, maybe you could bring the involved people together for a dedicated brainstorm and to refresh the plans?

florisvdh · 2021-12-09T16:09:36Z

> Moreover, this has relationships with issue #10 and the associated (still open) PR #140 (designing databases), for which further collaboration with @ThierryO was suggested in order to make it more technology-agnostic, similar to what you suggest.

Update: PR #140 has been merged, since the authors wish to limit its scope to MS SQL Server. So there's still room to tackle issue #10 further.

Advice on how to structure data in a (any) RDBMS is most welcome indeed.

niconoe added the proposal tutorial label Nov 16, 2021
