This sentence from Drizzle documentation was surprising to me:
Regardless of how many nested relations you query - Drizzle will always make exactly one SQL query to the database, it makes it extremely explicit and easy to tune performance with indexes.
In my mind, using exactly one SQL query will not be performant in all cases. In fact, I have seen some cases where doing multiple joins hurt the performance a lot. Am I missing something?! maybe Drizzle is doing something I don't know ...
So I created this benchmark to experiment with Drizzle a bit and see what I am understanding wrong.
- I used Postgres inside a docker container
- I created a simple blog database
users
id int primary key auto-increment
name varchar(255)
email varchar(255)
bio text
posts
id primary key auto-increment
user_id int foreign key (users.id)
title text
content text
comments
id primary key auto-increment
user_id int foreign key (users.id)
post_id int foreign key (posts.id)
content text
- I inserted a lot of data into the database: 100k users, 500k posts and 5M comments
- I wrote two scripts to fetch the same data: a number of comments with their author name, post title and post author name. The count of comments to fetch is given as a parameter to the script.
- The first script
src/single.ts
uses Drizzledb.query...
to fetch all the data using a single query. - The second script
src/multiple.ts
uses Drizzledb.query...
three times to fetch data from the 3 tables and combine them manually. Both scripts give exactly the same data!
- The first script
count of comments | single query on v0.26.5 | multiple queries + combination on v0.26.5 | single query on v0.28.0 | multiple queries + combination on v0.28.0 | single query with Orchid ORM v1.10.5 |
---|---|---|---|---|---|
1 | 1900 ms | 27 ms | 11 ms | 17 ms | 12 ms |
100 | 3250 ms | 30 ms | 13 ms | 21 ms | 13 ms |
1000 | 16_000 ms | 55 ms | 23 ms | 37 ms | 21 ms |
5000 | 71_000 ms | 150 ms | 57 ms | 120 ms | 50 ms |
SQL produced by Drizzle ORM v0.26.5
SELECT "id",
"user_id",
"post_id",
"content",
"user"::JSON,
"post"::JSON
FROM
(SELECT "comments".*,
CASE
WHEN count("comments_post"."id") = 0 THEN '[]'
ELSE json_agg(json_build_array("comments_post"."title", "comments_post"."user"::JSON))::text
END AS "post"
FROM
(SELECT "comments".*,
CASE
WHEN count("comments_user"."id") = 0 THEN '[]'
ELSE json_agg(json_build_array("comments_user"."name"))::text
END AS "user"
FROM "comments"
LEFT JOIN
(SELECT "comments_user".*
FROM "users" "comments_user") "comments_user" ON "comments"."user_id" = "comments_user"."id"
GROUP BY "comments"."id",
"comments"."user_id",
"comments"."post_id",
"comments"."content") "comments"
LEFT JOIN
(SELECT "comments_post".*
FROM
(SELECT "comments_post".*,
CASE
WHEN count("comments_post_user"."id") = 0 THEN '[]'
ELSE json_agg(json_build_array("comments_post_user"."name"))
END AS "user"
FROM "posts" "comments_post"
LEFT JOIN
(SELECT "comments_post_user".*
FROM "users" "comments_post_user") "comments_post_user" ON "comments_post"."user_id" = "comments_post_user"."id"
GROUP BY "comments_post"."id") "comments_post") "comments_post" ON "comments"."post_id" = "comments_post"."id"
GROUP BY "comments"."id",
"comments"."user_id",
"comments"."post_id",
"comments"."content",
"comments"."user") "comments"
LIMIT 1
SQL produced by Drizzle ORM v0.28.0
select "comments"."id",
"comments"."user_id",
"comments"."post_id",
"comments"."content",
"comments_user"."data" as "user",
"comments_post"."data" as "post"
from "comments"
left join lateral (select json_build_array("comments_user"."name") as "data"
from (select *
from "users" "comments_user"
where "comments_user"."id" = "comments"."user_id"
limit 1) "comments_user") "comments_user" on true
left join lateral (select json_build_array("comments_post"."title", "comments_post_user"."data") as "data"
from (select *
from "posts" "comments_post"
where "comments_post"."id" = "comments"."post_id"
limit 1) "comments_post"
left join lateral (select json_build_array("comments_post_user"."name") as "data"
from (select *
from "users" "comments_post_user"
where "comments_post_user"."id" = "comments_post"."user_id"
limit 1) "comments_post_user") "comments_post_user"
on true) "comments_post" on true
order by "comments"."id"
limit 1
SQL produced by Orchid ORM v1.10.5
SELECT "comments".*,
row_to_json("user".*) "user",
row_to_json("post".*) "post"
FROM "comments"
LEFT JOIN LATERAL
(SELECT "user"."name"
FROM "users" AS "user"
WHERE "user"."id" = "comments"."user_id") "user" ON TRUE
LEFT JOIN LATERAL
(SELECT "post"."title",
row_to_json("user2".*) "user"
FROM "posts" AS "post"
LEFT JOIN LATERAL
(SELECT "user"."name"
FROM "users" AS "user"
WHERE "user"."id" = "post"."user_id") "user2" ON TRUE
WHERE "post"."id" = "comments"."post_id") "post" ON TRUE
LIMIT 1
count of comments | single query with Drizzle ORM on v0.28.0 | multiple queries + data combination on v0.28.0 |
---|---|---|
1 | 17 ms | 20 ms |
100 | 18 ms | 22 ms |
1000 | 24 ms | 35 ms |
5000 | 73 ms | 91 ms |
SQL produced by Drizzle ORM v0.28.0
select `comments`.`id`,
`comments`.`user_id`,
`comments`.`post_id`,
`comments`.`content`,
`comments_user`.`data` as `user`,
`comments_post`.`data` as `post`
from `comments`
left join lateral (select json_array(`comments_user`.`name`) as `data`
from (select *
from `users` `comments_user`
where `comments_user`.`id` = `comments`.`user_id`
limit 1) `comments_user`) `comments_user` on true
left join lateral (select json_array(`comments_post`.`title`, `comments_post_user`.`data`) as `data`
from (select *
from `posts` `comments_post`
where `comments_post`.`id` = `comments`.`post_id`
limit 1) `comments_post`
left join lateral (select json_array(`comments_post_user`.`name`) as `data`
from (select *
from `users` `comments_post_user`
where `comments_post_user`.`id` = `comments_post`.`user_id`
limit 1) `comments_post_user`) `comments_post_user`
on true) `comments_post` on true
order by `comments`.`id`
limit 1
count of comments | single query with Drizzle ORM v0.28.0 | multiple queries + data combination on v0.28.0 |
---|---|---|
1 | 2 ms | 2 ms |
100 | 3 ms | 4 ms |
1000 | 15 ms | 20 ms |
5000 | 47 ms | 67 ms |
SQL produced by Drizzle ORM v0.28.0:
select "id",
"user_id",
"post_id",
"content",
(select json_array("name") as "data"
from (select *
from "users" "comments_user"
where "comments_user"."id" = "comments"."user_id"
limit 1) "comments_user") as "user",
(select json_array("title", (select json_array("name") as "data"
from (select *
from "users" "comments_post_user"
where "comments_post_user"."id" = "comments_post"."user_id"
limit 1) "comments_post_user")) as "data"
from (select *
from "posts" "comments_post"
where "comments_post"."id" = "comments"."post_id"
limit 1) "comments_post") as "post"
from "comments"
order by "comments"."id"
limit 1
Requirements: You will need to have docker-compose and Nodejs installed on your system.
Then follow these steps:
- Clone this repo
- Install dependencies
yarn install
- Generate the SQL file that inserts data into the database
yarn generate-seed
(You can change the amount of data to generate by changing thecounts
variable insrc/seed/generators.ts
) - Start the database
yarn start-db
(This will take some time to insert all the data, it took about 5mins on my system. Wait untill you seedatabase system is ready to accept connections
)- To seed sqlite database - use
yarn seed-sqlite
- To seed sqlite database - use
- Keep the database running, open a new terminal to run the scripts
For PostgreSQL tests
- Run the single query script with
yarn tsx src/pg/single.ts <count>
(example:yarn tsx src/pg/single.ts 100
) - Run the multiple queries script with
yarn tsx src/pg/multiple.ts <count>
(example:yarn tsx src/pg/multiple.ts 100
) - Run the orchid queries script with
yarn tsx src/pg/orchid.ts <count>
(example:yarn tsx src/pg/orchid.ts 100
)
For MySQL tests
- Run the single query script with
yarn tsx src/mysql/single.ts <count>
(example:yarn tsx src/mysql/single.ts 100
) - Run the multiple queries script with
yarn tsx src/mysql/multiple.ts <count>
(example:yarn tsx src/mysql/multiple.ts 100
)
For SQLite tests
- Run the single query script with
yarn tsx src/sqlite/single.ts <count>
(example:yarn tsx src/sqlite/single.ts 100
) - Run the multiple queries script with
yarn tsx src/sqlite/multiple.ts <count>
(example:yarn tsx src/sqlite/multiple.ts 100
)
The two scripts write the loaded data into the files single.json
, multiple.json
, and orchid.json
respectively, so you can inspect the files and check that they fetch the same data.
Feel free to open an issue or submit a PR, I created this repo out of curiosity and my goal is to learn new things. if you see that I am doing something wrong, please let me know!