Understanding LATERAL joins in PostgreSQL

9 responses to “Understanding LATERAL joins in PostgreSQL”

Andreas Kretschmer says:

July 8, 2021 at 8:14 am

nice example and explanation

Reply
- Hans-Jürgen Schönig says:
  
  July 8, 2021 at 9:11 am
  
  thanks for the feedback. new ideas are always welcome :).
  
  Reply
wilson parry says:

December 22, 2021 at 12:49 pm

Great explanation. So it's also very useful when there's no primary key to JOIN on. As I was reading this, I thought, "Why not just JOIN on id and add a WHERE statement after (prob more expensive)?" But there is no primary key shared by the tables. The only relation you have is price. Also, I often hear LATERAL can be done with a CTE but I just tried and could not reproduce the result with a CTE

Reply
Ben says:

January 28, 2022 at 4:56 am

Winning! thanks for the blog post: clear, concise, lucid.

btw would you know any university courses where one can learn postgres / querying to an expert / advanced level?

Reply
Elijah Windrunner says:

December 15, 2022 at 7:50 am

Best explanation of LATERAL. Thx

Reply
disqus_v49hajf7Dt says:

July 14, 2023 at 7:59 am

Thanks, great explanation! Lateral join seems to be very dangerous and should only be used as a last-ditch effort. Even in this case which looks to be a very rare problem you are calculating the result in O(n*m) time for an O(n) problem and only works because you have 3 customers instead of say 30000. But it's still much better than cross joins and window functions to solve the same problem.

Is there a declarative way to force postgresql to solve this in O(n)? In pl/SQL I could put together a simple function that solved this in less than a second for a 1000000 x 100000 dataset which was obviously impossible for the lateral join.

Reply
laurenz says:

July 14, 2023 at 8:31 am

A lateral join always forces a nested loop join. That is not dangerous at all, but can perform badly, particularly if both result sets are large. With the top three customers it shouldn't matter, and I don't think there is a better way.

If n and m are the row counts being joined, this always has to be O(n*m). "Less than a second" has nothing to do with the big O notation. PL/pgSQL is not particularly fast, and I doubt that a lateral join implemented in PL/pgSQL will beat a lateral join in SQL.

Reply
laurenz says:

July 14, 2023 at 11:09 am

I may have completely misunderstood, true. The comment was not very clear.
Anyway, the algorithm you describe is known as "nested loops":

take the first wish and start looping through the products [...]
take the next wish and continue the loop [...]

Reply
- disqus_v49hajf7Dt says:
  
  July 14, 2023 at 11:24 am
  
  No, it's not. Please take your time to understand it.
  
  I edited my comment above and added my SQLs.
  
  Reply

Understanding LATERAL joins in PostgreSQL

Inspecting FROM more closely

LATERAL joins: Creating sample data

Running LATERAL joins

Finally...

9 responses to “Understanding LATERAL joins in PostgreSQL”

Leave a Reply Cancel reply

Hans-Jürgen Schönig

Blog Tags

NEWSLETTER

Articles by our PostgreSQL Experts