effective_cache_size: Better set it right

The practical impact of effective_cache_size:

Let us take a look at some practical implications of effective_cache_size. To do so we can create a simple table:

test=# CREATE TABLE t_test (id int4);
CREATE TABLE

1 2	test=# CREATE TABLE t_test (id int4); CREATE TABLE

Then we can import some data into the table. We do this in a random order to make the impact more visible:

test=# INSERT INTO t_test SELECT * FROM generate_series(1, 2500000) ORDER BY random();
INSERT 0 2500000

1 2	test=# INSERT INTO t_test SELECT * FROM generate_series(1, 2500000) ORDER BY random(); INSERT 0 2500000

Let us create an index now:

test=# CREATE INDEX idx_in ON t_test (id);
CREATE INDEX

1 2	test=# CREATE INDEX idx_in ON t_test (id); CREATE INDEX

As I have stated before, the default value of effective_cache_size is 128 MB. We can set this to 1 MB on the fly (for our session only):

test=# SET effective_cache_size TO '1 MB';
SET

1 2	test=# SET effective_cache_size TO '1 MB'; SET

To look for the lowest 10 numbers we can use the following query:

test=# explain SELECT * FROM t_test ORDER BY id LIMIT 10;
                                           QUERY PLAN
--------------------------------------------------------------------------------------------
 Limit (cost=0.00..39.97 rows=10 width=4)
 -> Index Only Scan using idx_in on t_test (cost=0.00..9992553.14 rows=2500000 width=4)
(2 rows)

test=# explain SELECT * FROM t_test ORDER BY id LIMIT 10;

QUERY PLAN

--------------------------------------------------------------------------------------------

Limit (cost=0.00..39.97 rows=10 width=4)

-> Index Only Scan using idx_in on t_test (cost=0.00..9992553.14 rows=2500000 width=4)

(2 rows)

As you can see costs of this query are estimated at 39.97 penalty points.

What happens if we change effective_cache_size to an insanely high value?

test=# SET effective_cache_size TO '10000 MB';
SET

test=# explain SELECT * FROM t_test ORDER BY id LIMIT 10;
                                     QUERY PLAN
-------------------------------------------------------------------------------------------
 Limit (cost=0.00..0.44 rows=10 width=4)
  -> Index Only Scan using idx_in on t_test (cost=0.00..109180.31 rows=2500000 width=4)
(2 rows)

test=# SET effective_cache_size TO '10000 MB';

SET

test=# explain SELECT * FROM t_test ORDER BY id LIMIT 10;

QUERY PLAN

-------------------------------------------------------------------------------------------

Limit (cost=0.00..0.44 rows=10 width=4)

-> Index Only Scan using idx_in on t_test (cost=0.00..109180.31 rows=2500000 width=4)

(2 rows)

As you can see the costs will drop dramatically. This makes sense because if we don't expect the kernel to cache any data if we got only 1 MB of RAM – however, we expect the cache hit rate on the kernel side to up dramatically if we can expect things to be cached by the OS. Random I/O is the most expensive thing and changing this cost parameter has serious impacts on what the planner believes. Just imagine a more complex query – different cost estimates can lead to totally different plans.

In order to receive regular updates on important changes in PostgreSQL, subscribe to our newsletter, or follow us on Facebook or LinkedIn.

3 responses to “effective_cache_size: Better set it right”

Pablo Luna says:

July 5, 2017 at 1:07 pm

What if increase effective_cache_size makes worsening in execution plains? Maybe you can reduce the cost but i have experience with a coup of queries whose time execution worsened without any other cause.

Reply
Robert Smith says:

April 26, 2018 at 6:46 am

Since the only difference in the query plans in the example is the estimated cost I assume that the performance would actually be the same? In this instance the estimated cost has not changed the selected plan?

Reply
gunnar says:

November 9, 2022 at 2:54 pm

I don't get what the conclusion in terms of "which value to choose for xy amount of available RAM" is here. I doubt effective_cache_size = '10000 MB' is the recommendation.

Reply

effective_cache_size: Better set it right

The practical impact of effective_cache_size:

3 responses to “effective_cache_size: Better set it right”

Leave a Reply Cancel reply

Hans-Jürgen Schönig

Blog Tags

NEWSLETTER

Articles by our PostgreSQL Experts