The WordPress.com Blog

WordPress.com News

An efficient alternative to paging with SQL OFFSETs

February 14, 2014

Stephane Daury

Challenge

Running WordPress.com means having multimillion-record database tables. Tables which we often need to batch-query.

Provided we could hardly select (or update, etc) millions of records at once and expect speed, we commonly have to “page” our scripts to only handle a limited number of records at once, then move on to the next batch.

Classic, but inefficient, solution

The usual way of paging result sets in most SQL RDMS is to use the OFFSET option (or LIMIT [offset], [limit], which is the same).

SELECT * FROM my_table OFFSET 8000000 LIMIT 100;
But on a performance level, this means you’re asking your DB engine to figure out where to start from all on its own, every time. Which then means it must be aware of every record before the queried offset, because they could be different between queries (deletes, etc). So the higher your offset number, the longer the overall query will take.
Alternative solution
Instead, of keeping track of an offset in your query script, consider keeping track of the last record’s primary key in the previous result set instead. Say, its ID. At the next loop instance, query your table based on other records having a greater value for said ID.
SELECT * FROM my_table WHERE id &amp;amp;amp;gt; 7999999 LIMIT 100;
This will let you page in the same way, but your DB’s engine will know exactly where to start, based on an efficient indexed key, and won’t have to consider any of the records prior to your range. Which will all translate to speedy queries.
Here’s a real-life sample of how much difference this can make:
mysql&amp;amp;amp;gt; SELECT * FROM feeds LIMIT 8000000, 10;
  [...]
10 rows in set (12.80 sec)

mysql&amp;amp;amp;gt; SELECT * FROM feeds WHERE feed_id &amp;amp;amp;gt; 12958559 LIMIT 10;
  [...]
10 rows in set (0.01 sec)

I received the very same records back, but the first query took 12.80 seconds, while the alternative took 0.01 instead. 🙂
PHP/WordPress example
&amp;amp;amp;lt;?php
// Start with 0
$last_id = 0;

do {
    $blogs = $wpdb-&amp;amp;amp;gt;get_results( $wpdb-&amp;amp;amp;gt;prepare(
        'SELECT * FROM wp_blogs WHERE blog_id &amp;amp;amp;gt; %d LIMIT 100;',
        $last_id // Use the last ID to start after
    ) );

    foreach ( $blogs as $blog ) {
        // Do your thing!
        // ...
        // Record the last ID for the next loop
        $last_id = $blog-&amp;amp;amp;gt;blog_id;
    }
// Do it until we have no more records
} while ( ! empty( $blogs ) );
?&amp;amp;amp;gt;

Email Newsletter

			
			
				Missing out on the latest WordPress.com developments? Enter your email below to receive future announcements direct to your inbox. An email confirmation will be sent before you will start receiving notifications—please check your spam folder if you don't receive this.
				
					
						Email Address:					

									

				
					
					
					
					
					
										
				
			
							
					Join 107.6M other subscribers				
						
			
Share this:
Click to share on Tumblr (Opens in new window)
Click to share on X (Opens in new window)
Click to share on Mastodon (Opens in new window)
Click to share on Facebook (Opens in new window)
Click to share on LinkedIn (Opens in new window)
Click to share on WhatsApp (Opens in new window)
Click to share on Threads (Opens in new window)
Click to share on Bluesky (Opens in new window)
More
Click to email a link to a friend (Opens in new window)
Click to print (Opens in new window)
Click to share on Pinterest (Opens in new window)
Click to share on Reddit (Opens in new window)
Click to share on Pocket (Opens in new window)
Like Loading...


	Related

February 14, 2014
Developer Tools

Create your new blog or website for free