Gradually Deleting Data in SQL Server

If you have a very large table in SQL Server from which you periodically need to delete tens of millions of rows of data, there are several ways to do it.

If you have a maintenance window (or your database is not required to be available 24 x 7 x 365), you can (and probably should) just delete all of the rows in one shot, using a set-based operation. This is the quickest way to delete a large number of rows, but you will probably end up with lock escalation to a table lock, which essentially makes the table unavailable during the delete.
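
For reference, a one-shot, set-based purge of the same hypothetical dbo.BigLoggingTable used in the sample script later in this post would look something like this:

-- One-shot, set-based delete (only practical during a maintenance window)
DELETE
FROM dbo.BigLoggingTable
WHERE TransactionId < 382989078;    -- everything below the high-water mark, in a single transaction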

Another issue to consider is whether you have transactional replication on that table, and/or database mirroring in place on the database. Deleting a large number of rows from a table generates a lot of log activity, which may cause transactional replication or database mirroring to fall behind. This of course depends on your hardware and network infrastructure. You also want to keep an eye on your transaction log, to make sure it is not filling up and triggering auto-grow.
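
For example, two quick ways to keep an eye on the log while a large delete is running (both work on SQL Server 2005 and later):

-- Log size and percentage used for every database on the instance
DBCC SQLPERF(LOGSPACE);

-- What, if anything, is currently preventing log truncation in this database
SELECT name, log_reuse_wait_desc
FROM sys.databases
WHERE name = DB_NAME();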

A safer, but much more time-consuming, way to delete millions of rows is to use some sort of looping mechanism, where you gradually delete a fairly small number of rows in a loop, slowly nibbling away at the table. This will take much longer than a set-based operation, but, if done properly, will not cause concurrency problems, and will not overwhelm transactional replication or database mirroring.

At any rate, I recently faced a situation like this, so I decided to show one method that deals with it pretty easily. In this case, we want to delete every row that has a TransactionId lower than a certain number. We are going to delete 500 qualifying rows in each delete (TOP without an ORDER BY, so they are arbitrary rows below the high-water mark rather than truly random), and loop 5000 times, with a slight delay between each delete. This will delete 2.5 million rows each time the query is run. You can obviously adjust these numbers and the delay time so that it works best in your environment. You could also wrap this into a stored procedure.

-- Gradual Delete Sample
-- Glenn Berry 
-- August 2011
-- https://sqlserverperformance.wordpress.com/
-- Twitter: GlennAlanBerry

SET NOCOUNT ON;

-- Check space used by table before we begin
EXEC sp_spaceused N'dbo.BigLoggingTable';

-- Declare local variables
DECLARE @NumberOfLoops AS int;
SET @NumberOfLoops = 5000;

DECLARE @CurrentLoop AS int;
SET @CurrentLoop = 0;

DECLARE @DeleteSize bigint;
SET @DeleteSize = 500;

DECLARE @HighWaterMark bigint;
SET @HighWaterMark = 382989078;

WHILE @CurrentLoop < @NumberOfLoops
    BEGIN
        -- Just delete any xxx rows that are below the HighWaterMark
        DELETE 
        FROM dbo.BigLoggingTable
        WHERE TransactionId IN 
            (SELECT TOP(@DeleteSize) TransactionId 
             FROM dbo.BigLoggingTable WITH (NOLOCK)
             WHERE TransactionId < @HighWaterMark);
             
        WAITFOR DELAY '00:00:00:50';    -- brief pause between each delete batch
          
        SET @CurrentLoop = @CurrentLoop + 1;
    END

-- Check space used by table after we are done    
EXEC sp_spaceused N'dbo.BigLoggingTable';

23 Responses to Gradually Deleting Data in SQL Server

  1. Bender says:

    I’ve had to do this on many very large tables before. I’ve used this process before and while it works very well, I’ve found the deeper it goes the slower it gets. I manually updated the statistics on every, say, 200th iteration and that seemed to keep it more consistent. I’m guessing the WAITFOR is probably giving SQL time to do the same. GUESSING.
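
    A minimal sketch of folding that idea into the loop from the post (200 is just the interval mentioned above, and the table name comes from Glenn's example):

    -- Inside the WHILE loop, after the DELETE statement
    IF @CurrentLoop > 0 AND @CurrentLoop % 200 = 0
        BEGIN
            -- Refresh statistics so the delete plan stays reasonable as the table shrinks
            UPDATE STATISTICS dbo.BigLoggingTable;
        END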

    When it comes to very, very large tables, which in my case are logging tables (I don’t work with good citizens who clean up after themselves), I usually just bulk copy the data I want out, truncate the table and BULK INSERT the data back in. Again, probably not the best solution for every situation, but it’s the fastest one I’ve come up with.
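
    A rough T-SQL approximation of that copy-out / truncate / reload approach (staying inside the database rather than using bcp, with a hypothetical dbo.LoggingTable and LogDate retention rule):

    -- Copy the rows you want to keep to a holding table
    SELECT *
    INTO dbo.LoggingTable_Keep
    FROM dbo.LoggingTable
    WHERE LogDate >= DATEADD(dd, -30, GETDATE());    -- hypothetical 30-day retention

    -- Remove everything with minimal logging
    TRUNCATE TABLE dbo.LoggingTable;

    -- Put the kept rows back (an explicit column list and SET IDENTITY_INSERT
    -- would be needed if the table has an identity column)
    INSERT INTO dbo.LoggingTable WITH (TABLOCK)
    SELECT * FROM dbo.LoggingTable_Keep;

    DROP TABLE dbo.LoggingTable_Keep;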

    B

  2. I’ve had to do this on many systems too. Worth noting: table partitioning helps this scenario quite a lot, if it’s an option. EE only, and the underlying tables have to be set up for partitioning, so it doesn’t always work – but when it does, you can just switch and truncate.
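
    A sketch of the switch-and-truncate pattern, assuming a hypothetical dbo.LoggingTable partitioned by date, with an empty staging table that matches its schema and sits on the same filegroup:

    -- Move the oldest partition out of the live table (a metadata-only operation)
    ALTER TABLE dbo.LoggingTable
    SWITCH PARTITION 1 TO dbo.LoggingTable_Staging;

    -- The old rows can then be removed almost instantly
    TRUNCATE TABLE dbo.LoggingTable_Staging;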

    Also, for non-partitioned tables, DELETE TOP (n) … works within the same sort of looping structure. I have some of these that basically do: loop / delete top (n) … where … / while @@rowcount > 0, to keep on deleting until no rows qualify any longer.
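
    In code, that loop might look something like this (dbo.LoggingTable and the WHERE clause are placeholders):

    DECLARE @RowsDeleted int;
    SET @RowsDeleted = 1;

    WHILE @RowsDeleted > 0
        BEGIN
            -- Delete one batch of qualifying rows
            DELETE TOP (500)
            FROM dbo.LoggingTable
            WHERE LogDate < '20110101';    -- hypothetical purge criterion

            SET @RowsDeleted = @@ROWCOUNT;
        END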

  3. Manjot says:

    Hi,
    I always delete huge amounts of data by dividing them into smaller chunks, as you wrote above, but the last time I was running this on a server, it still tended to grow the transaction log file (the database was in the simple recovery model). I checked sys.dm_os_waiting_tasks and saw that the checkpoint process had been waiting for a long time. So I just waited until the checkpoint wait disappeared and then re-ran the delete process, and concluded that the checkpoint needed to catch up. How would you address this issue without waiting?
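
    One thing that is sometimes done in that situation (just a hedged sketch, not something from Glenn's script) is to issue a manual CHECKPOINT every so many batches, since log truncation in the SIMPLE recovery model happens at checkpoint:

    -- Inside the delete loop, every 100th iteration (interval is arbitrary)
    IF @CurrentLoop % 100 = 0
        BEGIN
            CHECKPOINT;    -- lets the inactive portion of the log be reused sooner
        END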

  4. Jason Crider says:

    Thanks for the very informative post, Glenn. I’m really enjoying all the great info you’re putting out there. Any new word on getting your book on the Kindle?

    Your post reminded me about a situation that happened to me and I mentioned you in my blog post @ http://www.jasoncrider.com/blog/archives/2011/08/16/deleting-data-in-small-chunks-on-sql-server/.

    Keep up the more than excellent work.

  5. Pingback: Something for the Weekend – SQL Server Links 19/08/11

  6. Randy says:

    I’ve done something similar using a try / catch to keep the loop going. I also set deadlock priority low and was able to run it even during the business day. Can take a very long time of course, but I always want my process to “die” first. At least it allows me to stop doing stuff like this every single weekend. 🙂
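
    A minimal sketch of that idea layered onto the loop from the post (reusing Glenn's table and variables; the catch block here simply waits and moves on, which is only one possible choice):

    SET DEADLOCK_PRIORITY LOW;    -- if a deadlock occurs, this session is chosen as the victim

    WHILE @CurrentLoop < @NumberOfLoops
        BEGIN
            BEGIN TRY
                DELETE
                FROM dbo.BigLoggingTable
                WHERE TransactionId IN
                    (SELECT TOP(@DeleteSize) TransactionId
                     FROM dbo.BigLoggingTable
                     WHERE TransactionId < @HighWaterMark);
            END TRY
            BEGIN CATCH
                -- Deadlock victim (error 1205) or other failure: pause briefly and keep going
                WAITFOR DELAY '00:00:01';
            END CATCH

            SET @CurrentLoop = @CurrentLoop + 1;
        END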

  7. Charles Kincaid says:

    Then there is the issue of the clustered index. Likely whatever key you choose to purge your table on won’t match the clustered key on that table. If it does, then you might dodge the “unused is taking up my space” syndrome. If any row on the leaf of a clustered index is good, then the whole page stays. If your clustered index is some IDENTITY-based thing, then you wind up with index pages more full of holes than a government promise.

    Eventually you will have to resort to rebuilding the clustered index. This technique just pushes that eventuality farther into the future.
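
    For reference, checking how ragged the clustered index has become, and rebuilding it when needed, can be done along these lines (using the table from Glenn's example; ONLINE = ON requires Enterprise Edition):

    -- Fragmentation and page fullness of the clustered index (index_id = 1)
    SELECT index_id, avg_fragmentation_in_percent, avg_page_space_used_in_percent
    FROM sys.dm_db_index_physical_stats(DB_ID(), OBJECT_ID(N'dbo.BigLoggingTable'), 1, NULL, 'SAMPLED');

    -- Rebuild it when it gets bad enough
    ALTER INDEX ALL ON dbo.BigLoggingTable REBUILD WITH (ONLINE = ON);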

  8. Vanessa says:

    Hey, nice article.
    Can we just take advantage of some SQL management software to help us do the gradual data deletion work? Just saw an article about SQL restore and backup with other features: http://www.todo-backup.com/products/features/sql-backup-and-restore.htm

  9. r_guy says:

    If you are deleting the entire table, truncate. If you are keeping a lot less rows than deleting, copy to temporary table, drop and recreate table if feasible or truncate table and copy rows back. Loops are a last alternative.

    • Glenn Berry says:

      Yes, TRUNCATE is much better if you are deleting the entire table. Copying the table to another table is not such a good idea if you have hundreds of millions of rows and you want to delete, say, 10 million rows. Remember, the whole point here is to delete lots of rows while the table is online and available 24 x 7.

  10. Slicky says:

    Hi Glenn. I’ve heard that sp_spaceused is an unreliable method for calculating a table’s size. Reading about it on a blog from an expert like you, I’m thinking that might not be true. Can you confirm?

  11. shaun5stu says:

    I had to develop something similar to this for use on a group of five financial services tables that had to be up 24/7. Locking was a huge issue for me – locks of even 10 seconds were not tolerated. I found a great explanation of my options at http://www.sqlsoldier.com/wp/sqlserver/sqluvldbweekarchivingandpurgingdata.

    Merrill Aldrich: Note that using TOP will not reduce locks. This is mentioned in the above referenced blog post and also confirmed by my experience. Rather than using DELETE TOP on each of the tables, I did a SELECT TOP to insert the IDs of the records I wanted to delete into a temp table, then deleted based on joining to that table.
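
    A sketch of that pattern, using the table and column names from Glenn's example:

    DECLARE @HighWaterMark bigint;
    SET @HighWaterMark = 382989078;

    -- Capture the keys for one batch in a temp table
    SELECT TOP (500) TransactionId
    INTO #DeleteBatch
    FROM dbo.BigLoggingTable
    WHERE TransactionId < @HighWaterMark;

    -- Delete by joining to the captured keys
    DELETE blt
    FROM dbo.BigLoggingTable AS blt
    INNER JOIN #DeleteBatch AS db
        ON blt.TransactionId = db.TransactionId;

    DROP TABLE #DeleteBatch;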

    • JohnCampbell says:

      That is the way that I have been doing large deletes for quite a few years. I build a temp table of the key values for the records that I am going to delete, select the top xxx number of records from the temp table, delete the working records and then the temp table records, wait x.x seconds, and do it again within a while loop. The number of records to be deleted at one time needs to be adjusted so that the bite is big enough to be meaningful, but does not take an excessive amount of time, since this is locking the table that you are working on. The x.x seconds for the wait is there to allow all of the backed-up records to process; I generally start with 1 second and go up from there, monitoring the locks in the database. I am probably taking smaller bites for each delete than others, but generally the users end up asking me when I am going to start on the delete operation as I am already finishing up. Getting the users to agree on what to delete is the biggest problem in my experience 🙂

  12. Great Post Glenn. I outlined a similar technique, which I call ‘Nibbling Deletes’ in the following video (for anyone who’d like to see a bit more background/info on how this works):
    http://www.sqlservervideos.com/video/nibbling-deletes/

  13. K Lindner says:

    My spin on it doesn’t implement a WAITFOR but I like the idea! I built mine to track execution time so that I could add an abort for modifications taking longer than n seconds.

    If you’re using an update, you’ll have to build exclusionary logic into your where clause to prevent endless looping (e.g., set x = 1 where x <> 1).

    declare @iRowCount bigint,
        @dtThen datetime,
        @iElapsed bigint,
        @iElapsedThreshold bigint,
        @iTotalRows bigint

    select @iRowCount = 10000,
        @iElapsedThreshold = 30, -- max seconds for a single batch to complete
        @iTotalRows = 0

    set rowcount 10000 -- number of rows to modify in each iteration
    set nocount on

    while @iRowCount > 0
    begin
        begin tran batchupdate
        select @dtThen = getdate()

        /*
        ** your DML here
        */

        select @iRowCount = @@ROWCOUNT
        commit tran batchupdate
        select @iTotalRows = @iTotalRows + @iRowCount
        select @iElapsed = datediff(ss, @dtThen, getdate())

        print 'Total Rows Affected: ' + convert(varchar(30), @iTotalRows) + ' Row(s) Affected: ' + convert(varchar(30), @iRowCount) + ' Elapsed: ' + convert(varchar(30), @iElapsed)

        if @iElapsed > @iElapsedThreshold
        begin
            print 'Aborting due to threshold violation.'
            select @iRowCount = 0
            break
        end
    end

    set nocount off
    set rowcount 0
    go
