Last active
February 14, 2023 12:08
-
-
Save seventhskye/0cc7b2804252975d36dca047ab7729e9 to your computer and use it in GitHub Desktop.
A script to delete all objects, versions and delete markers from an s3 bucket.
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
#!/usr/bin/env python | |
import boto3 | |
client = boto3.client('s3') | |
Bucket = 'a-bucket' | |
Prefix = 'a-prefix' # leave blank to delete the entire contents | |
IsTruncated = True | |
MaxKeys = 1000 | |
KeyMarker = None | |
while IsTruncated == True: | |
if not KeyMarker: | |
version_list = client.list_object_versions( | |
Bucket=Bucket, | |
MaxKeys=MaxKeys, | |
Prefix=Prefix) | |
else: | |
version_list = client.list_object_versions( | |
Bucket=Bucket, | |
MaxKeys=MaxKeys, | |
Prefix=Prefix, | |
KeyMarker=KeyMarker) | |
try: | |
objects = [] | |
versions = version_list['Versions'] | |
for v in versions: | |
objects.append({'VersionId':v['VersionId'],'Key': v['Key']}) | |
response = client.delete_objects(Bucket=Bucket,Delete={'Objects':objects}) | |
print response | |
except: | |
pass | |
try: | |
objects = [] | |
delete_markers = version_list['DeleteMarkers'] | |
for d in delete_markers: | |
objects.append({'VersionId':d['VersionId'],'Key': d['Key']}) | |
response = client.delete_objects(Bucket=Bucket,Delete={'Objects':objects}) | |
print response | |
except: | |
pass | |
IsTruncated = version_list['IsTruncated'] | |
KeyMarker = version_list['NextKeyMarker'] |
Since the KeyMarker property is no longer available at this point, I changed and tested the above code as following.
Heads up: ChatGPT could not help me with this requirement.
#!/usr/bin/env python
import boto3
def delete_markers(bucket_name, prefix):
client = boto3.client('s3')
Bucket = bucket_name
Prefix = prefix
IsTruncated = True
MaxKeys = 1000
while IsTruncated == True:
version_list = client.list_object_versions(
Bucket=Bucket,
MaxKeys=MaxKeys,
Prefix=Prefix
)
try:
objects = []
versions = version_list ['Versions']
for v in versions:
objects.append({'VersionId':v['VersionId'],'Key':v['Key']})
response = client.delete_objects(Bucket=Bucket,Delete={'Objects':objects})
print (response)
except:
pass
try:
objects = []
delete_markers = version_list['DeleteMarkers']
for d in delete_markers:
objects.append({'VersionId':d['VersionId'],'Key': d['Key']})
response = client.delete_objects(Bucket=Bucket,Delete={'Objects':objects})
print (response)
except:
pass
IsTruncated = version_list['IsTruncated']
delete_markers('my_bucket_name','my/path')
Hi @giansmart ,
Thanks a lot above script worked smootly :)
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Hi there!
Thanks a lot for this gist! I reused your code for a script that completely deletes the bucket's contents. I reworked and extended it a bit with an argument parser to be able to pass bucket name and profile. The user can also specify if he wants to delete the bucket after cleaning it.
It's work in progress but I'm happy to share it here, if anyone is interested: https://github.com/dimitrigraf/python-scripts/blob/main/empty-s3-bucket
I do have a question regarding the very last line of the script, where the
KeyMarker
variable is set. When I ran the script a few times on a test bucket with 1010 objects, the script fails in the very last loop becauseNextKeyMarker
cannot be found/looked up anymore. I guess that makes sense. But since I'm not very experienced with Python, I was wondering how to properly catch such exceptions.Any ideas?