@baojie
Created July 21, 2013 07:04
Python multiprocessing hello world. Split a list and process sublists in different jobs
import multiprocessing

# Split a list into consecutive chunks of n items (the last may be shorter)
def chunks(l, n):
    return [l[i:i+n] for i in range(0, len(l), n)]

# Work done by each child process: print every item with the job's id
def do_job(job_id, data_slice):
    for item in data_slice:
        print "job", job_id, item  # Python 2 print statement, as posted

# Cut data into chunks and start one process per chunk
def dispatch_jobs(data, job_number):
    total = len(data)
    chunk_size = total / job_number  # integer division under Python 2
    slice = chunks(data, chunk_size)
    jobs = []
    for i, s in enumerate(slice):
        j = multiprocessing.Process(target=do_job, args=(i, s))
        jobs.append(j)
    for j in jobs:
        j.start()

if __name__ == "__main__":
    data = ['a', 'b', 'c', 'd']
    dispatch_jobs(data, 2)
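
The listing above is written for Python 2: it uses the print statement and relies on `/` giving an integer for two integers. A minimal Python 3 sketch of the same idea follows. Three things here are assumptions and not part of the original gist: the `join()` loop (the original starts the processes but never waits for them), the `max(1, ...)` guard against a zero chunk size, and the renamed variables `items` and `slices`.

```python
import multiprocessing


def chunks(items, n):
    """Split items into consecutive slices of length n (the last may be shorter)."""
    return [items[i:i + n] for i in range(0, len(items), n)]


def do_job(job_id, data_slice):
    # Runs in a child process: print every item together with the job's id
    for item in data_slice:
        print("job", job_id, item)


def dispatch_jobs(data, job_number):
    # Floor division keeps chunk_size an int under Python 3.
    # Assumption: max(1, ...) is added so range() never receives a step of 0.
    chunk_size = max(1, len(data) // job_number)
    slices = chunks(data, chunk_size)
    jobs = [
        multiprocessing.Process(target=do_job, args=(i, s))
        for i, s in enumerate(slices)
    ]
    for j in jobs:
        j.start()
    # Assumption: not in the original; wait for every child before returning
    for j in jobs:
        j.join()


if __name__ == "__main__":
    dispatch_jobs(['a', 'b', 'c', 'd'], 2)
```

The order in which lines from different jobs appear is not guaranteed, because the child processes run concurrently.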
@manugarri

Very elegant. Thanks for sharing!

@JasonArnotClark

As someone just learning to program, this finally made multiprocessing make sense. Thanks for posting.

@TheReal1604

That is a nice example, thank you!

@itouchup commented Sep 5, 2018

Exactly what I needed to see. Thank you.

@dvandonkelaar

Good example for beginner programmers.

For Python 3.7, the line
slice = chunks(data, chunk_size)

needs to be changed to:
slice = chunks(data, int(chunk_size))

to avoid:
TypeError: 'float' object cannot be interpreted as an integer
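
The error comes from the division, not from `chunks` itself: in Python 3, `/` on two integers returns a float, and `range()` rejects a float step. A small sketch that reproduces this, using the same numbers as the gist (4 items, 2 jobs); the variable `error` is only there for illustration:

```python
total, job_number = 4, 2

# True division: the result is a float in Python 3, even when it divides evenly
chunk_size = total / job_number
print(type(chunk_size).__name__, chunk_size)

# range() only accepts integers, so a float step raises TypeError
try:
    range(0, total, chunk_size)
    error = None
except TypeError as exc:
    error = str(exc)
print("TypeError:", error)

# Converting the step to int makes the call valid again
starts = list(range(0, total, int(chunk_size)))
print(starts)
```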

@ratmcu commented Aug 8, 2019

Quoting @dvandonkelaar: for Python 3.7, change
slice = chunks(data, chunk_size)
to
slice = chunks(data, int(chunk_size))
to avoid
TypeError: 'float' object cannot be interpreted as an integer

Or use floor division when computing the chunk size:
chunk_size = total // job_number
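
Both fixes round the chunk size down, so when the list does not divide evenly the gist produces more chunks (and therefore more processes) than `job_number`. The sketch below contrasts floor division with ceiling division, which yields at most `job_number` chunks. The helper `chunk_size_for` is hypothetical and not part of the gist; `chunks` is the gist's own function.

```python
def chunks(items, n):
    # Same splitting logic as the gist
    return [items[i:i + n] for i in range(0, len(items), n)]


def chunk_size_for(total, job_number):
    # Hypothetical helper: ceiling division, never smaller than 1
    return max(1, -(-total // job_number))


data = ['a', 'b', 'c', 'd', 'e']

# Floor division: 5 // 2 == 2, so five items become three chunks for two jobs
floor_slices = chunks(data, len(data) // 2)
print(len(floor_slices), floor_slices)

# Ceiling division: the chunk size is 3, so the same items fit in two chunks
ceil_slices = chunks(data, chunk_size_for(len(data), 2))
print(len(ceil_slices), ceil_slices)
```

With floor division, a `job_number` larger than `len(data)` gives a chunk size of 0, which `range()` rejects with a `ValueError`; the `max(1, ...)` guard in the helper avoids that case.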

@huni1023

It works for me! Thanks.
