gevent.threadpool - A pool of native threads

- class ThreadPool(maxsize, hub=None, idle_task_timeout=-1)

Bases: GroupMappingMixin

A pool of native worker threads.
This can be useful for CPU-intensive functions, or those that otherwise will not cooperate with gevent. The best functions to execute in a thread pool are small, single-purpose functions that ideally release the CPython GIL; such functions are typically extension functions implemented in C.

It implements the same operations as a gevent.pool.Pool, but using threads instead of greenlets.

Note: The method apply_async() will always return a new greenlet, bypassing the threadpool entirely.

Most users will not need to create instances of this class. Instead, use the threadpool already associated with gevent's hub:

    pool = gevent.get_hub().threadpool
    result = pool.spawn(lambda: "Some func").get()
Important: It is only possible to use instances of this class from the thread running their hub. Typically that means from the thread that created them. Using the pattern shown above takes care of this.
There is no gevent-provided way to have a single process-wide limit on the number of threads in various pools when doing that, however. The suggested way to use gevent and threadpools is to have a single gevent hub and its one threadpool (which is the default without doing any extra work). Only dispatch minimal blocking functions to the threadpool, functions that do not use the gevent hub.
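The recommended pattern above can be sketched as follows. Here `blocking_square` is a hypothetical stand-in for the kind of minimal blocking function, one that does not use the gevent hub, that the docs suggest dispatching to the pool:

```python
import time

from gevent import get_hub

def blocking_square(n):
    # Stands in for blocking work (a C extension call, a syscall, ...)
    # that does not touch the gevent hub.
    time.sleep(0.01)
    return n * n

# Use the single threadpool already associated with gevent's hub,
# rather than creating a new ThreadPool instance.
pool = get_hub().threadpool
result = pool.spawn(blocking_square, 7).get()
```

Because there is one hub (and thus one pool) per process by default, this also keeps a single process-wide bound on the number of threads.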
The len of instances of this class is the number of enqueued (unfinished) tasks.

Just before a task starts running in a worker thread, the values of threading.setprofile() and threading.settrace() are consulted. Any values there are installed in that thread for the duration of the task (using sys.setprofile() and sys.settrace(), respectively). Because worker threads are long-lived and outlast any given task, this arrangement lets the hook functions change between tasks, but does not let them see the bookkeeping done by the worker thread itself.

Caution: Instances of this class are only true if they have unfinished tasks.

Changed in version 1.5a3: The undocumented apply_e function, deprecated since 1.1, was removed.

Changed in version 20.12.0: Install the profile and trace functions in the worker thread while the worker thread is running the supplied task.

Changed in version 22.08.0: Added the option to let idle threads expire and be removed from the pool after idle_task_timeout seconds (-1 for no timeout).
- apply(func, args=None, kwds=None)

Rough equivalent of the apply() builtin function, blocking until the result is ready and returning it.

The func will usually, but not always, be run in a way that allows the current greenlet to switch out (for example, in a new greenlet or thread, depending on implementation). But if the current greenlet or thread is already one that was spawned by this pool, the pool may choose to immediately run the func synchronously.

Any exception func raises will be propagated to the caller of apply (that is, this method will raise the exception that func raised).

Note: As implemented, attempting to use ThreadPool.apply() from inside another function that was itself spawned in a threadpool (any threadpool) will cause the function to be run immediately.

Changed in version 1.1a2: Now raises any exception raised by func instead of dropping it.
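A minimal sketch of the blocking call and the exception propagation described above, using the hub's default pool (the `divide` helper is illustrative):

```python
from gevent import get_hub

pool = get_hub().threadpool

def divide(a, b):
    return a / b

# apply() blocks the calling greenlet until the worker thread finishes,
# then returns the function's result.
quotient = pool.apply(divide, args=(10, 4))

# Exceptions raised by func are re-raised in the caller.
caught = False
try:
    pool.apply(divide, args=(1, 0))
except ZeroDivisionError:
    caught = True
```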
- apply_async(func, args=None, kwds=None, callback=None)

A variant of the apply() method which returns a Greenlet object.

When the returned greenlet gets to run, it will call apply(), passing in func, args and kwds.

If callback is specified, then it should be a callable which accepts a single argument. When the result becomes ready, callback is applied to it (unless the call failed).

This method will never block, even if this group is full (that is, even if spawn() would block, this method will not).

Caution: The returned greenlet may or may not be tracked as part of this group, so joining this group is not a reliable way to wait for the results to be available or for the returned greenlet to run; instead, join the returned greenlet.

Tip: Because ThreadPool objects do not track greenlets, the returned greenlet will never be a part of it. To reduce overhead and improve performance, Group and Pool may choose to track the returned greenlet. These are implementation details that may change.
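The caution above can be sketched like this: wait on the greenlet that apply_async() returns, not on the pool itself (a hypothetical minimal example):

```python
from gevent import get_hub

pool = get_hub().threadpool
results = []

# apply_async() returns immediately with a Greenlet; nothing has run yet.
glet = pool.apply_async(sum, args=([1, 2, 3],), callback=results.append)

# Per the caution: join the returned greenlet, not the pool,
# to wait for the result (the callback receives it when ready).
glet.join()
```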
- apply_cb(func, args=None, kwds=None, callback=None)

apply() the given func(*args, **kwds), and, if a callback is given, run it with the results of func (unless an exception was raised).

The callback may be called synchronously or asynchronously. If called asynchronously, it will not be tracked by this group. (Group and Pool call it asynchronously in a new greenlet; ThreadPool calls it synchronously in the current greenlet.)
- imap(func, *iterables, maxsize=None) → iterable

An equivalent of itertools.imap(), operating in parallel. The func is applied to each element yielded from each iterable in iterables in turn, collecting the result.

If this object has a bound on the number of active greenlets it can contain (such as Pool), then at most that number of tasks will operate in parallel.

Parameters: maxsize (int) – If given and not-None, specifies the maximum number of finished results that will be allowed to accumulate awaiting the reader; more than that number of results will cause map function greenlets to begin to block. This is most useful when there is a great disparity between the speed of the mapping code and the speed of the consumer, and the results consume a great deal of resources.

Note: This is separate from any bound on the number of active parallel tasks, though they may have some interaction (for example, limiting the number of parallel tasks to the smallest bound).

Note: Using a bound is slightly more computationally expensive than not using a bound.

Tip: The imap_unordered() method makes much better use of this parameter. Some additional, unspecified, number of objects may be required to be kept in memory to maintain order by this function.

Returns: An iterable object.

Changed in version 1.1b3: Added the maxsize keyword parameter.

Changed in version 1.1a1: Accept multiple iterables to iterate in parallel.
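A short sketch of imap() with the maxsize bound described above (the `square` helper is illustrative):

```python
from gevent import get_hub

pool = get_hub().threadpool

def square(n):
    return n * n

# Results arrive in input order; maxsize=2 limits how many finished
# results may accumulate ahead of the consumer.
squares = list(pool.imap(square, range(5), maxsize=2))
```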
- imap_unordered(func, *iterables, maxsize=None) → iterable

The same as imap(), except that the results from the returned iterator may arrive in arbitrary order.

This is lighter weight than imap() and should be preferred if order doesn't matter.

See also: imap() for more details.
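Because ordering is arbitrary, results from imap_unordered() are best compared as a set, as in this small sketch (the `cube` helper is illustrative):

```python
from gevent import get_hub

pool = get_hub().threadpool

def cube(n):
    return n ** 3

# Results may arrive in any order, so collect them into a set.
cubes = set(pool.imap_unordered(cube, [1, 2, 3]))
```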
- map(func, iterable)

Return a list made by applying the func to each element of the iterable.
- map_async(func, iterable, callback=None)#
A variant of the map() method which returns a Greenlet object that is executing the map function.
If callback is specified then it should be a callable which accepts a single argument.
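A minimal sketch of map_async(): the returned greenlet runs the map, and the callback (if given) receives the whole result list:

```python
from gevent import get_hub

pool = get_hub().threadpool
collected = []

# map_async() returns a Greenlet executing the map function;
# the callback receives the complete result list when it finishes.
glet = pool.map_async(str.upper, ["a", "b"], callback=collected.append)
glet.join()
```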
- spawn(func, *args, **kwargs)

Add a new task to the threadpool that will run func(*args, **kwargs).

Waits until a slot is available. Creates a new native thread if necessary.

This must only be called from the native thread that owns this object's hub. This is because creating the necessary data structures to communicate back to this thread isn't thread safe, so the hub must not be running something else. Also, ensuring the pool size stays correct only works within a single thread.

Raises: InvalidThreadUseError – If called from a different thread.

Changed in version 1.5: Document the thread-safety requirements.
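A small sketch of spawn() on a dedicated pool, created and used from the thread running its hub as required above:

```python
from gevent.threadpool import ThreadPool

# A dedicated pool of at most 2 worker threads.
pool = ThreadPool(2)

task = pool.spawn(pow, 2, 10)
# get() blocks only the current greenlet while the worker thread runs.
value = task.get()
pool.join()
```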
- property maxsize
The maximum allowed number of worker threads.
This is also (approximately) a limit on the number of tasks that can be queued without blocking the waiting greenlet. If this many tasks are already running, then the next greenlet that submits a task will block waiting for a task to finish.
- class ThreadPoolExecutor(*args, **kwargs)

Bases: ThreadPoolExecutor

A version of concurrent.futures.ThreadPoolExecutor that always uses native threads, even when threading is monkey-patched.

The Future objects returned from this object can be used with gevent waiting primitives like gevent.wait().

Caution: If threading is not monkey-patched, then the Future objects returned by this object are not guaranteed to work with as_completed() and wait(). The individual blocking methods like result() and exception() will always work.

New in version 1.2a1: This is a provisional API.

Takes the same arguments as concurrent.futures.ThreadPoolExecutor, which vary between Python versions. The first argument is always max_workers, the maximum number of threads to use. Most other arguments, while accepted, are ignored.
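A minimal sketch of the executor: submit() returns a Future, and the individual blocking methods such as result() work whether or not threading is monkey-patched:

```python
from gevent.threadpool import ThreadPoolExecutor

executor = ThreadPoolExecutor(max_workers=2)
future = executor.submit(sorted, [3, 1, 2])

# result() blocks until the worker thread finishes; per the caution
# above, this works with or without monkey-patching.
ordered = future.result()
executor.shutdown()
```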
- kill(wait=True, **kwargs)

Clean up the resources associated with the Executor.

It is safe to call this method several times; once it has been called, no other methods may be called.

Args:
    wait: If True, then shutdown will not return until all running futures have finished executing and the resources used by the executor have been reclaimed.
    cancel_futures: If True, then shutdown will cancel all pending futures. Futures that are completed or running will not be cancelled.
- shutdown(wait=True, **kwargs)

Clean up the resources associated with the Executor.

It is safe to call this method several times; once it has been called, no other methods may be called.

Args:
    wait: If True, then shutdown will not return until all running futures have finished executing and the resources used by the executor have been reclaimed.
    cancel_futures: If True, then shutdown will cancel all pending futures. Futures that are completed or running will not be cancelled.