python - 延迟调用函数

标签 python multithreading job-scheduling

我得到了这个面试练习题。

实现一个作业调度器,它接受一个函数 f 和一个整数 n,并在 n 毫秒后调用 f。

我有一个非常简单的解决方案:

import time

def schedulerX(f,n):
    time.sleep(0.001*n)
    f

但是,建议的解决方案更加详细,如下所示。 我不明白所有这些额外代码背后的目的是什么。 请赐教。

from time import sleep
import threading

class Scheduler:
    def __init__(self):
        self.fns = [] # tuple of (fn, time)
        t = threading.Thread(target=self.poll)
        t.start()

    def poll(self):
        while True:
            now = time() * 1000
            for fn, due in self.fns:
                if now > due:
                    fn()
            self.fns = [(fn, due) for (fn, due) in self.fns if due > now]
            sleep(0.01)

    def delay(self, f, n):
        self.fns.append((f, time() * 1000 + n))

最佳答案

正如其他人所指出的,您的解决方案是“阻塞”的:它可以防止在等待运行时发生任何其他事情。建议的解决方案的目的是让您安排工作,然后同时处理其他事情。

至于对建议代码的作用的解释:

您首先要创建一个 Scheduler,它会启动自己的线程,在后台有效运行,并运行作业。

scheduler = Scheduler()

在您的代码中,您可以安排任何您想要的作业,而无需等待它们运行:

def my_recurring_job():
    # Do some stuff in the background, then re-run this job again
    # in one second.

    ### Do some stuff ###

    scheduler.delay(my_recurring_job, 1000)

scheduler.delay(lambda: print("5 seconds passed!"), 5 * 1000)
scheduler.delay(lambda: print("2 hours passed!"), 2 * 60 * 60 * 1000)
scheduler.delay(my_recurring_job, 1000)

# You can keep doing other stuff without waiting

调度程序的线程只是在它的poll 方法中永远循环,运行任何时间到了的作业,然后休眠 0.01 秒并再次检查。代码中有一个小错误,如果 now == due,作业将不会运行,但也不会保留以备后用。它应该是 if now >= due: 而不是。

更高级的调度程序可能会使用 threading.Condition 而不是每秒轮询 100 次:

import threading
from time import time

class Scheduler:
    def __init__(self):
        self.fns = [] # tuple of (fn, time)

        # The lock prevents 2 threads from messing with fns at the same time;
        # also lets us use Condition
        self.lock = threading.RLock()

        # The condition lets one thread wait, optionally with a timeout,
        # and lets other threads wake it up
        self.condition = threading.Condition(self.lock)

        t = threading.Thread(target=self.poll)
        t.start()

    def poll(self):
        while True:
            now = time() * 1000

            with self.lock:
                # Prevent the other thread from adding to fns while we're sorting
                # out the jobs to run now, and the jobs to keep for later

                to_run = [fn for fn, due in self.fns if due <= now]
                self.fns = [(fn, due) for (fn, due) in self.fns if due > now]

            # Run all the ready jobs outside the lock, so we don't keep it
            # locked longer than we have to
            for fn in to_run:
                fn()

            with self.lock:
                if not self.fns:
                    # If there are no more jobs, wait forever until a new job is 
                    # added in delay(), and notify_all() wakes us up again
                    self.condition.wait()
                else:
                    # Wait only until the soonest next job's due time.
                    ms_remaining = min(due for fn, due in self.fns) - time()*1000
                    if ms_remaining > 0:
                        self.condition.wait(ms_remaining / 1000)

    def delay(self, f, n):
        with self.lock:
            self.fns.append((f, time() * 1000 + n))

            # If the scheduler thread is currently waiting on the condition,
            # notify_all() will wake it up, so that it can consider the new job's
            # due time.
            self.condition.notify_all()

关于python - 延迟调用函数,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/55642383/

相关文章:

python - BeautifulSoup KeyError 问题

python - 合并到不同的词典

multithreading - 创建 BackgroundWorkers 的线程似乎将 Completed 事件排队

c# - 刷新高速缓存时如何提供过时的数据?

C# (WPF) 异步线程与 GUI 接口(interface)

c# - "Scheduler already exists issue"仍然存在

kubernetes - 由于验证错误,无法使用 kubectl 创建作业

python - Pandas 标准偏差返回 NaN

python - Scrapy 将唯一网址过滤为重复网址

android - GcmNetworkManager 调度问题