c - pthread C 中的线程队列 - Web 服务器响应流水线

我有一个用 C 实现的类似 HTTP Apache 的 Web 服务器，我的问题是我不知道如何初始化队列(以及如何将线程排入队列)，主要是因为我不确定如何在继续当前线程之前检查是否有前一个线程要加入。

服务器可以利用管道请求来提高其响应速度，使用线程更复杂的方式:Web 服务器可以为每个新请求生成一个新线程资源，同时准备回应；但是，由于资源必须归还按照服务器接收请求的顺序 (FIFO) 发送给客户端，它将在各个响应线程之间采取协调阶段。

这个协调阶段是通过实现一种“医生候诊室”来实现的每个病人在进入时都会询问谁是最后到达的，并记录下来并只有当前面的人离开后，他才进入医生的办公室。这样一来，每个人都拥有了队列的部分 View (只关心一个人)，但是这个部分 View 允许正确的 FIFO队列的实现。

以下是我必须做什么的描述:

同样，每个新线程都必须存储处理前一个线程的标识符使用系统调用 pthread_join() 请求并等待其终止。第一个线程，显然，不必等待任何人，最后一个线程必须由主线程等待在关闭连接本身之前处理该连接上的请求的线程返回等待新的连接请求。

我在正确初始化 to_join 数据结构时遇到问题，主要是因为我不明白如何计算要加入的线程的索引 i。 - 我如何区分第一个和最后一个线程在指针数组中？

这是代码(我只能在 TO BE DONE START 和 TO BE DONE END 注释之间进行修改):

#include "incApache.h"

pthread_mutex_t accept_mutex = PTHREAD_MUTEX_INITIALIZER;
pthread_mutex_t mime_mutex = PTHREAD_MUTEX_INITIALIZER;

int client_sockets[MAX_CONNECTIONS]; /* for each connection, its socket FD */
int no_response_threads[MAX_CONNECTIONS]; /* for each connection, how many response threads */

pthread_t thread_ids[MAX_THREADS];
int connection_no[MAX_THREADS]; /* connection_no[i] >= 0 means that i-th thread belongs to connection connection_no[i] */
pthread_t *to_join[MAX_THREADS]; /* for each thread, the pointer to the previous (response) thread, if any */

int no_free_threads = MAX_THREADS - 2 * MAX_CONNECTIONS; /* each connection has one thread listening and one reserved for replies */
struct response_params thread_params[MAX_THREADS - MAX_CONNECTIONS]; /* params for the response threads (the first MAX_CONNECTIONS threads are waiting/parsing requests) */

pthread_mutex_t threads_mutex = PTHREAD_MUTEX_INITIALIZER; /* protects the access to thread-related data structures */

pthread_t thread_ids[MAX_CONNECTIONS];
int connection_no[MAX_CONNECTIONS];

void *client_connection_thread(void *vp) {
    int client_fd;
    struct sockaddr_storage client_addr;
    socklen_t addr_size;
    pthread_mutex_lock(&threads_mutex);
    int connection_no = *((int *) vp);

    /*** properly initialize the thread queue to_join ***/
/*** TO BE DONE 3.1 START ***/
        //to_join[0] = thread_ids[new_thread_idx];
    //pthread_t *first;     Am I perhaps supposed to initialize the to_join data structure as a queue with two pointers
    //pthread_t *last;      indicating the first and last element? How can I do it on an array of pointers?
/*** TO BE DONE 3.1 END ***/

    pthread_mutex_unlock(&threads_mutex);
#endif
    for (;;) {
        addr_size = sizeof(client_addr);
        pthread_mutex_lock(&accept_mutex);
        if ((client_fd = accept(listen_fd, (struct sockaddr *) &client_addr, &addr_size)) == -1)
            fail_errno("Cannot accept client connection");
        pthread_mutex_unlock(&accept_mutex);
        client_sockets[connection_no] = client_fd;
        char str[INET_ADDRSTRLEN];
        struct sockaddr_in *ipv4 = (struct sockaddr_in *) &client_addr;
        printf("Accepted connection from %s\n", inet_ntop(AF_INET, &(ipv4->sin_addr), str, INET_ADDRSTRLEN));
        manage_http_requests(client_fd
                , connection_no);
    }
}

#pragma clang diagnostic pop
void send_resp_thread(int out_socket, int response_code, int cookie,
              int is_http1_0, int connection_idx, int new_thread_idx,
              char *filename, struct stat *stat_p)
{
    struct response_params *params =  thread_params + (new_thread_idx - MAX_CONNECTIONS);
    debug(" ... send_resp_thread(): idx=%lu\n", (unsigned long)(params - thread_params));
    params->code = response_code;
    params->cookie = cookie;
    params->is_http1_0 = is_http1_0;
    params->filename = filename ? my_strdup(filename) : NULL;
    params->p_stat = stat_p;
    pthread_mutex_lock(&threads_mutex);
    connection_no[new_thread_idx] = connection_idx;
    debug(" ... send_resp_thread(): parameters set, conn_no=%d\n", connection_idx);

    /*** enqueue the current thread in the "to_join" data structure ***/
/*** TO BE DONE 3.1 START ***/
    //Again, should I use a standard enqueue implementation? But then how would I keep track of the last node ot arrive?
/*** TO BE DONE 3.1 END ***/

    if (pthread_create(thread_ids + new_thread_idx, NULL, response_thread, connection_no + new_thread_idx))
        fail_errno("Could not create response thread");
    pthread_mutex_unlock(&threads_mutex);
    debug(" ... send_resp_thread(): new thread created\n");
}

void *response_thread(void *vp)
{
    size_t thread_no = ((int *) vp) - connection_no;
    int connection_idx = *((int *) vp);
    debug(" ... response_thread() thread_no=%lu, conn_no=%d\n", (unsigned long) thread_no, connection_idx);
    const size_t i = thread_no - MAX_CONNECTIONS;
    send_response(client_sockets[connection_idx],
              thread_params[i].code,
              thread_params[i].cookie,
              thread_params[i].is_http1_0,
              (int)thread_no,
              thread_params[i].filename,
              thread_params[i].p_stat);
    debug(" ... response_thread() freeing filename and stat\n");
    free(thread_params[i].filename);
    free(thread_params[i].p_stat);
    return NULL;
}

最佳答案

I am having trouble initializing properly the to_join data structure, mostly because I don't understand how to compute the index i of the thread to join.- how can I differenciate the first and last thread in an array of pointers?

赋值与初始化不同，对一个元素的操作与对整个数组的操作不同。据我所知，您实际上并没有在该函数中初始化 to_join (因此该注释具有误导性)。相反，您只需为单个元素分配适当的值。

该分析是根据我对各种全局变量的名称、范围和文档注释以及相关函数的名称、签名和初始行的解释得出的:

各个数组似乎保存与多个连接的多个线程相关的数据，因为文件范围 connection_no 数组之一的作用是将线程与连接关联起来。
该函数似乎是与连接相关的线程的线程启动函数。
当任何其他与连接相关的线程正在运行时启动的任何线程都不应执行除设置与其自身相关的数据之外的任何操作，以免破坏其他线程和连接所依赖的数据。

现在，至于实际问题——如何确定新线程应加入哪个线程？ 你不能。至少，不只依赖问题中提供的模板代码，未经修改。^*

假设，如果您可以访问将线程与连接关联起来的 connection_no 数组的版本，那么您可以使用它来查找与当前关联的所有线程的索引联系。然后，您可以从相应的 thread_ids 数组中获取它们的线程 ID(请注意，这里存在另一个名称冲突)，并从 join_to 数组中获取它们的连接目标。连接的第一个线程是不与另一个线程连接的线程，最后一个线程是不与任何其他线程连接的线程。这种分析并不完全简单，但也没有真正的技巧。细节保留为应有的练习。

但是，即使解决了文件范围名称冲突，您也无法执行上述分析，因为文件范围 connection_no 数组被整个区域内同名的局部变量遮蔽您可以在其中插入代码。^*

另请注意，您似乎需要为新线程选择一个线程索引，该索引通常不会为 0。看起来您需要扫描 thread_ids 或 connection_no 数组来查找可用索引。

<小时/>

^*除非你作弊。我的目的是让您(仅)将代码插入到 client_connection_thread 函数的主体中，但事实上，您可以通过将代码插入到指定区域来将该函数拆分为两个或多个。如果在实践中假设 connection_no 和 thread_ids 的第二个文件范围声明被忽略或丢失，则拆分该函数可以为阴影问题提供解决方法。例如:

    /*** properly initialize the thread queue to_join ***/
/*** TO BE DONE 3.1 START ***/

    return client_connection_thread_helper1(connection_no);
}  // end of function

// parameter 'con' is the number of this thread's connection
void *client_connection_thread_helper1(int con) {
    int my_index;
    // ... Find an available thread index (TODO: what if there isn't one?) ...
    thread_ids[my_index] = pthread_self();
    connection_no[my_index] = con;  // connection_no is not shadowed in this scope

    pthread_t *last = NULL;
    // ... Find the last (other) thread associated with connection 'con', if any ...
    // You can determine the first, too, but that does not appear to be required.

    to_join[my_index] = last;

    return client_connection_thread_helper2(con);
}

// A second additional function is required for the remaining bits of
// client_connection_thread(), because they need the local connection_no
void *client_connection_thread_helper2(int connection_no) {
    int client_fd;
    struct sockaddr_storage client_addr;
    socklen_t addr_size;

/*** TO BE DONE 3.1 END ***/


    pthread_mutex_unlock(&threads_mutex);

我认为弄清楚这种功能拆分的需求和实现可能是练习的一部分，但这将是一个肮脏的伎俩，总体而言，练习似乎更有可能只是形式不佳.

关于c - pthread C 中的线程队列 - Web 服务器响应流水线，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/59435445/

c - pthread C 中的线程队列 - Web 服务器响应流水线

上一篇：c - 将元素添加到有序链接列表

下一篇：c - 服务器没有在预期的位置监听