git.droids-corp.org - dpdk.git/commitdiff
common/mlx5: fix queue pair ack timeout configuration
author Yajun Wu <yajunw@nvidia.com>
Mon, 14 Feb 2022 06:03:19 +0000 (08:03 +0200)
committer Raslan Darawsheh <rasland@nvidia.com>
Wed, 23 Feb 2022 14:57:30 +0000 (15:57 +0100)
The vDPA driver creates two QPs per virtio queue (one queue pair
consists of one send queue and one receive queue) to deliver traffic
events from the NIC to SW.
The two QPs, called the FW QP and the SW QP, are created as loopback
QPs, and the FW QP's SQ is connected to the SW QP's RQ internally.

When a packet is received or sent, HW sends a WQE through the FW QP's
SQ, and SW then gets a CQE from the CQ of the SW QP.

At large scale and under heavy traffic, the SQ's request may fail to
get an ACK from the RQ HW because the HW is busy.
The SQ retries the request up to qpc.retry_count times, each time
waiting 4.096 us * 2^(ack_timeout) for the response. If it still gets
no response from the RQ HW, the SQ goes to an error state.

16 is an empirically chosen value. It should be neither too high nor
too low. Too high makes the QP wait too long when a packet is actually
dropped. Too low makes the QP go to an error state (retry-exceeded)
too easily.
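
For a sense of scale, the timeout formula above can be evaluated for
the previous value (14) and the new value (16). The following is only
a minimal sketch: the helper names are hypothetical and are not part of
the driver, and the total assumes the SQ waits one full timeout for
each of the retry_count retries, which is a simplification.

```c
#include <assert.h>
#include <stdint.h>

/* Base unit from the commit message: 4.096 us, expressed in nanoseconds. */
#define ACK_TIMEOUT_BASE_NS 4096ULL

/* Hypothetical helper: per-attempt wait of 4.096 us * 2^(ack_timeout). */
static uint64_t
ack_timeout_ns(unsigned int ack_timeout)
{
	/* Assumption: the field is small enough that the shift cannot overflow. */
	assert(ack_timeout < 32);
	return ACK_TIMEOUT_BASE_NS << ack_timeout;
}

/*
 * Hypothetical helper: rough upper bound on the time spent waiting,
 * assuming one full timeout per retry (a simplification).
 */
static uint64_t
total_retry_wait_ns(unsigned int ack_timeout, unsigned int retry_count)
{
	return ack_timeout_ns(ack_timeout) * retry_count;
}
```

Under these assumptions, ack_timeout_ns(14) is 67108864 ns (about 67 ms)
and ack_timeout_ns(16) is 268435456 ns (about 268 ms). With
retry_count = 7, the total wait before the error state grows from
roughly 0.47 s to roughly 1.88 s.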

Fixes: 15c3807e86ab ("common/mlx5: support DevX QP operations")
Cc: stable@dpdk.org
Signed-off-by: Yajun Wu <yajunw@nvidia.com>
Acked-by: Matan Azrad <matan@nvidia.com>
drivers/common/mlx5/mlx5_devx_cmds.c

index 2e807a08297d7030025980fab678ac1e33d83514..7732613c6990a2c59888635df228b361f1fdcf35 100644
@@ -2279,7 +2279,7 @@ mlx5_devx_cmd_modify_qp_state(struct mlx5_devx_obj *qp, uint32_t qp_st_mod_op,
        case MLX5_CMD_OP_RTR2RTS_QP:
                qpc = MLX5_ADDR_OF(rtr2rts_qp_in, &in, qpc);
                MLX5_SET(rtr2rts_qp_in, &in, qpn, qp->id);
-               MLX5_SET(qpc, qpc, primary_address_path.ack_timeout, 14);
+               MLX5_SET(qpc, qpc, primary_address_path.ack_timeout, 16);
                MLX5_SET(qpc, qpc, log_ack_req_freq, 0);
                MLX5_SET(qpc, qpc, retry_count, 7);
                MLX5_SET(qpc, qpc, rnr_retry, 7);