mempool/octeontx2: add optimized dequeue operation for arm64
authorPavan Nikhilesh <pbhagavatula@marvell.com>
Sat, 22 Jun 2019 13:24:15 +0000 (18:54 +0530)
committerThomas Monjalon <thomas@monjalon.net>
Tue, 25 Jun 2019 21:35:57 +0000 (23:35 +0200)
commitee338015e7a9f589aee597c74ecbb68aadf10331
tree34f3a67fc929b241f65e964bf720d21806e5d18e
parentd7a0da3c0043612346bc7a410b9586244cda8e4a
mempool/octeontx2: add optimized dequeue operation for arm64

This patch adds an optimized arm64 instruction based routine to leverage
CPU pipeline characteristics of octeontx2. The theme is to fill the
pipeline with CASP operations as much HW can do so that HW can do alloc()
HW ops in full throttle.

Signed-off-by: Pavan Nikhilesh <pbhagavatula@marvell.com>
Signed-off-by: Jerin Jacob <jerinj@marvell.com>
Signed-off-by: Vamsi Attunuru <vattunuru@marvell.com>
drivers/mempool/octeontx2/otx2_mempool_ops.c