mlx4: improve Rx performance with better prefetching