+* **Added write combining store APIs.**
+
+ Added ``rte_write32_wc`` and ``rte_write32_wc_relaxed`` APIs
+ that enable write combining stores (depending on architecture).
+ The functions are provided as a generic stubs and
+ x86 specific implementation.
+
+* **Added prefetch with intention to write APIs.**
+
+ Added new prefetch function variants e.g. ``rte_prefetch0_write``,
+ which allow the programmer to prefetch a cache line and also indicate
+ the intention to write.
+
+* **Added the rte_cldemote API.**
+
+ Added a hardware hint CLDEMOTE, which is similar to prefetch in reverse.
+ CLDEMOTE moves the cache line to the more remote cache, where it expects
+ sharing to be efficient. Moving the cache line to a level more distant from
+ the processor helps to accelerate core-to-core communication.
+ This API is specific to x86 and implemented as a stub for other
+ architectures.
+
+* **Added support for limiting maximum SIMD bitwidth.**
+
+ Added a new EAL config setting ``max_simd_bitwidth`` to limit the vector
+ path selection at runtime. This value can be set by apps using the
+ ``rte_vect_set_max_simd_bitwidth`` function, or by the user with EAL flag
+ ``--force-max-simd-bitwidth``.
+
+* **Added zero copy APIs for rte_ring.**
+
+ For rings with producer/consumer in ``RTE_RING_SYNC_ST``, ``RTE_RING_SYNC_MT_HTS``
+ modes, these APIs split enqueue/dequeue operation into three phases
+ (enqueue/dequeue start, copy data to/from ring, enqueue/dequeue finish).
+ Along with the advantages of the peek APIs, these provide the ability to
+ copy the data to the ring memory directly without the need for temporary
+ storage.
+
+* **Updated CRC modules of the net library.**
+
+ * Added runtime selection of the optimal architecture-specific CRC path.
+ * Added optimized implementations of CRC32-Ethernet and CRC16-CCITT
+ using the AVX512 and VPCLMULQDQ instruction sets.
+
+* **Introduced extended buffer description for receiving.**
+
+ Added the extended Rx buffer description for Rx queue setup routine
+ providing the individual settings for each Rx segment with maximal size,
+ buffer offset and memory pool to allocate data buffers from.
+
+* **Added the FEC API, for a generic FEC query and config.**
+
+ Added the FEC API which provides functions for query FEC capabilities and
+ current FEC mode from device. Also, API for configuring FEC mode is also provided.
+
+* **Added thread safety to rte_flow functions.**
+
+ Added ``RTE_ETH_DEV_FLOW_OPS_THREAD_SAFE`` device flag to indicate
+ whether PMD supports thread safe operations. If PMD doesn't set the flag,
+ rte_flow API level functions will protect the flow operations with mutex.
+
+* **Added flow-based traffic sampling support.**
+
+ Added new action: ``RTE_FLOW_ACTION_TYPE_SAMPLE`` to duplicate the matching
+ packets with specified ratio, and apply with own set of actions with a fate
+ action. When the ratio is set to 1 then the packets will be 100% mirrored.
+
+* **Added support of shared action in flow API.**
+
+ Added shared action support to utilize single flow action in multiple flow
+ rules. An update of shared action configuration alters the behavior of all
+ flow rules using it.
+
+ * Added new action: ``RTE_FLOW_ACTION_TYPE_SHARED`` to use shared action
+ as flow action.
+ * Added new flow APIs to create/update/destroy/query shared action.
+
+* **Flow rules allowed to use private PMD items / actions.**
+
+ * Flow rule verification was updated to accept private PMD
+ items and actions.
+
+* **Added generic API to offload tunneled traffic and restore missed packet.**
+
+ * Added a new hardware independent helper to flow API that
+ offloads tunneled traffic and restores missed packets.
+
+* **Updated the ethdev library to support hairpin between two ports.**
+
+ New APIs are introduced to support binding / unbinding 2 ports hairpin.
+ Hairpin Tx part flow rules can be inserted explicitly.
+ New API is added to get the hairpin peer ports list.
+