QoS Scheduler Sample Application
================================
The QoS sample application demonstrates the use of the DPDK to provide QoS scheduling.
Overview
--------

The architecture of the QoS scheduler application is shown in the following figure.

.. _figure_qos_sched_app_arch:

.. figure:: img/qos_sched_app_arch.*

   QoS Scheduler Application Architecture
There are two flavors of runtime execution for this application,
using either two or three threads per packet flow configuration.
The RX thread reads packets from the RX port,
classifies the packets based on the double VLAN (outer and inner) and
the lower two bytes of the IP destination address, and puts them into the ring queue.
The worker thread dequeues the packets from the ring and calls the QoS scheduler enqueue/dequeue functions.
If a separate TX core is used, the packets are then sent to the TX ring.
Otherwise, they are sent directly to the TX port.
The TX thread, if present, reads from the TX ring and writes the packets to the TX port.
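
The worker stage described above is built around two librte_sched calls,
rte_sched_port_enqueue() and rte_sched_port_dequeue().
The following is a minimal sketch of such a worker loop, not the sample application's actual code;
the ring handling, burst sizes and drop policy are illustrative assumptions,
and the older three-argument ring burst calls are assumed:

.. code-block:: c

    #include <rte_ring.h>
    #include <rte_sched.h>
    #include <rte_mbuf.h>

    #define BURST_SIZE 64  /* matches the application's default QoS enqueue size */

    /* Hypothetical worker loop: drain the RX software ring, pass the burst
     * through the QoS scheduler, then hand the survivors to the TX ring. */
    static void
    worker_loop(struct rte_ring *rx_ring, struct rte_ring *tx_ring,
                struct rte_sched_port *port)
    {
        struct rte_mbuf *mbufs[BURST_SIZE];

        for (;;) {
            /* Read a burst of classified packets from the RX thread. */
            unsigned nb_rx = rte_ring_dequeue_burst(rx_ring,
                    (void **)mbufs, BURST_SIZE);

            if (nb_rx > 0)
                /* Packets that do not fit their queue are dropped inside. */
                rte_sched_port_enqueue(port, mbufs, nb_rx);

            /* Pull up to 32 packets (the default QoS dequeue size) back
             * out in scheduling order. */
            unsigned nb_tx = rte_sched_port_dequeue(port, mbufs, 32);

            if (nb_tx > 0) {
                unsigned sent = rte_ring_enqueue_burst(tx_ring,
                        (void **)mbufs, nb_tx);
                /* Free whatever did not fit into the TX ring. */
                while (sent < nb_tx)
                    rte_pktmbuf_free(mbufs[sent++]);
            }
        }
    }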
Compiling the Application
-------------------------

To compile the application:

#.  Go to the sample application directory:

    .. code-block:: console

        export RTE_SDK=/path/to/rte_sdk
        cd ${RTE_SDK}/examples/qos_sched
#.  Set the target (a default target is used if not specified). For example:

    .. note::

        This application is intended as a linuxapp only.

    .. code-block:: console

        export RTE_TARGET=x86_64-native-linuxapp-gcc
#.  Build the application:

    .. code-block:: console

        make

.. note::

    To get statistics on the sample app using the command line interface as described in the next section,
    DPDK must be compiled defining *CONFIG_RTE_SCHED_COLLECT_STATS*,
    which can be done by changing the configuration file for the specific target to be compiled.
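
For example, the flag could be switched on in the common build configuration before compiling.
The file name below assumes releases where this option lives in config/common_linuxapp;
check where *CONFIG_RTE_SCHED_COLLECT_STATS* is defined in your DPDK version if it differs:

.. code-block:: console

    sed -i 's/CONFIG_RTE_SCHED_COLLECT_STATS=n/CONFIG_RTE_SCHED_COLLECT_STATS=y/' \
            ${RTE_SDK}/config/common_linuxapp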
Running the Application
-----------------------

.. note::

    In order to run the application, a total of at least 4 GB of hugepages must be set up
    for each of the sockets in use (depending on the cores in use).
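
For example, hugepages could be reserved per NUMA node through sysfs as sketched below;
the page size, page count and mount point are illustrative assumptions,
so adjust them to your system (see the *DPDK Getting Started Guide* for the full procedure):

.. code-block:: console

    # Reserve 4 x 1 GB hugepages on socket 0; repeat for each socket in use
    echo 4 > /sys/devices/system/node/node0/hugepages/hugepages-1048576kB/nr_hugepages
    mkdir -p /mnt/huge
    mount -t hugetlbfs nodev /mnt/huge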
The application has a number of command line options:

.. code-block:: console

    ./qos_sched [EAL options] -- <APP PARAMS>
Mandatory application parameters include:

*   --pfc "RX PORT, TX PORT, RX LCORE, WT LCORE, TX CORE": Packet flow configuration.
    Multiple pfc entities can be configured on the command line,
    each having 4 or 5 items (depending on whether a TX core is defined or not).
Optional application parameters include:

*   -i: Starts the application in interactive mode.
    In this mode, the application shows a command line that can be used for obtaining statistics while
    scheduling is taking place (see interactive mode below for more information).
*   --mst n: Master core index (the default value is 1).
*   --rsz "A, B, C": Ring sizes:

    *   A = Size (in number of buffer descriptors) of each of the NIC RX rings read
        by the I/O RX lcores (the default value is 128).

    *   B = Size (in number of elements) of each of the software rings used
        by the I/O RX lcores to send packets to worker lcores (the default value is 8192).

    *   C = Size (in number of buffer descriptors) of each of the NIC TX rings written
        by worker lcores (the default value is 256).
*   --bsz "A, B, C, D": Burst sizes:

    *   A = I/O RX lcore read burst size from the NIC RX (the default value is 64).

    *   B = I/O RX lcore write burst size to the output software rings,
        worker lcore read burst size from input software rings, and QoS enqueue size (the default value is 64).

    *   C = QoS dequeue size (the default value is 32).

    *   D = Worker lcore write burst size to the NIC TX (the default value is 64).
*   --msz M: Mempool size (in number of mbufs) for each pfc (the default value is 2097152).

*   --rth "A, B, C": The RX queue threshold parameters:

    *   A = RX prefetch threshold (the default value is 8).

    *   B = RX host threshold (the default value is 8).

    *   C = RX write-back threshold (the default value is 4).

*   --tth "A, B, C": The TX queue threshold parameters:

    *   A = TX prefetch threshold (the default value is 36).

    *   B = TX host threshold (the default value is 0).

    *   C = TX write-back threshold (the default value is 0).

*   --cfg FILE: Profile configuration to load.
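
As an illustration, the ring, burst and threshold defaults listed above could also be passed
explicitly (the port and lcore numbers in this command are assumptions for the example):

.. code-block:: console

    ./qos_sched -l 1,6,7 -n 4 -- --pfc "3,2,6,7" --rsz "128, 8192, 256" \
            --bsz "64, 64, 32, 64" --rth "8, 8, 4" --tth "36, 0, 0" --cfg ./profile.cfg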
Refer to *DPDK Getting Started Guide* for general information on running applications and
the Environment Abstraction Layer (EAL) options.

The profile configuration file defines all the port/subport/pipe/traffic class/queue parameters
needed for the QoS scheduler configuration.

The profile file has the following format::
    ; port configuration [port]

    number of subports per port = 1
    number of pipes per subport = 4096
    queue sizes = 64 64 64 64

    ; Subport configuration

    [subport 0]
    tb rate = 1250000000; Bytes per second
    tb size = 1000000; Bytes
    tc 0 rate = 1250000000; Bytes per second
    tc 1 rate = 1250000000; Bytes per second
    tc 2 rate = 1250000000; Bytes per second
    tc 3 rate = 1250000000; Bytes per second
    tc period = 10; Milliseconds
    tc oversubscription period = 10; Milliseconds

    pipe 0-4095 = 0; These pipes are configured with pipe profile 0

    ; Pipe configuration

    [pipe profile 0]
    tb rate = 305175; Bytes per second
    tb size = 1000000; Bytes

    tc 0 rate = 305175; Bytes per second
    tc 1 rate = 305175; Bytes per second
    tc 2 rate = 305175; Bytes per second
    tc 3 rate = 305175; Bytes per second
    tc period = 40; Milliseconds

    tc 0 oversubscription weight = 1
    tc 1 oversubscription weight = 1
    tc 2 oversubscription weight = 1
    tc 3 oversubscription weight = 1

    tc 0 wrr weights = 1 1 1 1
    tc 1 wrr weights = 1 1 1 1
    tc 2 wrr weights = 1 1 1 1
    tc 3 wrr weights = 1 1 1 1

    ; RED params per traffic class and color (Green / Yellow / Red)

    [red]
    tc 0 wred min = 48 40 32
    tc 0 wred max = 64 64 64
    tc 0 wred inv prob = 10 10 10
    tc 0 wred weight = 9 9 9

    tc 1 wred min = 48 40 32
    tc 1 wred max = 64 64 64
    tc 1 wred inv prob = 10 10 10
    tc 1 wred weight = 9 9 9

    tc 2 wred min = 48 40 32
    tc 2 wred max = 64 64 64
    tc 2 wred inv prob = 10 10 10
    tc 2 wred weight = 9 9 9

    tc 3 wred min = 48 40 32
    tc 3 wred max = 64 64 64
    tc 3 wred inv prob = 10 10 10
    tc 3 wred weight = 9 9 9
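
These profile entries map onto the parameter structures of the librte_sched API that the
application fills in when loading the file. The following is a minimal sketch of that mapping,
not the application's actual loader; only a subset of the fields is shown, the field names
follow the rte_sched.h of the releases this guide covers, and the values simply mirror the
profile above:

.. code-block:: c

    #include <rte_sched.h>

    /* Illustrative translation of the [subport 0] section above. */
    static struct rte_sched_subport_params subport_params = {
        .tb_rate = 1250000000,    /* tb rate, bytes per second */
        .tb_size = 1000000,       /* tb size, bytes */
        .tc_rate = {1250000000, 1250000000, 1250000000, 1250000000},
        .tc_period = 10,          /* tc period, milliseconds */
    };

    /* Illustrative translation of the [pipe profile 0] section above. */
    static struct rte_sched_pipe_params pipe_profile = {
        .tb_rate = 305175,        /* tb rate, bytes per second */
        .tb_size = 1000000,       /* tb size, bytes */
        .tc_rate = {305175, 305175, 305175, 305175},
        .tc_period = 40,          /* tc period, milliseconds */
        /* One WRR weight per queue: 4 traffic classes x 4 queues. */
        .wrr_weights = {1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1, 1},
    };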
Interactive mode
~~~~~~~~~~~~~~~~

These are the commands that are currently working under the command line interface:
*   Control Commands

    *   --quit: Quits the application.

*   General Statistics

    *   stats app: Shows a table with in-app calculated statistics.
    *   stats port X subport Y: For a specific subport, it shows the number of packets that
        went through the scheduler properly and the number of packets that were dropped.
        The same information is shown in bytes.
        The information is displayed in a table, separated into the different traffic classes.
    *   stats port X subport Y pipe Z: For a specific pipe, it shows the number of packets that
        went through the scheduler properly and the number of packets that were dropped.
        The same information is shown in bytes.
        This information is displayed in a table, separated into the individual queues.
*   Average queue size

All of these commands work the same way, averaging the number of packets throughout a specific subset of queues.

Two parameters can be configured for this prior to calling any of these commands:

*   qavg n X: X is the number of times that the calculation will take place.
    Bigger numbers provide higher accuracy. The default value is 10.

*   qavg period X: X is the number of microseconds that will be allowed between each calculation.
    The default value is 100.
The commands that can be used for measuring average queue size are:

*   qavg port X subport Y: Show average queue size per subport.

*   qavg port X subport Y tc Z: Show average queue size per subport for a specific traffic class.

*   qavg port X subport Y pipe Z: Show average queue size per pipe.

*   qavg port X subport Y pipe Z tc A: Show average queue size per pipe for a specific traffic class.

*   qavg port X subport Y pipe Z tc A q B: Show average queue size of a specific queue.
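
For example, a short interactive session might look as follows;
the prompt string and the port, pipe and queue indices are illustrative assumptions:

.. code-block:: console

    qos_sched> qavg n 32
    qos_sched> qavg period 100
    qos_sched> stats port 3 subport 0
    qos_sched> qavg port 3 subport 0 pipe 10 tc 2 q 1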
Example
~~~~~~~

The following is an example command with a single packet flow configuration:

.. code-block:: console

    ./qos_sched -l 1,5,7 -n 4 -- --pfc "3,2,5,7" --cfg ./profile.cfg

This example uses a single packet flow configuration which creates one RX thread on lcore 5 reading
from port 3 and a worker thread on lcore 7 writing to port 2.
Another example, with 2 packet flow configurations using different ports but sharing the same core for the QoS scheduler, is given below:

.. code-block:: console

    ./qos_sched -l 1,2,6,7 -n 4 -- --pfc "3,2,2,6,7" --pfc "1,0,2,6,7" --cfg ./profile.cfg
Note that independent cores for each of the RX, WT and TX threads of a packet flow configuration are also supported,
providing flexibility to balance the work.
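
For instance, the following sketch dedicates one lcore to each stage of a single packet flow
configuration (the port and lcore numbers are illustrative assumptions):

.. code-block:: console

    ./qos_sched -l 1,2,3,4 -n 4 -- --pfc "0,1,2,3,4" --cfg ./profile.cfg

Here lcore 2 runs the RX thread for port 0, lcore 3 runs the worker (QoS scheduler) thread and
lcore 4 runs the TX thread for port 1.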
The EAL coremask/corelist is constrained to contain the default master core 1 and the RX, WT and TX cores only.
Explanation
-----------

The Port/Subport/Pipe/Traffic Class/Queue are the hierarchical entities in a typical QoS application:

*   A subport represents a predefined group of users.

*   A pipe represents an individual user/subscriber.

*   A traffic class is the representation of a different traffic type with specific loss rate,
    delay and jitter requirements, such as voice, video or data transfers.

*   A queue hosts packets from one or multiple connections of the same type belonging to the same user.

The traffic flows that need to be configured are application dependent.
This application classifies based on the QinQ double VLAN tags and the IP destination address as indicated in the following table.
.. _table_qos_scheduler_1:

.. table:: Entity Types

    +----------------+-------------------------+--------------------------------------------------+----------------------------------+
    | **Level Name** | **Siblings per Parent** | **QoS Functional Description**                   | **Selected By**                  |
    +================+=========================+==================================================+==================================+
    | Port           | -                       | Ethernet port                                    | Physical port                    |
    +----------------+-------------------------+--------------------------------------------------+----------------------------------+
    | Subport        | Config (8)              | Traffic shaped (token bucket)                    | Outer VLAN tag                   |
    +----------------+-------------------------+--------------------------------------------------+----------------------------------+
    | Pipe           | Config (4k)             | Traffic shaped (token bucket)                    | Inner VLAN tag                   |
    +----------------+-------------------------+--------------------------------------------------+----------------------------------+
    | Traffic Class  | 4                       | TCs of the same pipe serviced in strict priority | Destination IP address (0.0.X.0) |
    +----------------+-------------------------+--------------------------------------------------+----------------------------------+
    | Queue          | 4                       | Queues of the same TC serviced in WRR            | Destination IP address (0.0.0.X) |
    +----------------+-------------------------+--------------------------------------------------+----------------------------------+
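
In code, this classification amounts to stamping each mbuf with its subport/pipe/traffic
class/queue coordinates before the QoS enqueue, which librte_sched supports through
rte_sched_port_pkt_write(). The sketch below illustrates the mapping in the table above and is
not the sample application's actual classifier; the header offsets, mask constants and fixed
green color are assumptions, and the pre-19.11 function signature is assumed:

.. code-block:: c

    #include <rte_byteorder.h>
    #include <rte_mbuf.h>
    #include <rte_meter.h>
    #include <rte_sched.h>

    /* Illustrative sizing, matching the hierarchy in the table above. */
    #define N_SUBPORTS 8
    #define N_PIPES    4096
    #define N_TCS      4
    #define N_QUEUES   4

    /* Hypothetical classifier for a QinQ IPv4 frame:
     * Ethernet addresses (12) + outer TPID/TCI (4) + inner TPID/TCI (4)
     * + EtherType (2), so the IPv4 header starts at offset 22 and its
     * destination address at offset 38. */
    static void
    classify_pkt(struct rte_mbuf *m)
    {
        const uint8_t *data = rte_pktmbuf_mtod(m, const uint8_t *);
        uint16_t outer_vlan = rte_be_to_cpu_16(
                *(const uint16_t *)(data + 14)) & 0x0FFF;
        uint16_t inner_vlan = rte_be_to_cpu_16(
                *(const uint16_t *)(data + 18)) & 0x0FFF;
        const uint8_t *dst_ip = data + 38;

        uint32_t subport = outer_vlan & (N_SUBPORTS - 1); /* outer VLAN tag */
        uint32_t pipe    = inner_vlan & (N_PIPES - 1);    /* inner VLAN tag */
        uint32_t tc      = dst_ip[2] & (N_TCS - 1);       /* 0.0.X.0 */
        uint32_t queue   = dst_ip[3] & (N_QUEUES - 1);    /* 0.0.0.X */

        /* Store the coordinates in the mbuf for rte_sched_port_enqueue(). */
        rte_sched_port_pkt_write(m, subport, pipe, tc, queue,
                e_RTE_METER_GREEN);
    }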
Please refer to the "QoS Scheduler" chapter in the *DPDK Programmer's Guide* for more information about these parameters.