mirror-linux

History

Davide Caratti 4c8fc3fe28 act_mirred: use the backlog for nested calls to mirred ingress [ Upstream commit `ca22da2fbd` ] William reports kernel soft-lockups on some OVS topologies when TC mirred egress->ingress action is hit by local TCP traffic [1]. The same can also be reproduced with SCTP (thanks Xin for verifying), when client and server reach themselves through mirred egress to ingress, and one of the two peers sends a "heartbeat" packet (from within a timer). Enqueueing to backlog proved to fix this soft lockup; however, as Cong noticed [2], we should preserve - when possible - the current mirred behavior that counts as "overlimits" any eventual packet drop subsequent to the mirred forwarding action [3]. A compromise solution might use the backlog only when tcf_mirred_act() has a nest level greater than one: change tcf_mirred_forward() accordingly. Also, add a kselftest that can reproduce the lockup and verifies TC mirred ability to account for further packet drops after TC mirred egress->ingress (when the nest level is 1). [1] https://lore.kernel.org/netdev/33dc43f587ec1388ba456b4915c75f02a8aae226.1663945716.git.dcaratti@redhat.com/ [2] https://lore.kernel.org/netdev/Y0w%2FWWY60gqrtGLp@pop-os.localdomain/ [3] such behavior is not guaranteed: for example, if RPS or skb RX timestamping is enabled on the mirred target device, the kernel can defer receiving the skb and return NET_RX_SUCCESS inside tcf_mirred_forward(). Reported-by: William Zhao <wizhao@redhat.com> CC: Xin Long <lucien.xin@gmail.com> Signed-off-by: Davide Caratti <dcaratti@redhat.com> Reviewed-by: Marcelo Ricardo Leitner <marcelo.leitner@gmail.com> Acked-by: Jamal Hadi Salim <jhs@mojatatu.com> Signed-off-by: Paolo Abeni <pabeni@redhat.com> Signed-off-by: Sasha Levin <sashal@kernel.org>		2023-03-30 12:49:20 +02:00
..
.gitignore	…
Makefile	Merge git://git.kernel.org/pub/scm/linux/kernel/git/netdev/net	2022-07-14 15:27:35 -07:00
README	…
bridge_igmp.sh	selftests: net: bridge: update IGMP/MLD membership interval value	2021-10-29 13:58:21 +01:00
bridge_locked_port.sh	selftests: forwarding: Use same VRF for port and VLAN upper	2022-03-22 11:09:24 +01:00
bridge_mdb.sh	selftests: forwarding: new test, verify host mdb entries	2022-04-12 10:06:53 +02:00
bridge_mdb_port_down.sh	selftest: net: bridge mdb add/del entry to port that is down	2022-07-04 10:30:06 +01:00
bridge_mld.sh	selftests: net: bridge: update IGMP/MLD membership interval value	2021-10-29 13:58:21 +01:00
bridge_port_isolation.sh	…
bridge_sticky_fdb.sh	…
bridge_vlan_aware.sh	selftests: net: bridge: Parameterize ageing timeout	2022-02-03 14:05:56 +00:00
bridge_vlan_mcast.sh	selftests: net: bridge: fix typo in vlan_filtering dependency test	2021-11-29 12:49:53 +00:00
bridge_vlan_unaware.sh	selftests: net: bridge: Parameterize ageing timeout	2022-02-03 14:05:56 +00:00
config	selftests: add a test case for mirred egress to ingress	2021-11-16 19:17:38 -08:00
custom_multipath_hash.sh	selftests: forwarding: Fix failing tests with old libnet	2022-08-10 22:36:14 -07:00
devlink_lib.sh	selftests: devlink_lib: Add function for querying maximum pool size	2022-09-19 18:07:59 -07:00
dual_vxlan_bridge.sh	selftests: forwarding: Add test for dual VxLAN bridge	2021-03-17 12:26:28 -07:00
ethtool.sh	…
ethtool_extended_state.sh	selftests: forwarding: ethtool_extended_state: Convert to busywait	2022-06-29 14:01:23 +01:00
ethtool_lib.sh	net: selftests: Add lanes setting test	2021-02-03 18:37:29 -08:00
fib_offload_lib.sh	selftests: fib offload: use sensible tos values	2022-02-03 19:11:21 -08:00
forwarding.config.sample	selftests: net: bridge: Parameterize ageing timeout	2022-02-03 14:05:56 +00:00
gre_custom_multipath_hash.sh	selftests: forwarding: Fix failing tests with old libnet	2022-08-10 22:36:14 -07:00
gre_inner_v4_multipath.sh	…
gre_inner_v6_multipath.sh	…
gre_multipath.sh	…
gre_multipath_nh.sh	…
gre_multipath_nh_res.sh	selftests: forwarding: Add resilient multipath tunneling nexthop test	2021-03-12 17:44:10 -08:00
hw_stats_l3.sh	selftests: lib: Add a generic helper for obtaining HW stats	2022-05-08 11:46:20 +01:00
hw_stats_l3_gre.sh	selftests: forwarding: Add a tunnel-based test for L3 HW stats	2022-05-08 11:46:21 +01:00
ip6_forward_instats_vrf.sh	ipv6: When forwarding count rx stats on the orig netdev	2021-10-15 15:32:04 -07:00
ip6gre_custom_multipath_hash.sh	selftests: forwarding: Fix failing tests with old libnet	2022-08-10 22:36:14 -07:00
ip6gre_flat.sh	selftests: forwarding: Add IPv6 GRE flat tests	2021-10-08 16:40:58 +01:00
ip6gre_flat_key.sh	selftests: forwarding: Add IPv6 GRE flat tests	2021-10-08 16:40:58 +01:00
ip6gre_flat_keys.sh	selftests: forwarding: Add IPv6 GRE flat tests	2021-10-08 16:40:58 +01:00
ip6gre_hier.sh	selftests: forwarding: Add IPv6 GRE hierarchical tests	2021-10-08 16:40:58 +01:00
ip6gre_hier_key.sh	selftests: forwarding: Add IPv6 GRE hierarchical tests	2021-10-08 16:40:58 +01:00
ip6gre_hier_keys.sh	selftests: forwarding: Add IPv6 GRE hierarchical tests	2021-10-08 16:40:58 +01:00
ip6gre_inner_v4_multipath.sh	…
ip6gre_inner_v6_multipath.sh	…
ip6gre_lib.sh	selftests: forwarding: Add IPv6 GRE hierarchical tests	2021-10-08 16:40:58 +01:00
ipip_flat_gre.sh	…
ipip_flat_gre_key.sh	…
ipip_flat_gre_keys.sh	…
ipip_hier_gre.sh	…
ipip_hier_gre_key.sh	…
ipip_hier_gre_keys.sh	…
ipip_lib.sh	…
lib.sh	selftests: forwarding: lib: quote the sysctl values	2023-02-14 19:11:48 +01:00
local_termination.sh	selftests: forwarding: add a test for local_termination.sh	2022-04-23 12:18:16 +01:00
loopback.sh	…
mirror_gre.sh	…
mirror_gre_bound.sh	…
mirror_gre_bridge_1d.sh	…
mirror_gre_bridge_1d_vlan.sh	selftests: forwarding: Fix packet matching in mirroring selftests	2021-11-10 14:38:44 +00:00
mirror_gre_bridge_1q.sh	selftests: mirror_gre_bridge_1q: Avoid changing PVID while interface is operational	2022-05-03 11:21:14 +02:00
mirror_gre_bridge_1q_lag.sh	selftests: mirror_gre_bridge_1q_lag: Enslave port to bridge before other configurations	2022-06-17 10:31:33 +01:00
mirror_gre_changes.sh	selftests: forwarding: Fix packet matching in mirroring selftests	2021-11-10 14:38:44 +00:00
mirror_gre_flower.sh	…
mirror_gre_lag_lacp.sh	…
mirror_gre_lib.sh	…
mirror_gre_neigh.sh	…
mirror_gre_nh.sh	…
mirror_gre_topo_lib.sh	…
mirror_gre_vlan.sh	…
mirror_gre_vlan_bridge_1q.sh	selftests: forwarding: Fix packet matching in mirroring selftests	2021-11-10 14:38:44 +00:00
mirror_lib.sh	selftests: forwarding: Fix packet matching in mirroring selftests	2021-11-10 14:38:44 +00:00
mirror_topo_lib.sh	…
mirror_vlan.sh	selftests: forwarding: Fix packet matching in mirroring selftests	2021-11-10 14:38:44 +00:00
no_forwarding.sh	selftests: forwarding: add a no_forwarding.sh test	2022-04-23 12:18:16 +01:00
pedit_dsfield.sh	selftests: Clean forgotten resources as part of cleanup()	2021-06-08 14:39:07 -07:00
pedit_ip.sh	selftests: forwarding: Add a test for pedit munge SIP and DIP	2022-02-07 11:59:57 +00:00
pedit_l4port.sh	selftests: Clean forgotten resources as part of cleanup()	2021-06-08 14:39:07 -07:00
q_in_vni.sh	…
q_in_vni_ipv6.sh	selftests: forwarding: Add Q-in-VNI test for IPv6	2021-12-22 17:14:34 -08:00
router.sh	selftests: router.sh: Add a diagram	2022-05-04 11:21:32 +01:00
router_bridge.sh	…
router_bridge_vlan.sh	…
router_broadcast.sh	…
router_mpath_nh.sh	selftests/net: Use kselftest skip code for skipped tests	2021-08-24 16:49:09 -07:00
router_mpath_nh_res.sh	selftests/net: Use kselftest skip code for skipped tests	2021-08-24 16:49:09 -07:00
router_multicast.sh	selftests: forwarding: Add test cases for unresolved multicast routes	2022-09-20 08:22:15 -07:00
router_multipath.sh	…
router_nh.sh	…
router_vid_1.sh	selftests: router_vid_1: Add a diagram, fix coding style	2022-05-04 11:21:32 +01:00
sch_ets.sh	…
sch_ets_core.sh	…
sch_ets_tests.sh	…
sch_red.sh	selftests: forwarding: add shebang for sch_red.sh	2022-09-22 07:33:56 -07:00
sch_tbf_core.sh	…
sch_tbf_ets.sh	…
sch_tbf_etsprio.sh	selftests: mlxsw: Test port shaper	2021-10-28 19:47:50 -07:00
sch_tbf_prio.sh	…
sch_tbf_root.sh	…
skbedit_priority.sh	selftests: Clean forgotten resources as part of cleanup()	2021-06-08 14:39:07 -07:00
tc_actions.sh	act_mirred: use the backlog for nested calls to mirred ingress	2023-03-30 12:49:20 +02:00
tc_chains.sh	…
tc_common.sh	testing: selftests: tc_common: Add tc_check_at_least_x_packets()	2021-10-08 16:40:58 +01:00
tc_flower.sh	net: re-solve some conflicts after net -> net-next merge	2021-02-16 23:12:23 -08:00
tc_flower_router.sh	…
tc_mpls_l2vpn.sh	…
tc_police.sh	net/sched: act_police: more accurate MTU policing	2022-02-14 11:15:04 +00:00
tc_shblocks.sh	…
tc_vlan_modify.sh	…
tsn_lib.sh	selftests: net: tsn_lib: run phc2sys in automatic mode	2022-09-26 13:22:01 -07:00
vxlan_asymmetric.sh	selftests: net: Fix typo 'the the' in comment	2022-07-26 20:26:58 -07:00
vxlan_asymmetric_ipv6.sh	selftests: forwarding: Add a test for VxLAN asymmetric routing with IPv6	2021-12-22 17:14:34 -08:00
vxlan_bridge_1d.sh	selftests: forwarding: vxlan_bridge_1d: Add more ECN decap test cases	2021-03-29 13:29:49 -07:00
vxlan_bridge_1d_ipv6.sh	selftests: forwarding: Add VxLAN tests with a VLAN-unaware bridge for IPv6	2021-12-22 17:14:32 -08:00
vxlan_bridge_1d_port_8472.sh	…
vxlan_bridge_1d_port_8472_ipv6.sh	selftests: forwarding: Add VxLAN tests with a VLAN-unaware bridge for IPv6	2021-12-22 17:14:32 -08:00
vxlan_bridge_1q.sh	selftests: forwarding: vxlan_bridge_1q: Remove unused function	2021-12-22 17:14:33 -08:00
vxlan_bridge_1q_ipv6.sh	selftests: forwarding: Add VxLAN tests with a VLAN-aware bridge for IPv6	2021-12-22 17:14:33 -08:00
vxlan_bridge_1q_port_8472.sh	…
vxlan_bridge_1q_port_8472_ipv6.sh	selftests: forwarding: Add VxLAN tests with a VLAN-aware bridge for IPv6	2021-12-22 17:14:33 -08:00
vxlan_symmetric.sh	…
vxlan_symmetric_ipv6.sh	selftests: forwarding: Add a test for VxLAN symmetric routing with IPv6	2021-12-22 17:14:34 -08:00

README

Motivation
==========

One of the nice things about network namespaces is that they allow one
to easily create and test complex environments.

Unfortunately, these namespaces can not be used with actual switching
ASICs, as their ports can not be migrated to other network namespaces
(NETIF_F_NETNS_LOCAL) and most of them probably do not support the
L1-separation provided by namespaces.

However, a similar kind of flexibility can be achieved by using VRFs and
by looping the switch ports together. For example:

                             br0
                              +
               vrf-h1         |           vrf-h2
                 +        +---+----+        +
                 |        |        |        |
    192.0.2.1/24 +        +        +        + 192.0.2.2/24
               swp1     swp2     swp3     swp4
                 +        +        +        +
                 |        |        |        |
                 +--------+        +--------+

The VRFs act as lightweight namespaces representing hosts connected to
the switch.

This approach for testing switch ASICs has several advantages over the
traditional method that requires multiple physical machines, to name a
few:

1. Only the device under test (DUT) is being tested without noise from
other system.

2. Ability to easily provision complex topologies. Testing bridging
between 4-ports LAGs or 8-way ECMP requires many physical links that are
not always available. With the VRF-based approach one merely needs to
loopback more ports.

These tests are written with switch ASICs in mind, but they can be run
on any Linux box using veth pairs to emulate physical loopbacks.

Guidelines for Writing Tests
============================

o Where possible, reuse an existing topology for different tests instead
  of recreating the same topology.
o Tests that use anything but the most trivial topologies should include
  an ASCII art showing the topology.
o Where possible, IPv6 and IPv4 addresses shall conform to RFC 3849 and
  RFC 5737, respectively.
o Where possible, tests shall be written so that they can be reused by
  multiple topologies and added to lib.sh.
o Checks shall be added to lib.sh for any external dependencies.
o Code shall be checked using ShellCheck [1] prior to submission.

1. https://www.shellcheck.net/