excessive FRR logs about non-existent VRFs
Open, Normal, Public, BUG

Assigned To

None

Authored By

	aserkin
	Oct 6 2022, 10:44 AM

Description

Whenever a subscriber connects to or disconnects from the remote-access L2TP VPN, every routing protocol daemon logs an error message like this:

Oct 06 13:29:20 bgpd[923]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 ripd[930]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 ripngd[933]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 ospfd[936]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 ospf6d[939]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 ldpd[957]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 eigrpd[961]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 staticd[964]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 bfdd[967]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79
Oct 06 13:29:20 isisd[200162]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79

The client has nothing to do with VRF 79; moreover, there is no VRF table 79 on the system. The number is usually random: it can be 0 or any other value.
The messages are very annoying, and their severity level (error) is unjustifiably high. On a system with several thousand clients, the log file grows to several GB in less than an hour. We can't filter out errors from the routing protocols, as we would then lose the BGP peering events.
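Since filtering by severity would also drop the BGP peering events, one alternative is to match on the message itself. The following is a minimal sketch of that idea as a log post-processing step, assuming the log ID and error code stay as shown in the lines above. It is not an FRR or syslog feature, and the second sample line is a hypothetical placeholder, not real FRR output.

```python
import re

# Matches only the specific message from this report, using the log ID,
# error code, and text visible in the log excerpt. Everything else passes.
NOISE = re.compile(
    r"\[VCGF0-X62M1\]\[EC 100663301\] "
    r"INTERFACE_STATE: Cannot find IF \S+ in VRF \d+"
)


def drop_noise(lines):
    """Yield every line that is not the 'Cannot find IF' message."""
    for line in lines:
        if not NOISE.search(line):
            yield line


sample = [
    "Oct 06 13:29:20 bgpd[923]: [VCGF0-X62M1][EC 100663301] "
    "INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79",
    # Hypothetical stand-in for a BGP peering event; the wording is invented.
    "Oct 06 13:29:21 bgpd[923]: example peering event that must be kept",
]

kept = list(drop_noise(sample))
for line in kept:
    print(line)
```

Matching on the message text rather than the severity keeps all other error-level messages from the same daemons.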

Details

Version: 1.4-rolling-202209131208
Is it a breaking change?: Unspecified (possibly destroys the router)
Issue type: Bug (incorrect behavior)

Event Timeline

aserkin created this task. Oct 6 2022, 10:44 AM

pasik subscribed. Oct 6 2022, 11:37 AM

Hi @aserkin! It looks like your FRR server is misbehaving: it sends up/down events with a nonexistent VRF ID.
Could you describe the setup that causes the issue to appear? Thanks

v.huti claimed this task. Oct 6 2022, 2:29 PM

v.huti triaged this task as Normal priority.

v.huti edited a custom field.

This is a project for mobile access to enterprise networks. VyOS acts as an MPLS PE router as well as an L2TP Network Server. Every subscriber coming in via L2TP is placed into the customer's VRF rather than the default one (selected by a RADIUS attribute).

vyos-lns-1.cfg (8 KB)

l2tp-connect-disconnect.log (9 KB)

Attached are the log file and the configuration from our lab. The production system tested has about 200 VRFs configured.

aserkin added a comment. Oct 6 2022, 4:59 PM

This comment was removed by aserkin.

Any suggestions on the problem, guys?
I see many reports in the FRR community, going back to 2017 or even earlier, of these messages appearing in various scenarios, but I did not find an actual solution.

@aserkin, as a workaround, try changing the log facility:

vtysh -c "conf t" -c "log facility local0"

But it can affect the BGP logs.

That does not change the behavior. I get five messages on session start from the bfdd, bgpd, and ospfd processes, and 16 messages from all FRR daemons on session stop.
The only way to get rid of them is 'log syslog emergencies', but this filters out important events as well.
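To check counts like these against a captured log, the messages can be tallied per daemon and per reported VRF ID. This is a rough sketch that assumes the syslog line layout shown in the description; the function name `tally` is made up for illustration.

```python
import re
from collections import Counter

# Layout assumed from the excerpt in the description:
# "<Mon> <DD> <HH:MM:SS> <daemon>[<pid>]: ... Cannot find IF <if> in VRF <id>"
LINE = re.compile(
    r"^\w{3} \d{2} \d{2}:\d{2}:\d{2} (?P<daemon>[\w-]+)\[\d+\]: "
    r".*INTERFACE_STATE: Cannot find IF (?P<ifname>\S+) in VRF (?P<vrf>\d+)"
)


def tally(lines):
    """Count 'Cannot find IF' messages per daemon and per reported VRF ID."""
    per_daemon = Counter()
    per_vrf = Counter()
    for line in lines:
        match = LINE.match(line)
        if match:
            per_daemon[match.group("daemon")] += 1
            per_vrf[int(match.group("vrf"))] += 1
    return per_daemon, per_vrf


sample = [
    "Oct 06 13:29:20 bgpd[923]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79",
    "Oct 06 13:29:20 ospfd[936]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79",
    "Oct 06 13:29:20 bfdd[967]: [VCGF0-X62M1][EC 100663301] INTERFACE_STATE: Cannot find IF l2tp0 in VRF 79",
]

daemons, vrfs = tally(sample)
print(dict(daemons))
print(dict(vrfs))
```

Running it over the lines logged around a single session start or stop would show which daemons emit the message and which VRF IDs they report.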

Added more bgpd/ospfd events to the log. The VRF ID seems to be correct, but the events look odd. After session start, the interface is first created in vrf default (vrf default, id:0), followed by bgpd/ospfd events; then the accel-ppp process moves it to the destination VRF (vrf client, id:5), which is followed by the bgpd/ospfd errors.
Finally, at around 5000 sessions, bgpd unexpectedly becomes unresponsive and uses 200% CPU (the VM has 8 cores). The accel-pppd process, with all network destinations unreachable, also becomes unresponsive a bit later.
After that we have to reboot.
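The ordering described in this comment (interface created in vrf default, then moved to vrf client) can be pictured with a toy model. This is purely illustrative: the class below is hypothetical, is not taken from the FRR source, and the mismatch it shows is only one possible reading of the logs, not a confirmed cause.

```python
class InterfaceTable:
    """Toy per-VRF interface table. Hypothetical, not FRR's data structure."""

    def __init__(self):
        self._by_vrf = {}  # vrf_id -> set of interface names

    def add(self, vrf_id, ifname):
        self._by_vrf.setdefault(vrf_id, set()).add(ifname)

    def state_change(self, vrf_id, ifname):
        """Return an error string if the interface is unknown in that VRF."""
        if ifname not in self._by_vrf.get(vrf_id, set()):
            return f"Cannot find IF {ifname} in VRF {vrf_id}"
        return None


table = InterfaceTable()
table.add(0, "l2tp0")  # interface first appears in vrf default (id 0)

# Lookup in the VRF where the interface was recorded succeeds.
ok = table.state_change(0, "l2tp0")

# Assumption: a state event tagged with vrf client (id 5) arrives before
# this table has learned about the move, so the lookup fails.
stale = table.state_change(5, "l2tp0")

print(ok)
print(stale)
```

In this model the error appears whenever a state event carries a VRF ID under which the table has not recorded the interface, which matches the shape of the logged message but says nothing about why it happens in FRR.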

l2tp-session-start-stop.log (13 KB)

syncer edited projects, added VyOS 1.4 Sagitta (1.4.0-GA); removed VyOS 1.4 Sagitta. May 7 2024, 8:03 AM

dmbaturin added a project: Bugs. Sep 15 2024, 5:03 PM

syncer removed v.huti as the assignee of this task. Oct 30 2024, 1:50 PM

syncer edited projects, added VyOS Rolling; removed VyOS 1.4 Sagitta (1.4.0-GA).

syncer moved this task from Need Triage to Backlog - Bug on the VyOS Rolling board.

syncer added a subscriber: Global Notifications. Nov 1 2024, 9:19 PM

	F3206166: vyos-lns-1.cfg
	Oct 6 2022, 4:24 PM
