<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://mediawiki.comfac.net/index.php?action=history&amp;feed=atom&amp;title=SOP%3A_Network_Troubleshooting_%26_pfSense_Monitoring_251130</id>
	<title>SOP: Network Troubleshooting &amp; pfSense Monitoring 251130 - Revision history</title>
	<link rel="self" type="application/atom+xml" href="https://mediawiki.comfac.net/index.php?action=history&amp;feed=atom&amp;title=SOP%3A_Network_Troubleshooting_%26_pfSense_Monitoring_251130"/>
	<link rel="alternate" type="text/html" href="https://mediawiki.comfac.net/index.php?title=SOP:_Network_Troubleshooting_%26_pfSense_Monitoring_251130&amp;action=history"/>
	<updated>2026-06-05T11:03:04Z</updated>
	<subtitle>Revision history for this page on the wiki</subtitle>
	<generator>MediaWiki 1.45.1</generator>
	<entry>
		<id>https://mediawiki.comfac.net/index.php?title=SOP:_Network_Troubleshooting_%26_pfSense_Monitoring_251130&amp;diff=79&amp;oldid=prev</id>
		<title>CITEditor: Created page with &quot;= SOP: Network Troubleshooting &amp; pfSense Monitoring =  &#039;&#039;&#039;Source:&#039;&#039;&#039; [https://www.youtube.com/watch?v=utZ6kQpGRoc Lawrence Systems: pfSense Packet Loss and Latency Monitoring Guide]  &#039;&#039;&#039;Purpose:&#039;&#039;&#039; To standardize the diagnosis of intermittent internet connectivity issues using the &quot;Fault Isolation&quot; methodology and pfSense Gateway Monitoring tools.&lt;br&gt; &#039;&#039;&#039;Target Audience:&#039;&#039;&#039; IT Support, Network Administrators, and Technical Staff.  == Part 1: The Methodology (Fault Isolat...&quot;</title>
		<link rel="alternate" type="text/html" href="https://mediawiki.comfac.net/index.php?title=SOP:_Network_Troubleshooting_%26_pfSense_Monitoring_251130&amp;diff=79&amp;oldid=prev"/>
		<updated>2026-02-25T07:22:53Z</updated>

		<summary type="html">&lt;p&gt;Created page with &amp;quot;= SOP: Network Troubleshooting &amp;amp; pfSense Monitoring =  &amp;#039;&amp;#039;&amp;#039;Source:&amp;#039;&amp;#039;&amp;#039; [https://www.youtube.com/watch?v=utZ6kQpGRoc Lawrence Systems: pfSense Packet Loss and Latency Monitoring Guide]  &amp;#039;&amp;#039;&amp;#039;Purpose:&amp;#039;&amp;#039;&amp;#039; To standardize the diagnosis of intermittent internet connectivity issues using the &amp;quot;Fault Isolation&amp;quot; methodology and pfSense Gateway Monitoring tools.&amp;lt;br&amp;gt; &amp;#039;&amp;#039;&amp;#039;Target Audience:&amp;#039;&amp;#039;&amp;#039; IT Support, Network Administrators, and Technical Staff.  == Part 1: The Methodology (Fault Isolat...&amp;quot;&lt;/p&gt;
&lt;p&gt;&lt;b&gt;New page&lt;/b&gt;&lt;/p&gt;&lt;div&gt;= SOP: Network Troubleshooting &amp;amp; pfSense Monitoring =&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Source:&amp;#039;&amp;#039;&amp;#039; [https://www.youtube.com/watch?v=utZ6kQpGRoc Lawrence Systems: pfSense Packet Loss and Latency Monitoring Guide]&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Purpose:&amp;#039;&amp;#039;&amp;#039; To standardize the diagnosis of intermittent internet connectivity issues using the &amp;quot;Fault Isolation&amp;quot; methodology and pfSense Gateway Monitoring tools.&amp;lt;br&amp;gt;&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Target Audience:&amp;#039;&amp;#039;&amp;#039; IT Support, Network Administrators, and Technical Staff.&lt;br /&gt;
&lt;br /&gt;
== Part 1: The Methodology (Fault Isolation) ==&lt;br /&gt;
&lt;br /&gt;
The goal of troubleshooting is not just to fix the problem, but to &amp;#039;&amp;#039;&amp;#039;prove&amp;#039;&amp;#039;&amp;#039; where the failure lies. We use the &amp;#039;&amp;#039;&amp;#039;Process of Elimination&amp;#039;&amp;#039;&amp;#039; to isolate variables in the connection chain.&lt;br /&gt;
&lt;br /&gt;
=== The Connection Chain ===&lt;br /&gt;
&lt;br /&gt;
Visualize the path data takes from the user to the internet. A failure at any point breaks the chain.&lt;br /&gt;
&lt;br /&gt;
=== Isolation Logic ===&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Isolate the Device:&amp;#039;&amp;#039;&amp;#039; If only &amp;#039;&amp;#039;one&amp;#039;&amp;#039; user drops, the issue is at &amp;#039;&amp;#039;&amp;#039;Node A&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Isolate the Local Network:&amp;#039;&amp;#039;&amp;#039; If &amp;#039;&amp;#039;all&amp;#039;&amp;#039; users drop, but pfSense can still ping the modem, the issue is at &amp;#039;&amp;#039;&amp;#039;Node B&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Isolate the ISP:&amp;#039;&amp;#039;&amp;#039; If pfSense cannot reach the Public Internet (&amp;#039;&amp;#039;&amp;#039;Node F&amp;#039;&amp;#039;&amp;#039;) despite a valid link to the Modem (&amp;#039;&amp;#039;&amp;#039;Node D&amp;#039;&amp;#039;&amp;#039;), the issue is likely &amp;#039;&amp;#039;&amp;#039;Node E&amp;#039;&amp;#039;&amp;#039; (The ISP).&lt;br /&gt;
&lt;br /&gt;
== Part 2: Configuring pfSense for Accurate Monitoring ==&lt;br /&gt;
&lt;br /&gt;
By default, pfSense monitors the &amp;#039;&amp;#039;&amp;#039;Gateway IP&amp;#039;&amp;#039;&amp;#039; (usually the ISP&amp;#039;s local modem or first hop). You must determine if this is the correct target based on your equipment setup.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Objective:&amp;#039;&amp;#039;&amp;#039; Ensure we are monitoring the &amp;#039;&amp;#039;Internet&amp;#039;&amp;#039;, not just the local modem.&lt;br /&gt;
&lt;br /&gt;
=== Step 1: Verify Equipment Mode &amp;amp; Monitor IP ===&lt;br /&gt;
&lt;br /&gt;
# Navigate to &amp;#039;&amp;#039;&amp;#039;System &amp;gt; Routing &amp;gt; Gateways&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
# Click the &amp;#039;&amp;#039;&amp;#039;Edit (Pencil)&amp;#039;&amp;#039;&amp;#039; icon next to the primary WAN gateway (e.g., WAN_DHCP).&lt;br /&gt;
# Check the &amp;#039;&amp;#039;&amp;#039;Monitor IP&amp;#039;&amp;#039;&amp;#039; field.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Scenario A: Modem is in Bridge Mode (Public IP on pfSense)&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
If your modem is in Bridge Mode, the Gateway IP is usually the ISP&amp;#039;s first hop on their network.&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;Verdict:&amp;#039;&amp;#039; Default settings are usually fine, but changing to a public DNS (Step 2) is still recommended for reliability.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Scenario B: Modem acts as Router (Private IP on pfSense)&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
If your pfSense WAN has a private IP (e.g., 192.168.x.x), the default Gateway is just your local modem.&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;Verdict:&amp;#039;&amp;#039; &amp;#039;&amp;#039;&amp;#039;You MUST change the Monitor IP.&amp;#039;&amp;#039;&amp;#039; Monitoring the default gateway only confirms the cable between pfSense and the modem is working. It tells you nothing about the actual internet connection.&lt;br /&gt;
&lt;br /&gt;
=== Step 2: Set an Off-Premise Monitor IP ===&lt;br /&gt;
&lt;br /&gt;
To test the actual internet connection, set the &amp;#039;&amp;#039;&amp;#039;Monitor IP&amp;#039;&amp;#039;&amp;#039; to a stable, off-premise target:&lt;br /&gt;
&lt;br /&gt;
* &amp;lt;code&amp;gt;1.1.1.1&amp;lt;/code&amp;gt; (Cloudflare DNS)&lt;br /&gt;
* &amp;lt;code&amp;gt;8.8.8.8&amp;lt;/code&amp;gt; (Google DNS)&lt;br /&gt;
* &amp;lt;code&amp;gt;208.67.222.222&amp;lt;/code&amp;gt; (OpenDNS)&lt;br /&gt;
* &amp;#039;&amp;#039;Corporate Option:&amp;#039;&amp;#039; The IP of the company VPN or Relay Server.&lt;br /&gt;
&lt;br /&gt;
Click &amp;#039;&amp;#039;&amp;#039;Save&amp;#039;&amp;#039;&amp;#039; and &amp;#039;&amp;#039;&amp;#039;Apply Changes&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
&lt;br /&gt;
=== Step 3: Tune Latency Thresholds (Optional) ===&lt;br /&gt;
&lt;br /&gt;
If using high-latency connections (Starlink, Satellite, LTE) or if you see false alarms:&lt;br /&gt;
&lt;br /&gt;
# In the Gateway Edit screen, click &amp;#039;&amp;#039;&amp;#039;Display Advanced&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
# Adjust &amp;#039;&amp;#039;&amp;#039;Latency Thresholds&amp;#039;&amp;#039;&amp;#039; (Lower/Upper limits in ms).&lt;br /&gt;
# Adjust &amp;#039;&amp;#039;&amp;#039;Packet Loss Thresholds&amp;#039;&amp;#039;&amp;#039; (Percentage limits).&lt;br /&gt;
&lt;br /&gt;
== Part 3: Visualizing &amp;quot;Intermittent&amp;quot; Issues (RRD Graphs) ==&lt;br /&gt;
&lt;br /&gt;
Intermittent issues are difficult to catch in real-time. pfSense RRD (Round-Robin Database) graphs provide historical evidence.&lt;br /&gt;
&lt;br /&gt;
=== Accessing the Quality Graph ===&lt;br /&gt;
&lt;br /&gt;
# Navigate to &amp;#039;&amp;#039;&amp;#039;Status &amp;gt; Monitoring&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
# Click the &amp;#039;&amp;#039;&amp;#039;Wrench Icon&amp;#039;&amp;#039;&amp;#039; (View Settings).&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Category:&amp;#039;&amp;#039;&amp;#039; Select System.&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Graph:&amp;#039;&amp;#039;&amp;#039; Select Quality.&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Time Period:&amp;#039;&amp;#039;&amp;#039; Select 1 Day (for immediate issues) or 1 Month (for pattern analysis).&lt;br /&gt;
# Click &amp;#039;&amp;#039;&amp;#039;Save View&amp;#039;&amp;#039;&amp;#039; to make this your default if desired.&lt;br /&gt;
&lt;br /&gt;
=== Interpreting the Data ===&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Packet Loss (Red Bars):&amp;#039;&amp;#039;&amp;#039; Vertical red bars indicating data that never reached the destination. &amp;#039;&amp;#039;&amp;#039;Any&amp;#039;&amp;#039;&amp;#039; red bars usually indicate a physical line fault or severe ISP failure.&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Latency/Delay (Blue Line):&amp;#039;&amp;#039;&amp;#039; The time it takes for a ping to return.&lt;br /&gt;
* &amp;#039;&amp;#039;&amp;#039;Standard Deviation (Jitter):&amp;#039;&amp;#039;&amp;#039; How much the latency varies.&lt;br /&gt;
&lt;br /&gt;
== Part 4: Root Cause Analysis (Correlation) ==&lt;br /&gt;
&lt;br /&gt;
To prove the cause, we overlay different metrics to see what else was happening on the firewall during the spike.&lt;br /&gt;
&lt;br /&gt;
# In &amp;#039;&amp;#039;&amp;#039;Status &amp;gt; Monitoring&amp;#039;&amp;#039;&amp;#039;, click the &amp;#039;&amp;#039;&amp;#039;Wrench Icon&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Left Axis:&amp;#039;&amp;#039;&amp;#039; Set to Quality (Packet Loss/Delay).&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Right Axis:&amp;#039;&amp;#039;&amp;#039; Select a correlation metric (see below).&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Update Graph.&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
=== Correlation Scenarios ===&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Check 1: Bandwidth Saturation&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;Right Axis:&amp;#039;&amp;#039; Traffic (WAN Throughput)&lt;br /&gt;
* &amp;#039;&amp;#039;Analysis:&amp;#039;&amp;#039; If Latency spikes exactly when Traffic is high, the pipe is full.&lt;br /&gt;
* &amp;#039;&amp;#039;Action:&amp;#039;&amp;#039; Upgrade bandwidth or implement Traffic Shaping (QoS).&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Check 2: CPU/System Overload&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;Right Axis:&amp;#039;&amp;#039; System &amp;gt; Processor&lt;br /&gt;
* &amp;#039;&amp;#039;Analysis:&amp;#039;&amp;#039; If Packet Loss correlates with 100% CPU usage, the firewall hardware is the bottleneck, not the ISP.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Check 3: VPN Usage&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;Right Axis:&amp;#039;&amp;#039; OpenVPN or WireGuard &amp;gt; Users (or Traffic)&lt;br /&gt;
* &amp;#039;&amp;#039;Analysis:&amp;#039;&amp;#039; If instability begins exactly when remote users connect, the VPN encryption overhead may be stressing the CPU or saturating the upload speed.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Check 4: The &amp;quot;Clean&amp;quot; Failure (ISP Fault)&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
* &amp;#039;&amp;#039;Analysis:&amp;#039;&amp;#039; If Packet Loss (Red Bars) occurs when Traffic is &amp;#039;&amp;#039;&amp;#039;flat/low&amp;#039;&amp;#039;&amp;#039; and CPU is &amp;#039;&amp;#039;&amp;#039;idle&amp;#039;&amp;#039;&amp;#039;, the issue is external.&lt;br /&gt;
* &amp;#039;&amp;#039;Action:&amp;#039;&amp;#039; &amp;#039;&amp;#039;&amp;#039;Contact ISP.&amp;#039;&amp;#039;&amp;#039; (See Part 5).&lt;br /&gt;
&lt;br /&gt;
== Part 5: Evidence Gathering &amp;amp; Reporting ==&lt;br /&gt;
&lt;br /&gt;
ISPs often dismiss intermittent complaints. Providing raw data logs forces escalation.&lt;br /&gt;
&lt;br /&gt;
=== Exporting Data to CSV ===&lt;br /&gt;
&lt;br /&gt;
# In &amp;#039;&amp;#039;&amp;#039;Status &amp;gt; Monitoring&amp;#039;&amp;#039;&amp;#039;, load the view showing the issue (e.g., &amp;quot;1 Month Quality&amp;quot; or &amp;quot;3 Month View&amp;quot;).&lt;br /&gt;
# Click the &amp;#039;&amp;#039;&amp;#039;Export Button&amp;#039;&amp;#039;&amp;#039; (Arrow pointing into a box) below the graph.&lt;br /&gt;
# Save the .csv file.&lt;br /&gt;
&lt;br /&gt;
=== Visualizing in LibreOffice Calc / Excel ===&lt;br /&gt;
&lt;br /&gt;
# Open the CSV file.&lt;br /&gt;
# Select the &amp;#039;&amp;#039;&amp;#039;Timestamp&amp;#039;&amp;#039;&amp;#039; column and the &amp;#039;&amp;#039;&amp;#039;Packet Loss&amp;#039;&amp;#039;&amp;#039; column.&lt;br /&gt;
# Insert a &amp;#039;&amp;#039;&amp;#039;Line Chart&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
# &amp;#039;&amp;#039;&amp;#039;Highlight the outages.&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
#* &amp;#039;&amp;#039;Example:&amp;#039;&amp;#039; &amp;quot;Connection drops daily between 14:00 and 16:00.&amp;quot;&lt;br /&gt;
# Save the chart as a PDF and attach it to the ISP Support Ticket.&lt;br /&gt;
&lt;br /&gt;
&amp;#039;&amp;#039;&amp;#039;Note:&amp;#039;&amp;#039;&amp;#039; When submitting this data to an ISP, explicitly state: &amp;#039;&amp;#039;&amp;quot;I have isolated the issue to the modem/street level. My internal firewall logs show packet loss to 8.8.8.8 occurring during periods of zero bandwidth usage, ruling out local congestion.&amp;quot;&amp;#039;&amp;#039;&lt;br /&gt;
&lt;br /&gt;
== Part 6: Multi-WAN Performance Comparison ==&lt;br /&gt;
&lt;br /&gt;
In environments with multiple gateways (Load Balancing or Failover), comparing performance simultaneously is critical to ruling out shared hardware failures (e.g., the firewall itself) versus specific ISP failures.&lt;br /&gt;
&lt;br /&gt;
=== Configuring the Comparative Graph ===&lt;br /&gt;
&lt;br /&gt;
# Navigate to &amp;#039;&amp;#039;&amp;#039;Status &amp;gt; Monitoring&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
# Click the &amp;#039;&amp;#039;&amp;#039;Wrench Icon&amp;#039;&amp;#039;&amp;#039; (Settings).&lt;br /&gt;
# Configure the axes to display two ISPs at once:&lt;br /&gt;
#* &amp;#039;&amp;#039;&amp;#039;Left Axis:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
#** Category: System&lt;br /&gt;
#** Graph: Quality&lt;br /&gt;
#** Specific Selection: WAN_DHCP (Primary ISP)&lt;br /&gt;
#* &amp;#039;&amp;#039;&amp;#039;Right Axis:&amp;#039;&amp;#039;&amp;#039;&lt;br /&gt;
#** Category: System&lt;br /&gt;
#** Graph: Quality&lt;br /&gt;
#** Specific Selection: OPT1 or WAN2 (Secondary ISP)&lt;br /&gt;
# Click &amp;#039;&amp;#039;&amp;#039;Update Graph&amp;#039;&amp;#039;&amp;#039;.&lt;br /&gt;
&lt;br /&gt;
=== Interpreting Comparative Data ===&lt;br /&gt;
&lt;br /&gt;
This view allows you to see if an outage is &amp;#039;&amp;#039;&amp;#039;Global&amp;#039;&amp;#039;&amp;#039; (Router/Power issue) or &amp;#039;&amp;#039;&amp;#039;Isolated&amp;#039;&amp;#039;&amp;#039; (ISP issue).&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
! Graph Observation !! Diagnosis&lt;br /&gt;
|-&lt;br /&gt;
| &amp;#039;&amp;#039;&amp;#039;Only Primary ISP shows Packet Loss&amp;#039;&amp;#039;&amp;#039; || &amp;#039;&amp;#039;&amp;#039;Isolated ISP Failure.&amp;#039;&amp;#039;&amp;#039; The issue is specific to the Primary ISP line. The firewall hardware is functioning correctly because the Secondary ISP is clear.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;#039;&amp;#039;&amp;#039;Both ISPs show Packet Loss simultaneously&amp;#039;&amp;#039;&amp;#039; || &amp;#039;&amp;#039;&amp;#039;Global Hardware Failure.&amp;#039;&amp;#039;&amp;#039; If two independent ISPs fail at the exact same second, the issue is likely the pfSense hardware (CPU overload), a shared switch, or a power fluctuation.&lt;br /&gt;
|-&lt;br /&gt;
| &amp;#039;&amp;#039;&amp;#039;Secondary ISP shows high Latency&amp;#039;&amp;#039;&amp;#039; || &amp;#039;&amp;#039;&amp;#039;Backup Quality Check.&amp;#039;&amp;#039;&amp;#039; Ensure your backup line is actually viable. High latency on a backup line might mean it is unsuitable for failover.&lt;br /&gt;
|}&lt;br /&gt;
&lt;br /&gt;
[[Category:SOP]]&lt;br /&gt;
[[Category:Networking]]&lt;br /&gt;
[[Category:pfSense]]&lt;br /&gt;
[[Category:Comfac]]&lt;/div&gt;</summary>
		<author><name>CITEditor</name></author>
	</entry>
</feed>