Network Throughput Analysis and Troubleshooting Techniques

Throughput is a fundamental metric in network performance that indicates the rate at which data is successfully transmitted between two points over a network. When throughput falls below expected levels, users can experience slow file transfers, lagging applications, and degraded overall network performance.

Troubleshooting throughput issues requires a methodical approach. Because data traverses multiple devices and network segments—such as local switches, routers, WAN links, and remote endpoints—identifying the source of throughput degradation can be complex.

This part introduces the foundational concepts of throughput troubleshooting. It highlights the importance of establishing expected performance baselines, understanding potential bottlenecks, and adopting a structured diagnostic methodology.

What is Network Throughput and Why Does it Matter?

Network throughput differs from bandwidth. Bandwidth refers to the maximum theoretical capacity of a network link, while throughput measures actual data transfer rates observed during network use. Factors like congestion, hardware limitations, configuration issues, and protocol overhead influence throughput.

A discrepancy between expected bandwidth and actual throughput can result in frustrating delays and reduced productivity. For example, a WAN link advertised at 50 Mbps might consistently deliver only 20 Mbps, indicating a performance problem.

Understanding throughput and its practical impact on user experience is essential for network engineers tasked with maintaining reliable and efficient networks.

Initial Steps in Throughput Troubleshooting

The first step in any troubleshooting process is to confirm and quantify the issue. User complaints alone are subjective; objective measurement provides evidence to guide the investigation.

A common approach involves reproducing the throughput issue using a dedicated testing tool that measures the transfer rate between two endpoints. Once a throughput deficit is confirmed, the next step is to understand the network path and devices involved.

Mapping the data path—from the source workstation, through switches and routers, across the WAN, to the destination device—is crucial. This mapping clarifies which devices and interfaces should be examined for errors or configuration issues.

Common Causes of Throughput Bottlenecks

In any networked environment—whether a small office, a large enterprise, or a cloud-based system—network performance is a vital component of productivity and service delivery. One of the most common and often misunderstood performance issues is a throughput bottleneck. Throughput refers to the amount of data that can be transferred across a network in a given time, typically measured in bits per second. When users experience slow data transfer, dropped connections, or poor application performance, a bottleneck in throughput is often to blame.

The causes of throughput issues can span the entire network path, from end-user devices to core routers and even beyond into the broader internet. Identifying and resolving these problems requires a methodical approach to pinpoint where the limitation occurs. Below are the most common causes of throughput bottlenecks, broken down by their location within the network.

End devices (client or server issues)

Sometimes, the problem isn’t with the network at all—it starts at the source or destination device. These could be laptops, servers, or mobile devices involved in sending or receiving data.

Several issues on end devices can limit throughput. Hardware limitations, such as older network interface cards limited to lower link speeds, can restrict performance regardless of how fast the rest of the network is. CPU and memory constraints can also slow things down if the device is overloaded. Improper network settings, such as a misconfigured MTU or TCP window size, can interfere with optimal data flow, and background applications consuming bandwidth further reduce what is available for critical tasks.
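
As a rough illustration, the following Linux commands (the interface name eth0 is a placeholder; Windows and macOS expose the same information through their own tools) can confirm whether an end device is negotiating the expected link speed, using a sensible MTU, and not starved for CPU:

    # Negotiated link speed and duplex on the NIC (typically requires root)
    sudo ethtool eth0 | grep -E 'Speed|Duplex'

    # Configured MTU plus per-interface packet, error, and drop counters
    ip -s link show eth0

    # Quick look at CPU and memory pressure on the host
    top -b -n 1 | head -n 15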

Local area network switches

The next common location for bottlenecks is within the local network, particularly at switches that connect end devices. These devices handle traffic inside the building or campus network.

Port speed mismatches are a frequent cause of limited throughput. If a device connects at 100 Mbps instead of 1 Gbps, for example, performance will suffer. Duplex mismatches, where one end of the link operates in full-duplex mode and the other in half-duplex, can cause collisions and slow performance. Faulty or poor-quality cables are another factor that can introduce errors and signal loss, especially at higher speeds. Reviewing switch logs for errors, collisions, or interface downgrades can help identify problems in this layer.
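
The exact commands depend on the switch vendor; assuming Cisco IOS syntax (interface names are placeholders), checks along these lines expose negotiated speed, duplex, and accumulating errors:

    ! Summary of negotiated speed and duplex for every port
    show interfaces status

    ! Error, CRC, and collision counters on the port serving the affected device
    show interfaces GigabitEthernet1/0/10 | include duplex|errors|collisions|CRC

    ! Recent log messages, including link flaps and duplex-mismatch warnings
    show logging | include DUPLEX|LINK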

Routers and WAN links

Routers, which handle traffic between different network segments or out to the internet, and wide-area network (WAN) links are also frequent bottlenecks.

Interface congestion may occur when a router is processing more traffic than its interfaces can handle, resulting in delays or dropped packets. Traffic shaping or bandwidth policing, often used to manage bandwidth usage, can unintentionally restrict legitimate traffic. If these policies are overly strict or misconfigured, they can throttle throughput for users and services. Routing inefficiencies may also play a role. Suboptimal paths increase latency and reduce bandwidth availability. Finally, hardware limitations or high CPU usage on routers can degrade performance, particularly if advanced features like encryption or firewalling are active.

External and upstream network factors

In some cases, the source of a bottleneck lies beyond the local or corporate network. These external causes are usually harder to detect and may require coordination with service providers.

Internet service providers (ISPs) sometimes impose throttling during high-usage periods or apply usage-based bandwidth limits. Even with a fast plan, the actual available speed may vary due to shared infrastructure. Congestion on upstream links or backbone networks between ISPs can also degrade performance. Traffic may be routed through longer or more congested paths due to dynamic routing changes. When using cloud services, issues in the provider’s infrastructure—such as overloaded virtual machines or maintenance events—can result in slow performance. Similarly, if your traffic passes through a content delivery network (CDN), outages or rerouting in that network can cause delays.

Diagnosing throughput issues effectively

Because throughput bottlenecks can occur at many points, effective diagnosis requires a layered, methodical approach. Starting at the endpoint, verify whether the device is capable of the expected network speed and not constrained by CPU, memory, or software issues. From there, examine switch and router interfaces for speed mismatches, errors, or shaping policies. Use tools such as traffic counters, flow analysis, and performance testing utilities to gather data.

Testing throughput between internal devices using tools like iperf can help isolate whether the problem exists inside the network or on the internet side. For suspected external issues, use traceroute or similar utilities to trace the path to remote endpoints and identify delays or hops with high latency.
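
For example, a path trace from an affected workstation toward the remote site (the destination address is a placeholder) lists each hop with its round-trip times, which helps separate internal segments from the provider's portion of the path:

    # Linux/macOS: trace the path and per-hop latency to the remote endpoint
    traceroute 203.0.113.50

    # Windows equivalent
    tracert 203.0.113.50

    # mtr, where available, combines traceroute and ping and reports per-hop loss over time
    mtr --report --report-cycles 60 203.0.113.50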

You can also compare performance at different times of day. If performance drops consistently during peak hours but improves later, that’s a strong indicator of congestion or throttling beyond your local infrastructure.

Tips for narrowing down the cause

To identify the root of the issue, test from multiple endpoints and locations. If one user experiences slow throughput but others do not, the problem likely lies with their specific device or connection. If all users in a location are affected, focus on the local switch or router. If only connections to external sites are affected, consider ISP or upstream issues. Document your findings, including test results and configuration data, to support discussions with service providers if escalation is necessary.

Throughput bottlenecks are a common challenge in networks of all sizes. They can result from limitations or misconfigurations at the device, switch, router, or ISP level. Understanding the most common causes—including end device limitations, local network problems, WAN congestion, and external factors—enables IT teams to methodically investigate and resolve performance problems. With a structured troubleshooting approach and the right diagnostic tools, it becomes much easier to maintain optimal network performance and deliver a smooth experience for users and applications alike.

Example Use Case: Investigating Reduced Transfer Speeds Between Two Sites

To illustrate the troubleshooting process, consider the following scenario:

A user at Site 1 reports that transfers to a workstation at Site 2 are slower than usual. Previously, transfers could reach 50 Mbps, but now throughput peaks at approximately 20 Mbps. The WAN link is known to support 50 Mbps.

The network engineer’s task is to systematically determine whether the bottleneck exists on the user’s device, local network, WAN provider’s circuit, or remote endpoint.

This example frames the process of measuring throughput, verifying device configurations, examining interface statistics, and reviewing applied policies to identify and resolve throughput issues.

Measuring Throughput with Iperf

Once it’s confirmed that users are experiencing reduced throughput, the next important step is to measure and quantify the problem using reliable tools. One popular tool for this purpose is called iperf.

Iperf is a network testing application used to measure the maximum bandwidth between two endpoints. It can generate TCP or UDP traffic and reports useful metrics like bandwidth, packet loss, and jitter. Because it sends real traffic between devices, iperf gives an accurate reflection of the actual throughput available on the network path.

Setting Up Iperf for Testing

Iperf is an open-source tool commonly used by IT professionals, network administrators, and engineers to test the performance of a network connection by measuring bandwidth between two devices. It can test both TCP and UDP traffic, producing detailed results such as throughput, jitter, and packet loss. This helps identify bottlenecks, validate network upgrades, and evaluate real-world network conditions.

To use iperf, you need two devices: one to act as a server and the other to act as a client. These devices are usually standard desktop or laptop computers. Iperf is not supported directly on most routers, switches, or other networking hardware unless those systems support third-party applications or have a built-in operating system that allows software installation.

Understanding the Client-Server Model

The basic structure of an iperf test relies on a client-server setup. The server waits to receive data, while the client sends data to the server. This communication enables iperf to analyze how much data is transferred, how long the transfer takes, and how reliable the connection is during the test.

The two devices should be connected through the same network or have connectivity across the internet. In either case, both devices must be able to reach each other through their IP addresses. Any firewalls or security systems between them must allow the communication to pass.

Installing Iperf on Both Devices

Before running a test, iperf must be installed on both the client and server devices. The installation method depends on the operating system, but it is widely available for platforms like Windows, macOS, and Linux. Once installed, iperf is ready to be used from the system’s command-line interface.

Iperf has gone through several versions; the one most commonly used today is iperf3, a rewrite of the original tool with more consistent behavior and reporting. Because iperf3 is not compatible with the older iperf2, both devices must run the same major version to avoid compatibility issues during testing.
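
Installation is usually a single package-manager command; for example (package names assume the standard repositories, and the version check confirms both ends match):

    # Debian/Ubuntu
    sudo apt install iperf3

    # Red Hat / Fedora family
    sudo dnf install iperf3

    # macOS with Homebrew
    brew install iperf3

    # Confirm the installed version on both devices
    iperf3 --version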

Starting the Iperf Server

On the server device, iperf is started in a listening mode. This device waits for incoming test traffic from the client. It listens on a specific network port, 5201 by default for iperf3, and remains idle until the client initiates a connection.

The server device must stay active for the duration of the test. If the server is interrupted or closed before the client completes the test, no results will be generated.
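
Starting the server side is a single command; a different port is needed only if the default of 5201 is blocked or already in use:

    # Start iperf3 in server (listening) mode on the default port 5201
    iperf3 -s

    # Optionally listen on another port, for example 5301
    iperf3 -s -p 5301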

Running the Iperf Client

On the client device, you initiate the test by specifying the IP address of the server device. This begins the data transfer process from the client to the server. The default test duration is usually 10 seconds, during which the client sends traffic to the server at full capacity. The test output is displayed in real time and shows statistics such as total data sent, average bandwidth, and other key performance indicators.

Iperf can also run in reverse mode, where the server sends data to the client instead. This is useful for testing the return path in a two-way network connection.
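
On the client, the test is started by pointing iperf3 at the server's IP address (192.0.2.10 is a placeholder); duration, reporting interval, and direction can be adjusted with options:

    # Basic 10-second TCP test from client to server
    iperf3 -c 192.0.2.10

    # Run for 30 seconds and print an interim report every 5 seconds
    iperf3 -c 192.0.2.10 -t 30 -i 5

    # Reverse mode: the server sends and the client receives, testing the return path
    iperf3 -c 192.0.2.10 -R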

TCP vs. UDP Testing

By default, iperf tests use TCP, the same protocol used for most internet services like web browsing and email. TCP testing is great for understanding the general capacity and stability of a network connection, including packet retransmission and congestion control behavior.

However, iperf also supports UDP, a protocol often used for real-time applications such as video calls or online gaming. When testing with UDP, iperf provides additional statistics like jitter (variation in packet arrival time) and packet loss. These metrics are important for evaluating quality of service for time-sensitive applications.

In a UDP test, you can also control the bandwidth rate. This allows you to simulate specific conditions, such as a 100 megabit per second video stream, and test how well the network handles the traffic without introducing delay or packet loss.
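
A UDP test adds a target bit rate; for instance, to check whether the path can carry a steady 100 Mbps stream without loss or excessive jitter:

    # UDP test at a target rate of 100 Mbit/s; the report includes jitter and packet loss
    iperf3 -c 192.0.2.10 -u -b 100M

    # Longer UDP test at 20 Mbit/s to approximate a sustained video stream
    iperf3 -c 192.0.2.10 -u -b 20M -t 60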

Understanding Test Results

After completing a test, both the client and server will display the results. The key statistics typically include:

  • Transfer size: The total amount of data that was sent during the test.

  • Bandwidth: The average rate of data transfer, typically shown in megabits or gigabits per second.

  • Retransmits (for TCP): A higher number of retransmissions can indicate congestion, poor connections, or interference.

  • Jitter and packet loss (for UDP): These values help identify how stable the connection is and whether it’s suitable for voice or video applications.

Consistent high bandwidth, low jitter, and no packet loss are signs of a healthy and high-performing network.

Advanced Features

Iperf is highly customizable. You can adjust the test duration, change ports, test in both directions, and even simulate multiple data streams at the same time. This makes it suitable for simple testing as well as complex performance evaluations.

For example, if you want to simulate heavy traffic or real-world usage, you can run parallel streams or schedule multiple tests. You can also capture the results in a file for further analysis or documentation.
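
For example, parallel streams, machine-readable output, and log files can be combined as needed (addresses and file names are placeholders):

    # Four parallel TCP streams for 30 seconds, useful for filling high-latency links
    iperf3 -c 192.0.2.10 -P 4 -t 30

    # Save JSON output for later analysis or documentation
    iperf3 -c 192.0.2.10 -J > iperf-results.json

    # Append human-readable output to a log file instead of the terminal
    iperf3 -c 192.0.2.10 --logfile iperf-results.txt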

Use Cases for Iperf

Iperf is used in many real-world scenarios, including:

  • Testing the performance of newly installed network cables or switches

  • Measuring wireless throughput in different areas of a building

  • Verifying that your internet service matches what your provider advertises

  • Benchmarking connections between on-site servers and cloud environments

  • Diagnosing performance issues like slow file transfers or streaming interruptions

Iperf is a simple yet powerful tool that provides detailed information about network performance between two devices. By using a client-server model, iperf sends and receives traffic to measure how fast and how reliably data travels across your network. Whether you’re troubleshooting a problem, validating a service level agreement, or planning infrastructure upgrades, iperf gives you the insights needed to make informed decisions. With just two devices and a little preparation, you can perform professional-grade network testing quickly and accurately.

Confirming Throughput and Isolating Issues

By running iperf tests between the two problem workstations, you can determine whether the user’s report of slow speeds is accurate. If the throughput reported by iperf is significantly below the expected bandwidth, it confirms the problem exists somewhere in the network.

To exclude the possibility that the issue is caused by the specific machines themselves, repeat the test between different devices at the same locations. If the reduced throughput remains consistent, it suggests the problem lies in the network or WAN connection rather than on the endpoints.

Verifying Network Connectivity and Interface Health

After confirming the throughput issue, the next phase is to verify that the network path itself is healthy, both physically and logically.

Mapping the Data Path

It’s important to understand exactly which network devices the data travels through between the source and destination. This usually includes local switches, routers, and WAN links. Having a clear topology map or diagram helps in focusing troubleshooting efforts on the right devices and links.

Checking Speed and Duplex Settings

One common cause of throughput issues is a mismatch in speed or duplex settings between connected ports on switches or routers. For example, one side might be set to full duplex and the other to half duplex, causing collisions and slowdowns.

Inspecting the configurations and status of these ports helps ensure both ends are negotiating the same speeds and duplex modes, which is necessary for optimal performance.
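
Assuming Cisco IOS syntax (interface names are placeholders), the negotiated values can be compared against the configuration on each side of the link, and a port can be returned to autonegotiation if a hard-coded setting is causing the mismatch:

    ! What the port actually negotiated
    show interfaces GigabitEthernet0/1 | include duplex

    ! What is configured; ideally both ends autonegotiate or use matching fixed values
    show running-config interface GigabitEthernet0/1

    ! Reverting a port to autonegotiation
    interface GigabitEthernet0/1
     speed auto
     duplex auto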

Monitoring for Errors and Log Messages

Interface errors such as CRC errors, collisions, or physical layer faults can severely affect throughput. By examining the error counters and reviewing device logs, you may find evidence of problems like failing cables, flapping links, or faulty hardware.

These issues can often be corrected by replacing cables, adjusting configurations, or swapping hardware components.

Verifying Routing and Data Paths

Ensuring that traffic is actually flowing along the expected paths is crucial. Unexpected routing changes, loops, or blackholes can cause congestion and reduce effective throughput. Tools like traceroute and routing table inspection can confirm that packets are traveling along intended routes without detours or drops.
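
Assuming Cisco IOS syntax (addresses are placeholders), a quick check is to confirm which route the router actually uses toward the remote site and to trace the path from the router itself:

    ! Show the specific route currently selected for the remote subnet
    show ip route 10.2.0.0

    ! Trace the path from the router toward the remote endpoint
    traceroute 10.2.0.25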

Analyzing Interface Traffic Rates

Next, examine the traffic rates on interfaces along the data path to see where throughput decreases.

Monitoring the input and output rates on the switch ports and router interfaces connected to the endpoints and the WAN can reveal where traffic drops below expected levels.

Interface statistics are usually averaged over a time period, commonly five minutes, but can be adjusted to shorter intervals to catch short bursts or fluctuations.
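
Assuming Cisco IOS syntax (the interface name is a placeholder), the averaged rates appear directly in the interface output, and the averaging window can be shortened to catch bursts:

    ! Current input and output rate counters (5-minute average by default)
    show interfaces GigabitEthernet0/0 | include rate

    ! Shorten the averaging window to 30 seconds on the WAN-facing interface
    interface GigabitEthernet0/0
     load-interval 30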

If the rates are consistent and high at the interfaces near the source but drop sharply at the WAN-facing interface, the bottleneck is likely at or beyond the WAN link.

Reviewing Traffic Policies and Their Impact on Throughput

Once it’s established that throughput is lower than expected and interface health is verified, attention turns to configurations that could be intentionally or unintentionally limiting traffic flow. Traffic policies such as policing and shaping can restrict bandwidth, often resulting in throughput that does not match the available capacity.

Understanding Traffic Policing and Shaping

Traffic policing enforces strict bandwidth limits by dropping packets that exceed a configured rate. This can cause throughput to cap at a certain value but may introduce packet loss and retransmissions.

Traffic shaping buffers and smooths traffic bursts to conform to a specified rate, reducing packet loss but potentially adding latency. Shaping is often used on outbound interfaces to prevent exceeding a link’s capacity.

Both policing and shaping are applied through policy maps configured on router or switch interfaces. Even if the physical link supports a higher bandwidth, these policies can restrict throughput.

Identifying Policies Applied to Interfaces

Review the configuration of interfaces along the path, especially WAN-facing interfaces, for any service policies. These policies typically reference named policy maps that define bandwidth limits or quality of service rules.
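
Assuming Cisco IOS syntax (the interface name is a placeholder), an attached service policy and its live counters can be inspected as follows:

    ! Look for service-policy statements attached to the WAN-facing interface
    show running-config interface GigabitEthernet0/0

    ! Show the policy in action, including offered rate, drops, and queue behavior
    show policy-map interface GigabitEthernet0/0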

Checking for these policies is important when troubleshooting throughput issues because a misconfigured or overly restrictive policy can be the root cause of reduced speeds.

Verifying and Adjusting Policy Limits

If a service policy is found limiting bandwidth below the expected throughput, confirm whether the configured values are intentional. Sometimes, changes or errors introduce incorrect limits, such as setting a shaping policy to 20 Mbps on a 50 Mbps link.

Testing throughput before and after modifying or removing these policies can confirm if they are responsible for the bottleneck.
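
In the example scenario, a shaper left at 20 Mbps on a 50 Mbps circuit would produce exactly these symptoms. A hedged sketch of the correction in Cisco IOS MQC syntax (the policy name is a placeholder) looks like this:

    ! Existing policy restricting the link to roughly 20 Mbps
    policy-map WAN-OUT
     class class-default
      shape average 20000000

    ! Adjusted policy matching the 50 Mbps circuit
    policy-map WAN-OUT
     class class-default
      shape average 50000000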

Coordinated Policy Updates and Testing

When a restrictive policy is identified as the cause of low throughput, the next step is to plan and execute configuration changes carefully.

Coordinate with relevant teams to schedule a maintenance window if needed. Modify the policy settings to increase bandwidth limits or remove the policy temporarily.

After updating, re-run throughput tests between the endpoints to verify that speeds improve as expected.

Document the changes and monitor the network to ensure no unintended side effects occur.

Additional Considerations in Throughput Troubleshooting

Beyond obvious policies, other factors may influence throughput and should be considered during troubleshooting.

Quality of Service (QoS) Configurations

QoS policies that prioritize or deprioritize certain types of traffic can affect throughput for specific applications or users. Ensure that QoS settings align with business priorities and do not inadvertently throttle important traffic.

Network Congestion and Utilization

High utilization on WAN links or network segments can cause congestion and reduce throughput. Monitoring traffic patterns over time can reveal if throughput drops coincide with peak usage periods.

Hardware Limitations and Aging Infrastructure

Older switches, routers, or cables may struggle to handle modern bandwidth demands. Check device specifications and health to rule out hardware as a bottleneck.

External Network Issues

Problems with the ISP, backbone, or cloud provider may reduce throughput beyond your control. Engaging with providers and using performance monitoring tools can help identify these external causes.

In this part, we explored how traffic policies like policing and shaping can restrict throughput even when physical links have sufficient capacity.

We reviewed the importance of identifying and verifying these policies on network devices, adjusting configurations as needed, and testing throughput after changes.

Additional factors such as QoS, congestion, hardware limitations, and external network issues also play roles in throughput performance and should be considered in a comprehensive troubleshooting process.

Advanced Testing Methods for Throughput Analysis

After addressing configuration and policy-related causes, further testing can help identify more subtle or intermittent throughput issues.

Using Packet Capture and Analysis

Capturing network traffic at key points along the path can reveal retransmissions, packet loss, or protocol inefficiencies impacting throughput. Packet analyzers can decode TCP flow behaviors to detect problems like excessive retransmissions or window size limitations.

This level of analysis requires skill and appropriate tools but can pinpoint hidden issues that standard throughput tests may miss.
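
As one hedged example on a Linux host (the interface name, addresses, and file names are placeholders), traffic from a test run can be captured with tcpdump and screened with tshark, the command-line companion to Wireshark:

    # Capture traffic to and from the test server during a transfer
    sudo tcpdump -i eth0 -w throughput.pcap host 192.0.2.10

    # Count TCP retransmissions observed in the capture
    tshark -r throughput.pcap -Y "tcp.analysis.retransmission" | wc -l

    # Summarize bytes carried per one-second interval to spot stalls
    tshark -r throughput.pcap -q -z io,stat,1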

Testing with Different Protocols and Traffic Types

Throughput can vary depending on the protocol used (TCP vs UDP) and the size and type of traffic.

Testing with UDP traffic helps identify issues related to packet loss or jitter, while TCP tests reveal throughput impacted by congestion control and flow management.

Using diverse test scenarios provides a more complete picture of network performance.

Leveraging Built-in Device Diagnostics

Some switches and routers support embedded throughput testing or performance diagnostics that can test bandwidth internally without relying on external devices.

Utilizing these features can help isolate device-specific bottlenecks.
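
One widely available example is Cisco IOS IP SLA, which sends synthetic probe traffic between two routers and reports latency, jitter, and loss rather than raw bandwidth. The following is a hedged sketch only; exact syntax varies by platform and software release (addresses and ports are placeholders):

    ! On the far-end router
    ip sla responder

    ! On the sending router: a UDP jitter probe toward the responder
    ip sla 10
     udp-jitter 10.2.2.1 5000
    ip sla schedule 10 start-time now life forever

    ! Review the measured latency, jitter, and loss
    show ip sla statistics 10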

Monitoring and Alerting for Proactive Throughput Management

Ongoing monitoring is key to preventing throughput degradation before it affects users.

Implementing Network Performance Monitoring Tools

Deploy tools that continuously track interface utilization, error rates, latency, and throughput metrics across the network.

These tools can generate alerts when throughput drops below thresholds or errors spike, allowing rapid response.
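
As a simple, hedged illustration of what such tools poll behind the scenes, interface byte counters can be read over SNMP (the community string, address, and interface index are placeholders), and utilization derived from the change between two samples:

    # Read the 64-bit inbound byte counter for interface index 1, twice, 60 seconds apart
    snmpget -v2c -c public 192.0.2.1 IF-MIB::ifHCInOctets.1
    sleep 60
    snmpget -v2c -c public 192.0.2.1 IF-MIB::ifHCInOctets.1

    # Utilization (%) = (delta octets * 8) / (interval in seconds * link speed in bit/s) * 100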

Correlating Application and Network Performance

Understanding how network throughput impacts critical applications helps prioritize troubleshooting and investment.

Tools that link network metrics with application performance provide actionable insights.

Regular Capacity Planning and Trend Analysis

Reviewing throughput trends over time identifies growth patterns and helps forecast when upgrades or redesigns are necessary.

Proactive capacity planning avoids unexpected bottlenecks.

Best Practices to Prevent Throughput Issues

Finally, adopting operational best practices can reduce the frequency and impact of throughput problems.

  • Maintain consistent and accurate documentation of network topology and device configurations.

  • Use automated configuration management and validation to prevent misconfigurations.

  • Regularly review and update traffic policies to align with current business needs.

  • Schedule routine maintenance windows for software updates and hardware inspections.

  • Educate users and IT staff on recognizing and reporting throughput issues promptly.

  • Test new configurations or hardware in controlled environments before wide deployment.

Throughput troubleshooting requires a structured approach combining measurement, verification, configuration review, advanced diagnostics, and ongoing monitoring.

By following a step-by-step methodology—from initial testing with tools like iperf to examining traffic policies and performing deep packet analysis—network engineers can efficiently isolate and resolve throughput bottlenecks.

Proactive monitoring and capacity planning further ensure that networks maintain the performance needed to support today’s demanding applications.

This comprehensive approach not only improves user experience but also strengthens overall network reliability and business continuity.

Final Thoughts

Throughput issues can significantly impact network performance and user experience, yet their causes are often multi-faceted and require a disciplined troubleshooting approach. Understanding throughput as the actual data transfer rate—not just theoretical bandwidth—is critical in diagnosing and resolving performance problems.

Starting with objective measurement tools like iperf, followed by verifying connectivity and interface health, helps build a factual foundation. From there, reviewing configurations such as traffic policies, shaping, and QoS settings can uncover intentional or inadvertent limitations on throughput.

Advanced diagnostics like packet capture and protocol-specific testing provide deeper insights into more elusive problems. Meanwhile, ongoing monitoring and capacity planning empower network teams to identify trends and potential bottlenecks before they impact users.

The key to efficient throughput troubleshooting lies in a methodical, stepwise approach coupled with good documentation and collaboration. By combining technical know-how with practical tools and proactive practices, network administrators can maintain optimal throughput, ensuring reliable, high-performance network services that meet business needs.