WHERE TO BUY  |   SITE MAP  |  CONTACT US  
 

 Voice/Fax over Frame Relay vs.Voice/Fax over IP

INTRODUCTION

The development of very fast, inexpensive microprocessors and special purpose switching chips, coupled with highly reliable fiber optic transmission systems, have made it possible to build economical, ubiquitous, high-speed packet-based data networks. Similarly, the development of very fast, inexpensive digital signal processors (DSPs) has made it practical to digitize and compress voice and fax signals into data packets. The natural evolution of these two developments has been to combine digitized voice and fax packets with packet data, creating integrated data/voice networks.

This convergence of telecommunications and data communications has been motivated primarily by the cost savings that accrue from operating a single, shared network. A corporation with a wide area packet data network extending to each of its branch offices can often integrate its intracompany voice and fax traffic with its data network and save enough in intracompany toll-calls to pay for the DSP-based integration equipment in less than one year. The overall savings usually results in an effective toll-rate of 1-to-2 cents per minute averaged over the five-year life of the network.

The total savings often adds up to tens of thousands of dollars per remote site; a 50 branch international network can readily enjoy better than a million dollars of cost savings. Even an integrated, domestic-only North American network, where toll-costs are already low, can save over $10,000 per remote site over a five-year period.

Packet Data Networks
There are a several different types of wide area packet data networks that can support integrated voice/fax traffic, with varying degrees of success. The principal data network technologies in use today are X.25, Frame Relay, ATM, SNA, Novell/IPX, TDM, and TCP/IP. Because TDM is not a packet technology, it is much less efficient than the other technologies, and is rapidly being replaced by them. X.25 and SNA, based on older packet technologies, are too slow and delay-prone to effectively carry voice and fax traffic. The Novell/IPX technology was designed primarily for LAN applications and does poorly in the WAN environment; it is quickly being replaced by TCP/IP.

The remaining packet technologies - Frame Relay, ATM, and TCP/IP - are the principal networking techniques companies use today to build new wide area data networks. Today's WANs are constructed using leased lines running these three technologies, and using public data network services based on them. By a substantial margin, Frame Relay and TCP/IP are used the most, both over leased lines and as public data services.

WAN Market
According to the December 1997 issue of Data Communications magazine, the largest WAN market segment is leased-line services, at worldwide revenues of $15.6 billion forecast for 1998, followed by Frame Relay services at $6.2 billion, commercial TCP/IP services at $6.2 billion, and ATM services at $0.6 billion. The corresponding growth rates (1997 to 1998) are forecast at 10% for leased lines, 59% for Frame Relay, 30% for commercial TCP/IP services, and 109% for ATM.

ATM, Frame Relay, and TCP/IP are each able to carry voice and fax traffic. ATM has been designed from the beginning to carry voice/fax traffic, while Frame Relay and TCP/IP were originally designed to carry only data. Because adding voice and fax to Frame Relay and TCP/IP has been an 'after market' activity, there are areas of caution and even concern that must be addressed when companies build integrated WANs using Frame Relay or TCP/IP.

This paper discusses these issues, provides guidelines for voice/data integration, and furnishes recommendations for the best course of action in three networking scenarios:

When adding voice and fax traffic to an existing corporate data network.
When building a new integrated corporate voice/data network.
When building a new voice/fax packet network to carry public voice/fax calls.

CORPORATE VOICE/DATA INTEGRATION

Adding voice and fax to corporate Frame Relay (hereafter, FR) networks or TCP/IP (hereafter, just IP) networks require the deployment of a voice/data integration device (VDID) at each corporate location. For Frame Relay, the device is a voice/fax-enabled router or Frame Relay access device (FRAD) (see Figure 1a). For IP, the device is a voice over IP (VoIP) gateway (see Figure 1b).

In both cases, the device connects to legacy telephony equipment, usually a PBX, key telephone system (KTS), or a fax machine on one side, and to the network on the other side. The VoIP gateway connects to the location's LAN, not directly to the WAN. VoIP gateways are fundamentally LAN devices, relying on existing non-voice/fax-enabled routers to provide WAN access. The FRAD, on the other hand, connects directly to the WAN as well as the telephony equipment. In addition, it can also provide routing functionality as well as legacy connectivity over the WAN.


Figure 1a: Typical voice over Frame Relay network

Figure 1b: Typical voice over IP network

Digitized Voice and Fax
Analog and digital voice and fax signals enter the VDID where they are processed by a DSP and converted into data packets. Analog voice is first digitized at 64Kbps using the PCM format, compressed into a reduced format (typically 8Kbps CELP), and placed into the appropriate packet type (FR or IP). Digital voice from the PBX is already in PCM format, thus the PCM conversion step is omitted. The far-end VDID reverses the process, taking the compressed packets, decompressing them to 64Kbps, and playing the information out in the appropriate format (analog or digital). The process operates simultaneously at both ends for full-duplex speech. The term 'CODEC' is used to represent the COmpression/DECompression technique or process.

Analog fax is also digitized at 64Kbps into the PCM format, then demodulated by the DSP back into the original digital format it had inside the fax machine and placed into the appropriate packet type. At the remote VDID, the digital stream is remodulated back into the fax's original format.

All DSP activity occurs in real-time. Digitized voice/fax packets are placed onto the LAN by the VoIP gateway and onto the WAN by the voice/fax-enabled router/FRAD (RFRAD). The VDID telephony connections are either analog or digital trunks. There are usually 2-to-4 analog VDID trunks at smaller remote sites, 4-to-8 analog or digital trunks at regional locations, and 8-to-24/30 or more digital trunks at a headquarters location.

Determining VDID Trunk Requirements
Since the cost to integrate voice/fax with data depends largely on the cost of the VDID equipment required at each site, and the VDID cost is proportional to the number of VDID trunks, only the minimum necessary number of trunks is provisioned. Two trunks will handle about 3 hours of intracompany phone/fax communications over an 8-hour day with a 95% likelihood of the caller receiving dial-tone on the first try; four trunks will handle about 12 hours, and so on. Table 1 summarizes VDID trunk capacity information.

From experience, the 95% dial-tone availability figure is just high enough not to frustrate the user from using the VDN (and thereby defeating the cost savings). Higher percentages can be used, but the increased number of VDID trunks required increasingly negates the cost savings.

  Number of VDID Phone/Fax Trunks
Period 2 4 8 12 24
1 Hour 0.4 1.45 4.32 7.6 19
2 Hours 0.7 2.9 8.64 15.2 38
4 Hours 1.5 5.8 17.28 30.4 76
6 Hours 2.2 8.7 25.92 45.6 114
8 Hours 2.9 11.6 34.56 60.8 152
9 Hours 3.3 13.05 38.88 68.4 171
  Number of Hours of 95%-available Dial Tone per Period
Table 1: Synopsis of VDID trunk capacity

At the headquarters location, the trunk count is often derived as a percentage of the total number of branch trunks. For example, 20 remote branches with two trunks each, totaling 40 trunks, would in most cases require only 15-to-20 trunks at headquarters. The branch trunks would contend for the pool of headquarters trunks, with a contention ratio between 8-to-3 (40-to-15) to 2-to-1 (40-to-20). Table 2 summarizes some common contention ratios.

Number of Remote Trunks Number of Remote Sites with Two Trunks per Site Typical Number of Headquarters Trunks Typical Contention Ratios
2 1 2 1:01
4 2 3-4 1.3:1 - 1:1
6 3 4 1.5:1
8 4 5-6 1.6:1 - 1.8:1
10 5 5-6 1.7:1 - 2:1
16 8 8-9 1.8:1 - 2:1
24 12 10-13 1.8:1 - 2.4:1
32 16 13-16 2:1 - 2.5:1
64 32 24-29 2.2:1 - 2.7:1
Table 2: Contention ratios of total remote trunks to HQ trunks

Calling Over the Voice/Data Network
To place a call using the voice/data network (VDN), a user in one location calls a party at another location by first dialing a fixed, published access number, usually a '6' or '8.' This instructs the caller's PBX to select the next available trunk connecting the PBX to the location's VDID. The VDID immediately generates dial tone to the caller's ear. The caller keys-in the telephone number of the called party. This is usually a location number and an extension. The VDID translates the number into the network address of the called party's VDID (remote VDID) and forwards the extension to this VDID.

The remote VDID feeds the extension number to the called party's PBX, which rings the called party's phone. The two VDIDs establish a packet telephony circuit between them across the VDN, and conversation commences. If the called party's phone is busy, or the VDIDs cannot find or connect to each other, appropriate busy tones are generated by the calling VDID. When one or both parties are connected to KTS's instead of PBX's, the calling sequence is similar, except the caller punches the appropriate key to select the VDN, and there is no called-party extension when the remote VDID connects to a KTS.

Importance of User-Transparency and Legacy Telephony Interoperability
The entire calling sequence is transparent to both parties, as if they were using PBX tie lines, KTS off-premise extensions, or speed-dial buttons. This is very important in corporate VDN environments because the cost to retrain users on how to make phone calls would negate much of the intended savings. In the above scenario, the telecom manager only needs to publish a one-page memo with brief instructions and a list of the branch location numbers.

Furthermore, if a user has to go through an involved, new calling sequence, such as those involved when using a PC as a telephone, in most cases the user would take the easier approach and just use the PSTN, frustrating any cost savings. In the business environment, a user will not tolerate any reduction in the level of telephony service just to reduce the company's phone bill. The user is measured on getting the task done, not on reducing overhead expenses. At home, the consumer may well tolerate less convenient or reduced quality telephone connections to reduce personal telephony costs, but that willingness does not translate to the business environment, where telephony costs are not personal.

Another key characteristic of corporate voice/data integration is the use of existing, legacy telephony equipment. If the VDID or VDN require new telephony equipment, the cost savings from toll avoidance would be offset by the cost of the new equipment, nullifying the integration proposition. The new equipment, moreover, would require user retraining, already a non-starter. For these reasons, corporate WAN voice/data integration does not incorporate the desktop PC as a mainstream telephony device.

VOICE/DATA INTEGRATION DEVICE FUNCTIONALITY

Most VDIDs (RFRADs and VoIP gateways) interface to legacy telephony equipment in similar ways and perform the similar DSP-based voice and fax signal processing. Although a specific vendor's VDID will have its own particular set of features, there is a minimal set of functionality that is required to achieve the basic goals of user-transparency, legacy telephony interoperability, low bandwidth consumption, and attractive cost savings.

Basic Functionality

FXS, FXO, and E&M analog trunk interfaces, 2&4 wire, ground start and loop start. These connect to PBX's, KTS's, or directly to telephones and fax machines.
T1 and E1 digital trunk interfaces for PBX connectivity. Both T1 and E1 should include basic Channel Associated Signaling (CAS) and Common Channel Signaling (CCS) support.
Ability to generate dial tone, busy tone, and trunk busy tone to the calling party, and ringing signaling to the called party instrument.
Last packet replay (for voice only) to make up for late, corrupted, or dropped voice packets. Replaying the last voice packet smoothes over the gap, and the listener usually does not notice this when it occurs infrequently. Corporate networks are normally well behaved, with very low packet loss.
Port contention with hunt groups, allowing a local VDIC to access the next available trunk on a remote VDIC. The trunks can be grouped, allowing controlled access within a specific group.
Automatic trunk busy-out when the VDIC detects a problem with one of its trunks, with itself, or with the WAN. This presents as a busy signal or trunk-busy signal to the caller.
Speech compression supporting natural speech conversations with excellent full duplex voice quality, at a compression rate of 8Kbps or less (not including any packet overhead).
Silence suppression with low-bandwidth (1Kbps or less) comfort noise or background noise regeneration. Silence suppression should reduce bandwidth requirements by at least 50%; it should not introduce any noticeable clipping at the beginning of speech activity.
Echo cancellation. This is not acoustic echo cancellation, rather it is reflected-energy cancellation. When a two-wire telephone cable connects to a four-wire PBX interface or to a telco central office (CO) interface, a special electrical circuit called a hybrid is used to convert between 2-wires and 4-wires. Although hybrid circuits are very efficient in their conversion ability, a small percentage of telephony energy is not converted but instead is reflected back to the caller. This is called echo.
When the caller is near the PBX or CO switch, the echo comes back so quickly that the speaker cannot discern it. However, when the delay is more than about 10 milliseconds, the caller can hear an echo. To prevent this, VDID's include special DSP code that listens for the echo signal and subtracts it from the incoming audio signal. Echo cancellation is especially important because network delays can easily be 40-50 milliseconds, so the echo from the far-end hybrid can be quite pronounced at the near end. Far-end echo cancellation eliminates this.
Fax support in real time for G3 fax, to 9.6Kbps.
Accurate Dual Tone Multi-Frequency (DTMF) tone regeneration, sufficient for most voice-mail and similar applications.
Digit store-and-forward (no second dial tone) for transparent dialing plan implementation.
Call setup time of less than 3 seconds.
Support for at least one Ethernet LAN connection.
Centralized control of all VDIDs, for alarm, status monitoring, diagnostics, database maintenance and operating code maintenance.
In addition to these VDID necessities, the better voice/data integration devices also support several enhancements that improve the user experience, reduce costs, expand the level of telephony interoperability, and improve manageability.

Enhanced Functionality

Cross-platform port contention, transparent to the caller. When a local VDID is requesting a trunk from a far-end VDID, and there are none available (all are in use or otherwise busied-out), VDIDs with cross-platform port contention will select another VDID until a trunk is found.
Support for Token Ring LANs.
SMNP support, allowing standards-based network management.
Voice/fax on any port, at no extra cost. Voice or fax is auto-detected without user intervention, based on the nature of the signal.
Dynamic jitter buffering, automatically self-adjusting to the delay characteristics of the WAN. See below for a discussion of delay and jitter.
Country-specific tone and ringing signaling. This allows VDID users in different countries to hear tones they are familiar with, reducing any retraining issues and promoting VDN usage.
Flash-hook pass-through, allowing a remote user to control special PBX functions that depend on the usage of the flash-hook. These can include voice mail and other call-processing functions.
Precise DTMF or Multi-Frequency (MF) tone detection and regeneration, both at the beginning of a call and during the call. This guarantees the caller can control all far-end telephony applications that use DTMF.
Any of DTMF, MF, or pulse at one end to any of DTMF, MF, or pulse at the other end.
Centrex support, for FXO to FXO call disconnect supervision. This assures that the call will be disconnected.
No central point of failure. A central directory server, for example, that does not have a redundant back-up unit somewhere else in the network would be a central point of failure. Many VDIDs distribute directory information to the individual VDIDs, avoiding this problem.
Scalability to many VDIDs, for large VDNs. Networks where hundreds of VDIDs are currently installed.
Force connect or equivalent (sometimes called ring-down), enabling a caller to cause a pre-determined far-end phone to ring whenever the caller goes off-hook.
Call inhibit and receive inhibit settings, controlling who can make certain calls and who can receive certain calls.
Full gain control including input level gain and output level gain settings, to tune the quality of the speech that is affected by the relative strength of the electrical telephony signal entering and exiting the VDID.
Controllable ringing frequency, to customize the ringing of the telephone.
Fully packaged platform (versus just selling boards or software). Some VDID vendors only sell boards or software, requiring a customer to buy from a systems integrator, introducing the possibility of finger pointing when there is a problem.
Redundant power supply option, for added integrity at large sites. For smaller sites, with less than 8 VDID trunks, the site can use the PSTN as a backup if something in the VDN or data network fails; redundancy is usually not required. For larger sites, however, PSTN access as a backup for a large number of failed VDID trunks is usually unavailable, so reliability and redundancy are key.
Call-accounting for corporate bill-backs. Some users will bill the remote branch offices to recover the cost of deploying the VDIDs and any costs for extra VDN bandwidth.
Local emulation capability so that one site can have any interface with any protocol independent of other sites on the network. This is not possible with 'transparent passthrough.'
A variety of signaling-system support within CAS and CCS. There are several different signaling-systems that are specific to certain PBX's or country conventions.
Because VoFR has been mainstream for over five years, FRADs are usually more fully featured than VoIP gateways when compared on a similar cost-per-trunk basis. The feature gap, however, is closing and will probably be negligible in 2 to 3 years. Beyond the feature gap, though, there are other important differences that are intrinsic in the nature of each VDID's packet technology.

COMPARING VoFR AND VoIP PACKET TECHNOLOGIES

There are four main areas of material differences between VoIP and VoFR packet management technologies:

Packet overhead
WAN access-line carrying capacities and associated cost implications
Packet prioritization
Packet segmentation or fragmentation and its effect on data packet sizes
Each area of difference presents more of a challenge to the VoIP gateway approach than the VoFR approach, both because VoFR is more mature and because VoIP has a few inherent technological handicaps.

Comparing VoFR and VoIP Packet Overhead
The main handicap is the greater packet overhead required by IP. Both FR and IP packets are constructed with packet header information, voice/fax header information, and the digitized, compressed voice and fax information (the payload).

Each payload represents a fragment of speech called a 'talk spurt.' Representing a long talk spurt requires the VDID to accumulate the speech information over an extended period of time, and possibly introducing delay into the conversation. Conversations with added delay sound unnatural and cause the corporate telephony user to bypass the VDN, defeating any hoped-for cost savings. This delay-limiting requirement dictates a maximum talk spurt size for voice (and consequently, fax) of 50 bytes at a compressed data rate of 8Kbps (1000 bytes per second, or one byte per millisecond). Fifty bytes represent 50 milliseconds of speech activity, the longest practical period without contributing too much to the overall delay.

Tables 3a and 3b show the bandwidth consumed in Kbps for FR and IP telephony packets. The speech compression CODEC is assumed to need 8Kbps. The sum of the CODEC bandwidth and the packet overhead is the peak bandwidth consumed by an active conversation. The peak only occurs for a few seconds and is replaced by 0Kbps during the natural periods of silence in a typical conversation. While one person talks, the other listens, yielding silence about 50% of the time over a 20-30 second period. Pauses and interruptions in the conversation contribute another 10%. The net bandwidth consumption averaged over the 20-30 second period is about 40% of the peak bandwidth. These figures are borne out in studies done at Bell Labs.

The method in which silence suppression is implemented can affect both the perception of voice quality and bandwidth consumed. Absence of sound is often perceived as a broken connection because the listener is accustomed to hearing a certain minimal level of background noise or 'comfort noise'. There are two methods of solving this problem. The first is to inject white noise generated by the VDID at either end of the connection to let the listener know that the connection is alive. The advantage of this method is that it does not consume any bandwidth over the WAN link. However, it results in a reduced level of perceived voice quality. The other method involves actually sampling the background noise. This method produces a significant improvement in voice quality, but consumes more bandwidth. The way that the comfort noise feature is implemented differs with each vendor and can often be configured by the network manager. Bandwidth consumption can vary from 0 to 2Kbps, plus overhead.

Typical CODEC Bandwidth 8Kbps
Frame Relay and Voice/Fax Packet Overhead 2Kbps
Total FR Bandwidth 10Kbps
Less 60% Silence - 6Kbps
Net Bandwidth Consumption Averaged over a 20-30 Second Period for an Active Speech Conversation 4Kbps
Table 3a: VoFR packet overhead and average bandwidth consumption

Typical CODEC Bandwidth 8Kbps
IP/UDP and Voice/Fax Packet Overhead 7Kbps
Total IP Bandwidth 15Kbps
Less 60% Silence - 9Kbps
Net Bandwidth Consumption Averaged over a 20-30 Second Period for an Active Speech Conversation 6Kbps
Table 3b: VoIP packet overhead and average bandwidth consumption

The IP packet overhead is an estimate for the WAN environment. It can range up to 9Kbps when the IP packet is encapsulated in another protocol, such as Frame Relay's RFC 1490. If the router accessing the WAN performs header compression, the overhead will be reduced. Because of the wide mix of routers in corporate networks, we will assume the routers do not have header compression, and use the 7Kbps figure. The overhead in the LAN environment can be as high as 9Kbps due to Ethernet or Token Ring encapsulation, but LAN bandwidth consumption is usually unimportant.

Comparing VoFR and VoIP Bandwidth Consumption
The preceding figures indicate that IP telephony traffic consumes about 50% more WAN bandwidth than FR telephony traffic. Actual measurements bear this out. The implications for the corporate VDN are two-fold: fewer active virtual voice/fax trunks on the WAN access line, and less bandwidth left over for non-telephony traffic ('residual bandwidth'). The first implication comes from simply dividing the WAN access line's bandwidth by the peak trunk bandwidth. On a 64Kbps FR WAN access line, for example, there can be a maximum of 64/10 = 6 active voice trunks, compared to a maximum of 64/15 = 4 active voice trunks for a 64Kbps IP line.

If the comfort noise feature is used to improve voice quality, the difference between VoIP and VoFR bandwidth consumption becomes even more pronounced. A 1Kbps background noise sample transmitted during periods of silence, would mean adding 4.8Kbps (1Kbps + 7Kbps overhead = 8Kbps, less 40% active speech = 4.8Kbps) to the net bandwidth consumed for VoIP and 1.8Kbps (1Kbps + 2Kbps overhead = 3Kbps, less 40% active speech = 1.8Kbps) for VoFR. This would bring the total average bandwidth consumed to 10.8Kbps and 5.8Kbps for VoIP and VoFR respectively, a significant advantage for VoFR.

The method of simply dividing the WAN bandwidth by the peak trunk bandwidth assures there will never be voice trunks competing for bandwidth, assuming there is no high priority data traffic (see below for this case). This simple division approach does not, however, take into account the beneficial effects of statistical multiplexing. With stat mux effects, the trunk count will go up, depending on the original trunk count and the acceptable level of dropped voice/fax packets.

The stat mux effect only applies to higher speed WAN lines. For example, a T1 (1.5Mbps) line will support about 150 trunks according to the division method, but about 185 stat muxed with 99.9% packet integrity. Because the level of packet integrity required can vary by application, and because the trunk counts in corporate data/voice networks are rarely this high, we will disregard the stat mux effect in corporate applications and consider it only in pure voice/fax applications. The second implication is slightly more complicated, and requires a discussion of the duty cycle of a voice/fax trunk. When the VDIC trunk is not in use, the net bandwidth consumption is 0Kbps. The actual amount to time a VDIC trunk is in use is called the trunk's 'duty cycle.' It varies according to the number of VDIC trunks, and normally does not exceed a loading factor that corresponds to the appropriate 95% dial-tone availability figure.

Table 4 shows the likely maximum average duty cycles for various trunk counts, assuming an 8-hour day. In Table 4, for example, the maximum average percent of time a single VDIC trunk is active is 36% over an 8-hour day (as one of 4 trunks carrying the maximum amount of traffic consistent with a 95% dial-tone availability target). The 36% average usually develops over a 20-30 minute period (this is different from the 20-30 second period mentioned above for silence suppression), and implies that the amount of bandwidth the trunk consumes on average over this period is 36% of 6Kbps (for IP) or 4Kbps (for FR), that is, 2.2Kbps for IP or 1.4Kbps for FR.

Number of VDIC Trunks 2 4 8 12 24
Maximum Average Percent of Time ('Duty Cycle') Trunks are Active 18% 36% 54% 63% 79%
Table 4: Maximum average duty cycles for various trunk counts

By taking the appropriate duty cycle for each trunk count, the minimum average amount of residual bandwidth that is available for other data traffic can be calculated. For example, with 2 FR trunks, each trunk has an 18% duty cycle. Both trunks combined can support 2.9 hours per 8-hour day of 95% available dial tone. Each trunk consumes 10Kbps peak, 4Kbps with silence suppression, and 0.72Kbps (4K x 18%) averaged over 20-30 minutes. The residual bandwidth on a 64Kbps line would be 64.0-0.72 = 63.28Kbps.

Comparing VoFR and VoIP WAN Telephony Trunk Capacity (With Data Integration)
Table 5 shows the residual bandwidth that is available for data averaged over a 20-30 minute period (this is access bandwidth less the total of the trunk duty-cycle bandwidths) for both FR and IP approaches. It also shows the number of VDIC telephony trunks that will fit in a given WAN access bandwidth (access bandwidth divided by total peak trunk bandwidth) for both FR and IP.

The amount of residual bandwidth determines whether it will be necessary to increase the WAN access bandwidth to support adding voice and fax traffic to the existing data traffic. The goal, when adding voice and fax to corporate data networks, is to avoid having to add bandwidth if at all possible. The extra cost to add bandwidth can significantly reduce the savings that come from the 'free' phone and fax calls. If the corporate data network has enough spare bandwidth, then there will be no additional bandwidth costs.

For a given access speed, the residual data bandwidth for FR and IP are similar, within 7% of each other. The difference is in the number of VDIC trunks that the line will carry. Note that the figures given for the number of telephony trunks at the higher WAN access speeds (above 256Kbps) are conservative; no allowance has been made for the beneficial effects on the trunk capacity by statistically multiplexing voice/fax traffic.

  Frame Relay Packet with Overhead IP Packet with Overhead
WAN Access Bandwidth (Kbps) Maximum # Router or FRAD Trunks** Residual Bandwidth (Kbps)* Maximum# VoIP Gateway Trunks** Residual Bandwidth (Kbps)*
28.8 2 27.4 1 27.7
33.6 2 32.2 2 31.4
56 5 47.6 3 51
64 6 53 4 55.4
128 12 98 8 102
256 25 176 17 184
384 38 247 25 264
Table 5: WAN access link packet trunk carrying capacity
* Residual bandwidth averaged over a 20-30 minute period; 8-hour day is assumed.
** Voice/fax trunks, analog or channelized digital, connected to the PBX, KTS, CO, or fax machine. No allowance is made for the beneficial effects of stat muxing trunks.

Another useful way to view this information is by trunk count, as shown in Table 6. Although the amount of bandwidth consumed by VoIP is 50% higher than VoFR, when the application is voice/data integration (versus just voice/fax) this has less of an impact than it might seem. Because the average residual bandwidths are so high, the percentage differences between them are fairly small.

For example, 4 trunks require a minimum of 40/60Kbps (FR/IP). They leave only 24/4Kbps out of a 64Kbps line at their peak bandwidth consumption (all 4 speakers speaking in the same direction at the same time). This is a large difference in favor of FR - by a margin of 6 to 1. However, they leave an average of 58.2/55.2Kbps residual bandwidth, only a 5 percent advantage for FR. Even with 24 trunks running over a 512Kbps trunk, the FR residual-bandwidth advantage is just 10%.

  Residual Bandwidth (Kbps) FR/IP
Voice Trunks Minimum Bandwidth Req'd (Kbps) FR/IP Avg. with Silence Suppression (Kbps) FR/IP Avg. with Duty Cycle (Kbps) FR/IP

28.8Kbps

WAN

56Kbps

WAN

64Kbps

WAN

128Kbps

WAN

256Kbps

WAN

512Kbps

WAN

2 20/30 8/12 1.4/2.2 27.3/NF 55/54 63/62 127/126 255/254 511/510
4 40/60 16/24 5.8/8.6 NF/NF 50/NF 58/55 122/119 250/247 506/503
8 80/120 32/48 17/26 NF/NF NF/NF NF/NF 111/102 239/230 495/486
12 120/180 48/72 30/45 NF/NF NF/NF NF/NF 98/NF 226/211 482/467
24 240/360 96/144 76/114 NF/NF NF/NF NF/NF NF/NF 180/NF 436/398
Table 6: Bandwidth consumption
* NF denotes not enough bandwidth for the VDIC trunks' minimum bandwidth to fit (no fit); 8-hour day assumed.

Integrating Voice/Fax with Background and Real-Time Data
The preceding discussion was based on the assumption that real-time, low-delay voice and fax traffic on VDNs is given priority over non-real-time data (sometimes called background data). This includes e-mail, file transfer, and browser traffic. By-and-large, background traffic can be briefly (seconds, not minutes) delayed in favor of real-time traffic without impacting the end-user. This has been borne-out in field measurements.

In fact, the marriage of voice/fax and data over WAN links has a mutually advantageous relationship. For low-speed WAN access, data is usually background and voice/fax is real-time. For high-speed access, there is often a mixture of real-time and background data with real-time voice/fax. But, there is no material real-time data vs. real-time voice/fax conflict because voice/fax consumes a small amount of bandwidth. Prioritizing it first does not usually adversely slow down real-time data on high-speed lines, whether FR or IP.

This way of combining voice/fax and data has changed considerably from the past. Before the advent of DSP and packet technologies, voice/fax typically consumed a constant 64Kbps without the possibility of silence suppression or the duty-cycle factor. Data consumed a small amount of bandwidth and was added onto voice/fax networks. Today, voice/fax averages 1-3Kbps, around 2-4% of 64Kbps (as much as a 50-fold improvement).

When there are real-time or quasi-real-time data applications, such as SNA, running over a low speed WAN access line, it is necessary to subtract the real-time data's bandwidth from the line's bandwidth to determine how much residual bandwidth is available for voice and fax. For example, if a 19.2Kbps SNA data stream is running over a 56Kbps link, the residual 36.8Kbps could be used for voice/fax traffic. This would allow two (IP) or three (FR) VDIC trunks (50% more for FR, a significant difference). The remaining bandwidth would be available for background data.

Moreover, because of gaps in the SNA data stream - usually about 50%, or 9.6Kbps in this case - another 9.6Kbps would be available for additional background data communications. With 2 VDIC trunks, voice/fax would average 1-2Kbps, SNA would average 9.6Kbps, and a total average of 44Kbps would be available for background data communications.

Comparing VoFR and VoIP WAN Telephony Trunk Capacity (Without Data Integration)
Heretofore we have compared VoFR and VoIP in an integrated data/voice environment. Although the voice/fax IP packet size is 50% more than the VoFR packet size, the residual bandwidth remaining for data is similar, especially at access speeds of 56Kbps or higher. The main disadvantage is the reduced number of VoIP telephony trunks that can be carried on a WAN access line compared to the number for VoFR, potentially a serious handicap at access speeds lower than 64Kbps.

When the discussion turns to a pure packet voice/fax environment, the situation is substantially more favorable toward VoFR. If a company is just running voice and fax on a WAN line - with no data - the residual bandwidth is of no value. In this case, Table 5 shows that VoFR maintains a consistent 50% advantage over VoIP, directly corresponding to the packet-length ratio.

For example, on a T1-speed connection between the US and Japan, VoFR will support 153 simultaneously active voice/fax connections, with an 8Kbps CODEC, compared with 102 for VoIP, not counting any statistical multiplexing advantages (which would accrue proportionally). If this connection were part of a commercial international telephony-bypass operation that sells toll-minutes to the public or other carriers, profits would be seriously impacted.

In the corporate voice/data setting, only rarely would a T1 line to a remote office be expected to carry more than, say, 60 voice/fax conversations; typically this would be more like 15-30. A substantial majority of the bandwidth would be used for data. In this case, the residual bandwidth figures would predominate, and FR would not be heavily favored over IP.

Network Delay and Jitter
One of the key ingredients of good voice quality and high user acceptance is low voice delay. Delay is introduced by several means: by the VDID's CODEC, by the VDID's WAN/LAN functions, by the WAN itself (both the WAN devices and WAN media), and by telephony equipment. In almost no cases does the LAN introduce any material delay.

Delay is often discussed in terms of both the average delay and the variation in delay. The variation in delay is called jitter. Average delay describes the average length of time a VDN packet takes to move from a local VDID to a remote VDID. Jitter describes the variability in the arrival time of VDN packets. Latency is a term often used to describe the sum of the average delay and the jitter.

As VDN latency exceeds about 200-250 ms, the two parties will increasingly adopt a half-duplex communications mode, where one speaks, the other listens and pauses to make sure the speaker is done. If the pauses are ill timed, they end up 'stepping' on each other's speech. This is the problem that occurs when two people converse over a satellite telephony connection. The result is a reduction in perceived voice quality.

When a VDID packet is inordinately delayed and does not arrive at the far-end in time to fit into the voice stream playing out from of the far-end VDID, it is discarded, and the previous packet is replayed. If this happens too often or twice in a row, the listener will perceive reduced voice quality.

To allow for variable packet arrival time and still produce a natural stream of speech, the far-end VDID does not play out the speech as soon as the first packet (after a period of silence) arrives. Instead, it holds the packet for a certain time in part of its memory, called the jitter buffer, and then plays it out. The amount of this hold time is the measure of the jitter buffer, e.g., a 50 ms hold time implies a 50 ms jitter buffer.

The jitter-buffer hold time adds to the overall delay, so if the network has high jitter, the overall effect will be a long perceived delay in the voice stream. For example, a VDN might have a moderately average delay of 130 ms and a variability of 5 ms. The VDN is said to have 5 ms of jitter, a low figure. The jitter buffer-hold time is only 5 ms so the effective latency will only be 135 ms, still moderate.

On the other hand, if a VDN has a low average delay of 50 ms, but 10% of the time the delay goes out to a long 200-250 ms (while 90% of the time the delay is a brief 33 ms), the jitter buffer would have to be 200-250 ms and the latency would be 250 ms, a high figure. Jitter can be more important than average delay in VDN applications.

While a high jitter buffer will compensate for chronically late packets, it adds to the latency. A low jitter buffer will reduce the latency but may be so small it frequently misses slightly delayed packets. It can be difficult to pick the right jitter buffer because VDN jitter conditions can change throughout the day. For this reason, the better VDIDs use dynamic jitter buffering that is constantly adjusting to the smallest size that is consistent with good voice quality.

Comparing VoFR and VoIP Packet Prioritization
For voice/data integration to work well over corporate VDNs, the VDN's jitter and average delay (together, latency) must be low; less than about 200 ms. One of the key means of achieving low latency is voice/fax packet prioritization. The means of achieving prioritization, however, are different for IP and FR. In general, the FR approach is easier than the IP approach.

With VoIP, the router that connects the site's LAN to its WAN access line is instructed to look for voice/fax IP packets and put them ahead of any data packets waiting in the router's transmit queue. This way, a string of outgoing data packets will not add to the variability of the arrival time of voice packets. Voice/fax packet prioritization is especially important at WAN access speeds of 56/64Kbps - 512Kbps. At T1/E1 speeds, it may not be required.

There are two methods of instructing the router to prioritize voice/fax IP packets. In the first, the network administrator explicitly programs the router to look for the VoIP gateway's 'well known UDP port number.' The 'well known UDP port number' is a reserved port number registered by the gateway manufacture for its exclusive use worldwide. In the second approach, a prioritization protocol that is understood by both the router and the gateway is used. An example of such a protocol is RSVP, a new prioritization standard that certain router vendors are now including in the operating software for some of their routers.

With ReSourcereserVation Protocol (RSVP), when the gateway determines it needs to place or receive a voice/fax call, it establishes an RSVP session with the router, using the LAN to pass information. The gateway instructs the router to prioritize the voice/fax packets for the duration of the call. When the call is terminated, the gateway instructs the router to cease the prioritizing. RSVP, however, is not yet in widespread deployment.

RSVP also includes provisions for constraining packet delay and guarantying bandwidth availability, but on a well-managed corporate IP network, only the prioritization feature needs to be used by the gateway. It is usually only necessary to prioritize the voice/fax packet at the edge of the VDN, where the WAN access speed is usually the lowest, and the potential for conflict, the greatest.

With Frame Relay, the prioritizing of the voice/fax packets ahead of data packets is done automatically in the voice-enabled FRAD, without any user setup or involvement. Moreover, when there are multiple active voice/fax trunks, some FR implementations can prioritize at different levels within the mix of active trunks, while the IP approach usually has only one priority level. Although the effects of prioritization are the same with FR and IP, FR has the edge in manageability and possibly multi-level prioritization.

The prioritization of voice/fax packets ahead of data packets will cause the data packets to be delayed in WAN transmission wherever there are active conversations on VDID trunks. The effects on WAN data packets are negligible for background data. For real-time data, it is possible that voice/fax prioritization could interfere with their timely delivery. It is important to arithmetically subtract the real-time data bandwidth requirements before calculating the number of voice/fax trunks that the WAN access line can accommodate. Because of its lower packet overhead, FR has an advantage here, as aforementioned.

The effects on LAN data packets, however, are negligible, because the bandwidth per active trunk is so low compared to LAN speeds (15Kbps vs. 10-100Mbps) and because there is no VoIP packet prioritization on the LAN. With VoFR, the voice/fax packets are generated inside the RFRAD and forwarded directly to the WAN, without transiting the LAN.

Comparing VoFR and VoIP Packet Segmentation
Packet prioritization is important to prevent data packets waiting to transmit to the WAN from delaying voice/fax packets. A similar problem exists when a single long data packet has begun to transmit and a voice/fax packet is now ready to transmit. The v/f packet must wait until the data packet is out the door. This can cause enough jitter to increase the latency by a large margin. For example, waiting for a 1500 byte Ethernet packet to transmit over a 56Kbps-access line can take more than 200 ms. Longer Token Ring packets can add even more delay.

It is important therefore for the VDID to segment any long data packets when there are active voice/fax calls, especially for lower-speed WAN access lines. The maximum sizes of data packets consistent with good voice quality, for various access speeds, are shown in Table 7.

WAN Access Speed (Kbps) Maximum WAN Packet Size (bytes)
56/64 256
128 512
192 768
256 1024
384 1536
512 2048*
1544 6144*
Table 7: Maximum packet sizes
* Ethernet packets do not exceed 1536 bytes. In an Ethernet LAN environment, packet segmentation is not needed above WAN access speeds of 256Kbps.

The consequence of segmenting data packets is a reduction in WAN access data transmitting efficiency. Since there is a fixed packet header for each packet, FR or IP, creating smaller packets increases the percentage of WAN access used for packet headers at the expense of useful data. In other words, the packet header as a percentage of payload, goes up, and WAN efficiency goes down. When there is active packet segmentation, the advantage is to FR since the header is smaller. With IP, the overhead can reduce WAN efficiency by 10-15%; with FR, by 2-4%.

With FR, packet segmentation occurs automatically within the RFRAD whenever there is an active call. During the call, all data packets are segmented according to information similar to that shown in Table 7. When there are no active calls, packet segmentation stops and long packets resume.

With VoIP, packet segmentation is achieved by instructing the WAN-access router to segment outbound data packets, also according to information similar to that shown in Table 7. The router is instructed to segment voice/fax IP packets either by the network administrator's explicit programming of the router to segment all packets, both voice/fax and data, all of the time, or by using a gateway-to-router protocol such as RSVP.

With RSVP, in a fashion similar to packet prioritizing, when the gateway determines it needs to place or receive a voice/fax call, it establishes an RSVP session with the router. The gateway instructs the router to segment both the voice/fax and data packets, but only for the duration of the call. Packet segmentation stops and long packets resume when there are no active calls, as in the RFRAD.

Without a protocol such as RSVP, the reduced WAN efficiency is constant and applied to all data packets. Today's routers do not segment differently depending on UDP port numbers. Since most routers and VoIP gateways do not support RSVP or a similar control protocol, VoIP will average about 10%-15% lower WAN efficiency on lower speed lines, regardless of whether or not there are active voice/fax calls. This is the second inherent disadvantage of VoIP compared to VoFR. For Ethernet packets on WAN access lines faster than 256Kbps, there are no VoIP packet segmentation disadvantages.

COMPARING FRAME RELAY AND IP WAN SERVICES

WANs are usually built using leased lines, Frame Relay services, and/or IP WAN services. For leased lines, there are no new differences to discuss between FR over leased lines and IP over leased lines. The packet technology differences discussed above determine the advantages of one over the other in the leased line environment.

There are several different types of FR and IP WAN services available. For FR, there are standard and premium grades of service from several North American vendors. Standard grade services deliver acceptable levels of service for most VoFR applications, with network delays typically in the 55ms to 130ms range, depending on WAN access speeds, and packet corruption levels less than 0.5%. Premium grade services reduce the delays by about 15 ms and improve the packet corruption levels to less than 0.01%. They generally cost an additional 10-20%, and are not needed in most applications.

IP WAN services are a little more complicated. There are several different types of IP network services, and different grades of service within them. Many corporate IP networks use routers running IP over FR networks (IP over FR or IPoFR), in which case they fall under the preceding discussion. The method of encapsulating IP packets inside Frame Relay frames can be vendor proprietary or based on a standard, such as the Internet Engineering Task Force's (IETF) RFC 1490.

In fact, the typical voice/fax-enabled RFRAD will send voice and fax in native frames and encapsulate IP for maximum efficiency. To the casual observer, who sees an IP LAN connected to the RFRAD and some voice lines connected to the RFRAD, the impression can be that the voice is part of an IP WAN connection. Thus the observer can draw the incorrect conclusion that the voice is VoIP rather than VoFR.

Some corporate IP networks use the relatively new IP VPNs (virtual private networks) that are supplied by several vendors. These IP VPNs, which usually run over ATM, Sonet, or FR, offer similar levels of service guarantees as the Frame Relay services. The main differences from Frame Relay services are availability and cost. FR is widely available around the world at speeds up to T1/E1, and selectively at speeds up to T3/E3. IP VPNs are readily available in the US and selectively internationally, at similar speeds.

FR and IP VPN cost differences vary according to the corporate network's topology, discussed later on in this section. In other regards, their differences come from their different packetizing technologies, discussed previously. For a mix of voice and data, where the data has a lower priority, they are similar regarding bandwidth efficiency. For a traffic mix of voice and delay-sensitive or high priority data, FR excels.

There are also corporate networks that use the Internet to transmit interoffice data, using firewalls for information security. Because the Internet has very high jitter and intermittently large packet losses, it is not suitable for corporate interoffice telephony. It can, however, be used for interoffice corporate voice messaging and interoffice fax.

The Internet can also be used to provide a reduced-quality telephony service for consumers willing to trade-off quality for price. This is especially appealing in the international arena. In these applications, a PC is used as a telephone. The PC accesses the Internet, and connects to another PC on the Internet. There is no charge for this service beyond the access charges. In a variation on this, the Internet-connected PC accesses a VoIP gateway on the Internet in a specific city, and, using the city's PSTN, calls a legacy telephone in that city via the gateway. There is an additional charge for this, but usually fairly small.

In yet another variation, a legacy telephone dials a VoIP gateway in a nearby location, instructs the gateway to connect to another gateway in a distant location, and dials out using the PSTN to a legacy telephone. Local toll charges apply. This is pure legacy long-distance using the Internet to carry the call between the two locations. Some service providers improve voice quality by using high-speed, semi-dedicated IP bandwidth, not the normal Internet, but market the service as Internet telephony because of the Internet's marketing appeal.

In this application, however, the inefficiency of VoIP compared to VoFR for pure voice applications, as discussed above, can reduce the advantage of using semi-dedicated IP bandwidth. It is not clear whether this enhanced consumer-grade VoIP service will continue to use semi-dedicated IP bandwidth; some service providers are switching to voice over Frame Relay or voice over ATM to reduce network costs.

Comparing VoFR and VoIP Network Topologies and Costs
There is a fundamental difference between FR and IP network topologies; FR is connection oriented, while IP is connectionless. In a FR network, each router/FRAD (voice-enabled or not) that connects to the FR service has one or more virtual connections (VCs) through the service to one or more distant RFRADs. The VCs are either permanent (PVCs) or switched (SVCs). They act like permanent or temporary leased lines connecting the RFRADs together. In the great majority of FR-based networks, the connections are PVCs; SVCs are fairly new and are not widely available, nor are SVC-ready RFRADs.

With Frame Relay, the user pays for the WAN access line (according to the bandwidth and distance of the line) to the FR service, for the port into the service (according to the bandwidth of the access line), and for each PVC between each RFRAD. The WAN access line from the RFRAD to the FR service can carry multiple PVCs. In many cases, there are several PVCs from one RFRAD to another and/or to multiple other RFRADs.

For large networks with fully meshed PVCs (each site connected to every other site with at least one PVC), the costs can skyrocket. A fully meshed 10-site network, for example, would have only 45 PVCs. A fully meshed 100-site network would have 4950 PVCs, while a fully meshed 1000 site network would have 4,999,500 PVCs.

In an IP network, the routers and level-3 switches within the network determine the next router to send the IP packet to according to header information, bandwidth availability and other factors. Unlike Frame Relay, where each frame in a PVC travels the same route, IP frames from the same source to the same destination can take a different route each time. They can even arrive out of order, and are subsequently placed in the correct order.

This has an advantage over FR of avoiding the VC concept and associated meshing costs. However, it has the disadvantage of potentially higher jitter from variable routing. This is especially apparent in the Internet, where there are huge variations in possible packet routes. In most corporate IP networks, however, the variable routing does not appreciably add to the jitter because most corporate IP WANs are not bandwidth constrained and similar direct routes are usually available for consecutive packets.

For corporate networks, therefore, the main difference between public FR and IP VPNs is in the cost of the services and the network topology. When the network topology is primarily 'homerun' or 'hub-and-spoke,' where each branch communicates mainly with headquarters and branch-to-branch communications can be accomplished by routing through headquarters, the FR PVC count is proportional to the number of branch sites. For these network topologies, PVC count is fairly small and public FR costs less than public IP.

For larger, more fully meshed network topologies, where many FR PVCs would be used, public FR costs more than public IP. Table 8 summarizes the costs for separate FR and IP connections and for several networks based on FR and IP WAN services from two prominent North American carriers who offer both.

  Carrier A Carrier B
Single Connection (Kbps) Frame Relay (4) IP Frame Relay (4) IP
56 $277 $480 $282 $700
128 487 764 482 1,100
512 1,047 1,321 1,042 1,900
TI (1544) 2,217 1,780 2,207 2,700
10-site Hub & Spoke (1) 3,785 6,121 3,830 8,900
25-site Hub & Spoke 9,110 13,780 9,225 20,200
50-site Hub & Spoke 18,220 27,560 18,450 40,400
100-site Hub & Spoke 36,440 55,120 36,900 80,800
1000-site Hub & Spoke (5) 355,660 544,080 360,300 797,200
10-site Fully Meshed (2) 3,890 4,800 3,940 7,000
25-site Fully Meshed 15,725 12,000 15,850 17,500
25-site Partially Meshed (3) 10,925 12,000 11,051 17,500
50-site H&S, 6 PVCs/site 26,220 27,560 26,450 40,400
Table 8: A comparison of VoFR and VoIP network costs
1. Using 56K access line to each site, plus appropriate higher speed line to hub location; one multiplexed PVC per remote site for Frame Relay.
2. Using 56K to each site; no hub location, one PVC from each site to every other site.
3. 12 PVCs to each site.
4. Using suitable CIR for data/voice integration.
5. Using dual T3 access at hub, but with multi-T1 pricing.
Costs do not include LEC access charges. No allowances made for any special pricing or discounts.

For most corporate networks, which are usually hub-and-spoke, with a normal mix of data and voice, public FR is less expensive than IP by 35-50%. For predominately voice/fax traffic in hub-and-spoke or point-to-point networks, FR has an even greater advantage because less WAN bandwidth is required. Only for larger, more highly meshed networks (around 25-50 remote sites) do IP VPNs have a cost advantage.

For networks designed with leased lines, WAN costs are identical for the same FR and IP bandwidth. In a typical application where most of the traffic is background data, the bandwidth requirements will be similar, so the WAN costs should be the same. The topology should not favor FR or IP. For pure voice/data networks, IP bandwidth requirements will be about 50% greater, so the IP WAN costs will also be greater, again independent of topology. In many cases, the increase will be in the 25-50% range, and sometimes as much as 60-80%, or more.

Both public FR and IP VPNs are available with WAN access speeds up to T3 (45Mbps). Higher speeds are also available for IP VPNs, an advantage for large networks. There are about 50-60 thousand corporate voice/data public FR networks currently deployed worldwide, while the number of VoIP networks, running as IP over leased lines, IP over public FR, and as IP directly connected to IP VPNs, is in the neighborhood of 2-4 thousand.

Comparing VoFR and VoIP Standards Activity
There are standards committees for both VoFR and VoIP. The work in standardizing VoFR commenced about 5 years ago under the auspices of the Frame Relay Forum (FRF). The FRF has produced two Implementation Agreements (IAs). The first IA is FRF.11, providing for multiplexed PVCs, speech compression using G.726 (ADPCM) and G.729 (8K CS-ACELP), fax demodulation, and the format for carrying multiple voice samples in single frames.

The second IA is FRF.12, providing for frame segmentation across a Frame Relay network. This standardizes an approach to managing the long-packet problem mentioned earlier. The FRF is continuing to work on expanding these standards. They are not yet rich enough to provide for full multivendor interoperability, but several vendors have implemented the FRF standards thus far. In fact, an IA is not really a 'standard,' rather it is an agreement by vendors that can form the basis for a subsequent standard produced by a recognized standards body.

The work in standardizing VoIP is more recent, about 2 years old, and is carried on under the auspices of the IMTC VoIP Forum. The IMTC has recently produced its first VoIP Interoperability Implementation Agreement, IA 1.0. IA 1.0 is based on the ITU H.323 conferencing standard. While H.323 is in fact a standard, IA 1.0 is only an agreement, a precursor to a standard.

The VoIP IA 1.0 specifies agreements in four areas: VoIP connections using the H.323 protocol suite with H.225 for dynamic address mapping; VoIP CODECs G.723.1 and G.711; how to process and transport DTMF signals; and how to manage silence suppression. As with the FRF IAs, IA 1.0 is not yet rich enough to provide for full multivendor interoperability. The work has already begun on IA 2.0.

On the other hand, H.323 is sufficiently complete to provide interoperability with respect to basic voice conferencing using PCs as telephones and a PC client such as Microsoft NetMeeting (H.323 compliant). This application, however, is designed primarily for voice over the Internet, and is not often found in corporate VoIP implementations, as discussed earlier. H.323 Rev 2 is also sufficient to provide the interoperable gatekeeper or directory services for VoIP gateways to 'find' each other across an IP network. This is useful in a multivendor VoIP corporate environment.

SUMMARY

Both VoIP and VoFR offer viable approaches to add voice and fax traffic to corporate data networks. Both also offer viable approaches for building pure voice/fax networks. For most applications, there are no major differences in the quality of the voice/fax communications, assuming the traffic is not being carried over the Internet. There can be, however, material differences in the costs to implement and operate VoIP and VoFR solutions. The differences vary according to the application; voice/fax over corporate data networks and pure voice/fax networking.

On the VDID side, the differences arise from the two inherent VoIP disadvantages: the larger IP packet header; and the lack of a widely deployed router control protocol, such as RSVP, to control packet segmentation. On the WAN services side, the differences arise from the substantial price premium for IP VPN services over FR services. Both differences are summarized according to the application.

Summarizing VoFR and VoIP in Corporate Voice/Data Networks
For corporate voice/data networks, voice/fax-enabled RFRADs with richer functionality are more widely available than similarly equipped VoIP gateways. For some networks, the added functionality is important. In addition, the channel expertise and support for VoFR is better than VoIP, because of the significantly larger installed base and more extensive experience.

IP's larger packet header materially affects the voice/data solution when the WAN access line runs at lower speeds. In the 19.2Kbps - 56/64Kbps range, the effect is especially noticeable. The number of VoIP trunks the line can carry is materially less than the number of VoFR trunks. Above 128Kbps, the effect is marginal, because these WAN access speeds will usually accommodate all of the voice/fax trunks needed at the remote site, whether VoFR or VoIP. The packet segmentation problem is also only material for lower speed lines. For speeds in the 19.2Kbps - 56/64Kbps range, the problem can reduce efficiency by 10-15%. Above 256Kbps, the problem is immaterial.

In most cases, building corporate networks using IP VPNs is more expensive than using public FR services. This is true regardless of voice/fax integration. The exceptions are larger, highly meshed networks, where IP's connectionless nature overcomes FR's costs for large numbers of PVCs. Another advantage of IP VPNs over public FR services is the IP VPNs' availability for dial-in access from scattered remote locations. This can be very valuable for certain organizations, but has little application for branch office networking and affords few opportunities for voice/data integration.

To build integrated corporate voice/data networks when there is an existing IP data network with in-place routers or FRADs, use VoIP gateways. In general, the cost to discard the existing routers and FRADs and install new voice/fax-enabled RFRADs is more expensive than adding VoIP gateways, and there is less disruption deploying the LAN-attached gateway than deploying new WAN devices. This conclusion assumes the WAN access speeds are sufficient to accommodate the added IP voice/fax traffic. If WAN bandwidth must be increased for VoIP but not for VoFR, the cost advantage in VoIP gateway deployment can quickly disappear.

When building a new network for IP data traffic (or for non-IP traffic) that will also carry added voice and fax, use voice/fax-enabled RFRADs and public Frame Relay services. In general, this will be less expensive than using data-only routers or FRADs and VoIP gateways, and will yield better WAN access efficiencies. The voice and fax traffic will traverse the WAN as native FR frames, while the IP data traffic will be encapsulated, probably with the RFC 1490 standard.

The VoFR approach is also better when expanding an existing IP data network by a significant amount, or when upgrading the router equipment of an existing IP data network. Both scenarios offer an opportunity to install voice/fax-enabled RFRADs. VoFR is also the better approach when adding voice and fax traffic to an existing non-IP data network, such as SNA.

With either approach, avoid IP VPNs unless dial-access or some other inherent need for dynamic IP access is required, or the network is large and highly meshed. The FR services are less expensive and more widely available.

Characteristic Frame Relay IP
VDID Functionality Rich Improving
Channel Experienced Young
Installed Base Large Small
Voice Quality Very Good Very Good
Delay Low Low
Jitter Small Small
Voice/Fax Packet Overhead Low High
WAN Access V/F Channel Capacity
- Lower Speed Lines Adequate Marginal
- Higher Speed Lines Adequate Adequate
WAN Access Efficiency Degradation
- Lower Speed Lines 2-4% 10-15%
- Higher Speed Lines 0-2% 0-5%