A Closer Look at Converged Data Center Networks, Part 2
Scott Carlson, Bill Holland, Mike Ko, Siva Kodukula, Jeff Palm and Renato Recio, IBM
Sep 05, 2007 (4:26 PM)
URL: http://www.commsdesign.com/showArticle.jhtml?articleID=201804028
Click here for Part 1.
Why create Convergence Enhanced Ethernet?
The industry has pursued several fabric convergence options, including InfiniBand (IB) and iSCSI. Each of these options has done well in a segment of the IT market. However, none has been able to fully satisfy enterprise data center storage requirements.
Enterprise data center storage customers are conservative when it comes to deploying new storage standards. Most want the new standard's ecosystem to be fully available and, more importantly, vetted out, before they deploy it. In other words, for any given new standard initiative, customers want all the basic capabilities (e.g. reliability, availability, serviceability, performance) to be robust enough to meet their application environment's demands. Additionally, these customers have a large install base of Fibre Channel components (servers, storage and switches) and want to protect that investment.
We will cover how IB, iSCSI and Fibre Channel over Internet Protocol (FCIP) compares to Fibre Channel over Convergence Enhanced Ethernet (FCoCEE) later in this article. But first we will describe why Convergence Enhanced Ethernet (CEE) was needed, versus just using Fibre Channel (FC) over (today's) Ethernet.
To satisfy performance requirements, storage fabrics require a lossless traffic service. Today's Ethernet can provide lossless behavior through the use of the IEEE 802.3x Pause standard. This standard allows a congested Ethernet port to assert back-pressure through an XON/XOFF protocol. Using IEEE 802.3x, the receiver will send an XOFF when it can't take any more data. The sender will then pause until the receiver sends an XON to resume communications. IEEE 802.3x works fine for Ethernet networks that are dedicated to storage traffic, because congestion only affects the storage traffic.
If Fibre Channel were to run directly over an Ethernet link that is also carrying other traffic classes (e.g. LAN or HPC), the IEEE 802.3x pause mechanism causes head of line blocking for all traffic classes flowing over the congested fabric segment. For example, if a storage flow experiences periods of traffic bursts that cause congestion, the congestion spreads to other traffic classes, such as sockets communications between servers or between servers and clients. Obviously, performance collapse is not acceptable for a convergence fabric. Over-provisioning can always be used, but is more expensive and must be continually upgraded to stay far above the possible IO demands.
The industry is working on enhancements to Ethernet that provide per priority flow and congestion control. The standard version of these enhancements is known as Convergence Enhanced Ethernet (CEE). For customers seeking FC and Ethernet convergence, FCoCEE is the model we advocate.
What is Convergence Enhanced Ethernet?
IBM is working with Broadcom, Brocade, Cisco, Emulex, Intel and several other companies to create a set of IEEE 802 standard functions that improve Ethernet's ability to converge fabrics in the data center. Convergence Enhanced Ethernet is the term used to refer to the IEEE 802 standard version of these functions.
CEE consists of the following enhancements being pursued in IEEE 802 working groups:
As shown in Figure 6, the functions described above allow CEE to provide traffic differentiation, such that multiple traffic classes can flow over the same link, without impacting each other. CEE can be combined with iSCSI to provide traffic differentiation at the Ethernet link layer for iSCSI data flows. For example, network adapters can associate a specific traffic class with iSCSI traffic and a different traffic class for application sockets communications. CEE can then be used to provide service differentiation between these two traffic classes.

There are two additional mechanisms that will further enhance convergence over Ethernet, but are not absolutely required, these are:
Figure 6 also shows that Fibre Channel traffic can take advantage of CEE. IBM is working with Brocade, Cisco, Emulex, Intel and several other companies to create a set of IEEE 802 standard functions that improve Ethernet's ability to converge fabrics in the data center. The next section will describe how CEE can be used in fabric convergence scenarios.
CEE Convergence Options
CEE enhances the NAS and iSCSI option covered earlier, by providing traffic differentiation at the link layer. Today, data center switches can use network layer traffic differentiation, but that level requires switches with layer 2+ capabilities, which have higher latencies (and cost) than layer 2 switches. Similar to layer 2+ switches that support traffic differentiation, CEE switches must support per traffic class resources, which means more buffer resources than switches that don't support traffic differentiation. However, unlike today's layer 2+ switches that support traffic differentiation, CEE switches do not have to retain IP routing tables or associated contexts.
As shown in Figure 7, CEE also enables an emerging convergence option, FC directly over CEE. On the server side, this option uses either FC or FCoCEE based adapters to connect servers to storage. If FCoCEE is used, then the same adapter can also be used to connect the server to other servers or networking equipment. That is, the same adapter can be used to carry other traffic classes, such as HPC, LAN or IPC messages. On the storage side, this option uses either FC or FCoCEE based adapters to connect servers.

For data centers that have a large FC install base, FCoCEE enables new servers or storage deployed in that data center to use a single link for both Ethernet and FC communications. Many enterprise data center customers will likely want the infrastructure required to enable this option to be vetted out before production deployment begins. That infrastructure includes: interoperable support from FCoCEE adapter vendors, interoperable support from FCoCEE switch and gateway vendors and most importantly the management software required to interlink the virtual and physical FC fabric to the physical CEE fabric. Given that a portion of the HPC and Analytics markets is interested in network convergence today, we expect FCoCEE to fare well in these two market segments (esp. when IB's high bandwidth and low latency are not needed).

When comparing these two future options with today's options, one important consideration is the management complexity. NAS and iSCSI over Ethernet or CEE use a single network management infrastructure for storage, LAN and IPC. However, FCoCEE requires two network management infrastructures: an FC based infrastructure that manages virtual FC (i.e. FCoCEE attached) and physical FC (i.e. natively attached) components; and an Ethernet infrastructure to manage the underlying physical Ethernet or CEE fabric. The interlinking between these the FC and Ethernet network management infrastructures will be necessary for production deployment of FCoCEE.
As can be seen from the table below, no single solution fully satisfies all the requirements. Our view is that IB fits well in cluster convergence scenarios where performance is critical. NAS and iSCSI fit well in mid-sized environments and for middle tier storage in multi-tier server environments. NAS and iSCSI are also well suited in cases where no FC exists, because they require one less fabric management type (i.e. all the fabric management is Ethernet based). Given that large enterprises place a high bar on storage (availability, security, quality, etc...), we expect FCoCEE will initially play in the HPC and Analytics market as an alternative to IB for environments where performance is less critical. To enable enterprise deployment of FCoCEE, we believe the functions listed in the table need to be standardized, developed, tested and hardened in real world environments. The table shows the current state of these functions for each option. As FCoCEE matures (i.e. these functions are made available), we expect it will play well in large enterprises wanting to pursue FC convergence. For a brief overview of the functions, see the Appendix.

Appendix
Defines the terms in the above table:
Functions needed for enterprise deployment: