Diskless node


A diskless node is a workstation or personal computer without disk drives, which employs network booting to load its operating system from a server.
Diskless nodes are sometimes known as network computers or hybrid clients. Hybrid client may either just mean diskless node, or it may be used in a more particular sense to mean a diskless node which runs some, but not all, applications remotely, as in the thin client computing architecture.
Advantages of diskless nodes can include lower production cost, lower running costs, quieter operation, and manageability advantages.
In many universities and in some large organizations, PCs are used in a similar configuration, with some or all applications stored remotely but executed locally—again, for manageability reasons. However, these are not diskless nodes if they still boot from a local hard drive.

Distinction between diskless nodes and centralized computing

Diskless nodes process data, thus using their own CPU and RAM to run software, but do not store data persistently—that task is handed off to a server. This is distinct from thin clients, in which all significant processing happens remotely, on the server—the only software that runs on a thin client is the "thin" client software, which handles simple input/output tasks to communicate with the user, such as drawing a dialog box on the display or waiting for user input.
A collective term encompassing both thin client computing, and its technological predecessor, text terminals, is centralized computing. Thin clients and text terminals can both require powerful central processing facilities in the servers, in order to perform all significant processing tasks for all of the clients.
Diskless nodes can be seen as a compromise between fat clients and centralized computing: they use central storage for efficiency but do not require centralized processing, and they make use of the processing power of even the slowest contemporary CPUs, which would tend to sit idle for much of the time under the centralized computing model.
Feature                                 Centralized computing or thin client   Diskless node   Dataless node   Fat client
Local hard drives used for data         No                                     No              No              Yes
Local hard drives used for OS           No                                     No              Yes             Yes
Local general-purpose processing used   No                                     Yes             Yes             Yes

Principles of operation

The operating system for a diskless node is loaded from a server, using network booting. In some cases, removable storage may be used to initiate the bootstrap process, such as a USB flash drive, or other bootable media such as a floppy disk, CD or DVD. However, the firmware in many modern computers can be configured to locate a server and begin the bootup process automatically, without the need to insert bootable media.
A book-size LAN station, an early diskless system based on an Intel 80286 processor, was produced by Taiwan's Flytech Technology circa 1991.
For network auto-booting, the Preboot Execution Environment (PXE) or Bootstrap Protocol (BOOTP) network protocols are commonly used to find a server with files for booting the device. Standard full-size desktop PCs can be network-booted in this manner with an add-on network card that includes a Universal Network Device Interface (UNDI) boot ROM. Diskless network booting is commonly a built-in feature of desktop and laptop PCs intended for business use, since it can be used on an otherwise disk-booted standard desktop computer to remotely run diagnostics, to install software, or to apply a disk image to the local hard drive.
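As a rough illustration of what a network-booting client sends when it looks for a boot server, the sketch below builds a BOOTP request with the fixed field layout defined in RFC 951. The function name `build_bootrequest`, the example MAC address and the transaction ID are illustrative assumptions. A real PXE client sends a DHCP-style request carrying additional options, which is not shown here.

```python
import struct

BOOTREQUEST = 1      # op code for a client request (RFC 951)
HTYPE_ETHERNET = 1   # hardware type: Ethernet
BOOTP_SERVER_PORT = 67
BOOTP_CLIENT_PORT = 68


def build_bootrequest(mac: bytes, xid: int, secs: int = 0) -> bytes:
    """Return a 300-byte BOOTP request for a 6-byte Ethernet address."""
    if len(mac) != 6:
        raise ValueError("expected a 6-byte Ethernet address")
    return struct.pack(
        "!BBBBIHH4s4s4s4s16s64s128s64s",
        BOOTREQUEST,      # op
        HTYPE_ETHERNET,   # htype
        len(mac),         # hlen
        0,                # hops
        xid,              # transaction ID chosen by the client
        secs,             # seconds since the client started booting
        0,                # unused in RFC 951
        bytes(4),         # ciaddr: client IP address, not yet known
        bytes(4),         # yiaddr: filled in by the server
        bytes(4),         # siaddr: filled in by the server
        bytes(4),         # giaddr: used by relay agents
        mac,              # chaddr: struct pads this to 16 bytes with zeros
        b"",              # sname: no particular server requested
        b"",              # file: no particular boot file requested
        b"",              # vend: vendor-specific area left empty
    )


# Hypothetical client address and transaction ID, for illustration only.
packet = build_bootrequest(bytes.fromhex("020000000001"), xid=0x12345678)
print(len(packet))  # 300
```

A client would broadcast such a packet from UDP port 68 to UDP port 67 and wait for a reply that names the server and the boot file to fetch.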
After the bootstrapping process has been initiated, as described above, bootstrapping will take place according to one of three main approaches.

The third approach, in which the client reads from and writes to a virtual disk, makes it easier to run a client OS than keeping a complete disk image in RAM or using a read-only file system. In this approach, the system uses a "write cache" that stores all data that a diskless node has written. This write cache is usually a file stored on a server, but it can also be a portion of the client's RAM. The write cache can be persistent or volatile. When it is volatile, all data that a specific client has written to the virtual disk is discarded when that client is rebooted; user data can nevertheless remain persistent if it is recorded in user profiles or home folders. The two major commercial products that allow the deployment of diskless nodes booting Microsoft Windows or Linux client OSes use such write caches. The Citrix product cannot use a persistent write cache, but the VHD and HP products can.
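The write-cache idea can be sketched in a few lines. The class below is a toy model, not the mechanism of any particular product: the name `WriteCachedDisk`, the list-of-sectors image and the in-memory dictionary standing in for the cache file are all assumptions made for illustration.

```python
from typing import Dict, List


class WriteCachedDisk:
    """One client's view of a shared disk image plus its own write cache."""

    def __init__(self, shared_image: List[bytes], persistent: bool = False) -> None:
        self._image = shared_image          # shared sectors, never modified here
        self._cache: Dict[int, bytes] = {}  # sector number -> data this client wrote
        self.persistent = persistent

    def read(self, sector: int) -> bytes:
        # Prefer the client's own version of a sector if it has written one.
        if sector in self._cache:
            return self._cache[sector]
        return self._image[sector]

    def write(self, sector: int, data: bytes) -> None:
        # Writes never touch the shared image; they land in the cache.
        self._cache[sector] = data

    def reboot(self) -> None:
        # A volatile cache is discarded on reboot; a persistent one survives.
        if not self.persistent:
            self._cache.clear()


image = [b"boot", b"kernel", b"apps"]
node_a = WriteCachedDisk(image)                   # volatile cache
node_b = WriteCachedDisk(image, persistent=True)  # persistent cache

node_a.write(2, b"apps+tmp")
node_b.write(2, b"apps+settings")
print(node_a.read(2), node_b.read(2), image[2])

node_a.reboot()
node_b.reboot()
print(node_a.read(2), node_b.read(2))
```

After the reboot, the volatile node sees the original sector again, while the persistent node still sees its own change. The shared image is untouched in both cases.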

Diskless Windows nodes

Windows 3.x and Windows 95 OSR1 supported Remote Boot operations from NetWare servers, Windows NT Servers and even DEC Pathworks servers.
Third-party software vendors such as Qualystem, LanWorks, Ardence, APCT and Xtreamining Technology have developed and marketed products for remote-booting newer versions of the Windows product line: Windows 95 OSR2 and Windows 98 were supported by Qualystem and LanWorks, Windows NT was supported by APCT and Ardence, and Windows 2000, XP, 2003, Vista and 7 are supported by Hewlett Packard and Citrix Systems.

Comparison with fat clients

Software installation and maintenance

With essentially a single OS image for an array of machines, installing software and maintaining installed software can be more efficient. Furthermore, any system changes made during operation can either be wiped out when the power is removed (as with Windows XP Embedded remote boot) or be prohibited entirely. This allows use in public-access areas or in schools, where users might wish to experiment or attempt to "hack" the system.
However, it is not necessary to implement network booting to achieve either of the above advantages: ordinary PCs can be configured to download and reinstall their operating systems on a nightly basis, although this takes extra work compared with using the shared disk image that diskless nodes boot from.
Modern diskless nodes can share the very same disk image, using a 1:N relationship. This makes it very easy to install and maintain software applications: the administrator needs to install or maintain the application only once, and the clients get the new application as soon as they boot from the updated image. Disk image sharing is possible because each client uses a write cache: no client competes with another for writes to the shared disk image, because each client writes to its own cache.
All modern diskless node systems can also use a 1:1 client-to-disk-image relationship, where one client "owns" one disk image and writes directly into that image. No write cache is used in this case.
A shared disk image is usually modified as follows:
  1. The administrator makes a copy of the shared disk image that is to be updated.
  2. The administrator boots a diskless node in 1:1 mode from that copy.
  3. The administrator makes the required modifications to the disk image.
  4. The administrator shuts down the diskless node that was using the disk image in 1:1 mode.
  5. The administrator shares the modified disk image.
  6. The diskless nodes use the new shared disk image as soon as they are rebooted.
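The steps above can be modelled with a small sketch. Everything here is hypothetical: `ImageServer`, the image names "v1" and "v2", and the dictionary standing in for the contents of a disk image are invented for illustration and do not correspond to any product's interface.

```python
class ImageServer:
    """Toy model: named disk images plus a pointer to the one shared 1:N."""

    def __init__(self, name, files):
        self.images = {name: dict(files)}
        self.shared = name

    def copy_image(self, source, new_name):
        # Step 1: work on a copy so running clients are not disturbed.
        self.images[new_name] = dict(self.images[source])

    def publish(self, name):
        # Step 5: make the modified image the shared one.
        self.shared = name

    def boot(self):
        # Step 6: a node picks up whichever image is shared when it boots.
        return dict(self.images[self.shared])


server = ImageServer("v1", {"editor": "1.0"})
running_node = server.boot()        # a client that booted before the update

server.copy_image("v1", "v2")       # step 1
admin_disk = server.images["v2"]    # step 2: 1:1 mode, writes go straight to the copy
admin_disk["editor"] = "2.0"        # step 3
# Step 4 (shutting down the administrator's node) has nothing to model here.
server.publish("v2")                # step 5
rebooted_node = server.boot()       # step 6

print(running_node["editor"], rebooted_node["editor"])  # 1.0 2.0
```

Clients that are already running keep the old image until they reboot, which is why the update does not interrupt them.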

Centralized storage

The use of central disk storage also makes more efficient use of disk storage. This can cut storage costs, freeing up capital to invest in more reliable, modern storage technologies, such as RAID arrays which support redundant operation, and storage area networks which allow hot-adding of storage without any interruption. Further, it means that losses of disk drives to mechanical or electrical failure—which are statistically highly probable events over a timeframe of years, with a large number of disks involved—are often both less likely to happen and less likely to cause interruption. This also means that the nodes themselves are less likely to have hardware failures than fat clients.
Diskless nodes share these advantages with thin clients.
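The point about disk failures becoming likely when many disks are involved can be illustrated with a short calculation. The numbers below are assumptions chosen for illustration (a 3% chance that any one disk fails within a year, 200 fat clients with one disk each, and 8 disks in a central array), and failures are assumed to be independent.

```python
def p_any_failure(n_disks: int, p_single: float) -> float:
    """Probability that at least one of n independent disks fails."""
    return 1.0 - (1.0 - p_single) ** n_disks


ASSUMED_ANNUAL_FAILURE_RATE = 0.03  # illustrative figure, not a measurement

fat_clients = p_any_failure(200, ASSUMED_ANNUAL_FAILURE_RATE)
central_array = p_any_failure(8, ASSUMED_ANNUAL_FAILURE_RATE)

print(f"{fat_clients:.3f}")    # about 0.998
print(f"{central_array:.3f}")  # about 0.216
```

Under these assumptions a failure somewhere among the fat clients is almost certain within a year, while a failure in the small central array is much less likely. If that array is also redundant, as with RAID, a single failure need not interrupt service.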

Performance of centralized storage

However, this storage efficiency can come at a price: as often happens in computing, it is sometimes paid for with decreased performance.
Large numbers of nodes making demands on the same server simultaneously can slow down everyone's experience. However, this can be mitigated by installing large amounts of RAM on the server, by adding more servers, or by adding more disks to a RAID array. In any case this is also a problem which can affect any client-server network to some extent, since, of course, fat clients also use servers to store user data.
Indeed, user data may be much more significant in size and may be accessed far more frequently than operating systems and programs in some environments, so moving to a diskless model will not necessarily cause a noticeable degradation in performance.
Greater network bandwidth will also be used in a diskless model, compared to a fat client model. This does not necessarily mean that a higher capacity network infrastructure will need to be installed—it could simply mean that a higher proportion of the existing network capacity will be used.
Finally, the combination of network data transfer latencies and contention latencies can lead to an unacceptable degradation in performance compared to using local drives, depending on the nature of the application and the capacity of the network infrastructure and the server.
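A back-of-the-envelope estimate shows how contention for the server's network link grows with the number of nodes. The function below is a rough sketch: the figures (200 MB read per node at boot, a 1000 Mbit/s server link) are assumptions, and it ignores protocol overhead, latency, server disk throughput and caching, so it gives only a lower bound.

```python
def transfer_seconds(nodes: int, megabytes_per_node: float,
                     server_link_mbit_s: float) -> float:
    """Lower bound on the time for all nodes to pull their data through one link."""
    total_megabits = nodes * megabytes_per_node * 8
    return total_megabits / server_link_mbit_s


one_node = transfer_seconds(1, 200, 1000)
fifty_nodes = transfer_seconds(50, 200, 1000)

print(one_node)     # 1.6
print(fifty_nodes)  # 80.0
```

With these assumed numbers, a single node could load its data in under two seconds, but fifty nodes booting at the same time would need more than a minute between them, which is the kind of degradation that adding servers or network capacity is meant to address.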

Other advantages

Diskless nodes are also useful in potentially hazardous environments where computers are likely to be damaged or destroyed, since inexpensive nodes with minimal hardware are cheaper to replace. Again, thin clients can also be used here.
Diskless machines may also consume little power and make little noise, which implies potential environmental benefits and makes them ideal for some computer cluster applications.

Comparison with thin clients

Major corporations tend instead to implement thin clients, since much lower-specification hardware can be used for the client. Diskless nodes can also be used as thin clients. Moreover, thin client computers are increasing in power to the point where they are becoming suitable as fully fledged diskless workstations for some applications.
Both thin client and diskless node architectures employ diskless clients which have advantages over fat clients, but differ with regard to the location of processing.

Advantages of diskless nodes over thin clients