What are you looking for ?
Advertise with us
RAIDON

Fujitsu Develops Optimized Software-Controlled SSD for Big Data Processing

Application to in-memory databases, enabling "three-fold" increase in processing performance

Fujitsu Laboratories Ltd.  has developed a SSD in which flash memory can be directly controlled by software running on a server.

By optimizing data positioning for access from an in-memory database, it achieved a processing performance three times faster than that of ordinary SSDs.

With in-memory databases, which enable high-speed analysis by loading data onto servers on a type of memory called DRAM (Dynamic Random Access Memory), when the volume of data exceeds the capacity of memory, lags in access to storage and other factors reduce processing speeds. As a result, there has been a desire for technology that could expand memory using high-speed SSDs.

Fujitsu Laboratories has developed a SSD that enables R/W commands for each flash memory chip directly from software. It also developed a ‘read-ahead’ feature that, in accordance with the access pattern, enables parallel data retrievals from multiple flash memory chips without the instructions interfering with each other. By loading data into DRAM before using the data from the in-memory database, data usage and data loading are processed simultaneously, enabling high-speed big data processing, with no access lags, even with limited DRAM capacity.

Background
In recent years, the amount of data processed by computers, including a variety of data from sensors, is increasing. Accordingly, there is a rising need for high-speed processing technology for big data analytics, which generates new value from these massive volumes of data.

With in-memory databases, which achieve high-speed analysis by storing all data in DRAM on servers, large volumes of data that exceed DRAM capacity simply cannot be processed. One way of resolving this issue is to expand memory capacity by using high-capacity SSDs, and progress is being made on technologies for separately using DRAM and memory expansion.

Memory expansion for in-memory database
Figure1

FUJITSU_SSD_FIG1

The performance of memory expanded with SSDs depends on the performance of the SSD itself, and because this impacts analytical performance, there has been a desire for faster SSDs.

Issues
SSDs are equipped with multiple flash memory chips, but from the perspective of software running on a server, the inside of an SSD is a black box, and R/W operations cannot be directly performed on the flash memory inside. As a result, there are times when R/W instructions are competing for the same flash memory chip which is a factor that degrades performance.

About developed technology
Fujitsu Laboratories has developed an SSD that enables R/W commands for each flash memory chip directly from software running on a server. It has also developed software that distributes read instructions from an in-memory database among multiple flash memory chips to operate in parallel. Features of this technology are described below.

Software-controlled SSD that enables R/W commands
directly to multiple flash memory chips from software running on server
In a prototype, an interface called PCIe was used, which enables high-speed data transmissions and is installed on typical motherboards. To fully utilize the wide bandwidth of PCIe, Fujitsu Laboratories developed a software-controlled SSD, equipped with 16 control channels and 256 flash memory chips, in which the interface enables R/W commands directly to each flash memory chip from server software (figure 2). As a result, transmission speeds of roughly 5.5 gigabytes per second were achieved.

Developed software-controlled SSD
Figure 2

FUJITSU_SSD_FIG2

 

Technology for parallel ‘read-ahead’ from multiple flash memory chips
prior to using data from  in-memory database
Fujitsu Laboratories also developed software that analyzes the memory access patterns of an in-memory database and, instead of retrieving the data from flash memory each time, retrieves the data in parallel from multiple flash memory chips in advance so that the data are read instantaneously, without competing for access.

Accelerating speed of in-memory database
Figure 3

FUJITSU_SSD_FIG3

Results
With the developed software-controlled SSD and technology for data ‘read-ahead,’ company’s labs was able to improve processing performance by roughly three times compared to a conventional SSD, even with limited DRAM capacity. This technology therefore enables high-speed analysis of the increasing volume of big data.

Future plans
Fujitsu Laboratories plans to increase the parallelization of data retrievals and consider applications beyond in-memory databases with the aim of bringing this technology into practical use in fiscal 2017.

Articles_bottom
ExaGrid
AIC
ATTOtarget="_blank"
OPEN-E