We Ain't Afraid of No File Fragmentation: Causes and Prevention of Its Performance Impact on Modern Flash SSDs

Authors: 

Yuhun Jun, Sungkyunkwan University and Samsung Electronics Co., Ltd.; Shinhyun Park, Sungkyunkwan University; Jeong-Uk Kang, Samsung Electronics Co., Ltd.; Sang-Hoon Kim, Ajou University; Euiseong Seo, Sungkyunkwan University

Awarded Best Paper!

Abstract: 

A few studies reported that fragmentation still adversely affects the performance of flash solid-state disks (SSDs) particularly through request splitting. This research investigates the fragmentation-induced performance degradation across three levels: kernel I/O path, host-storage interface, and flash memory accesses in SSDs. Our analysis reveals that, contrary to assertions in existing literature, the primary cause of the degraded performance is not due to request splitting but stems from a significant increase in die-level collisions. In SSDs, when other writes come between writes of neighboring file blocks, the file blocks are not placed on consecutive dies, resulting in random die allocation. This randomness escalates the chances of die-level collisions, causing deteriorated read performance later. We also reveal that this may happen when a file is overwritten. To counteract this, we propose an NVMe command extension combined with a page-to-die allocation algorithm designed to ensure that contiguous blocks always land on successive dies, even in the face of file fragmentation or overwrites. Evaluations with commercial SSDs and an SSD emulator indicate that our approach effectively curtails the read performance drop arising from both fragmentation and overwrites, all without the need for defragmentation. Representatively, when a 162 MB SQLite database was fragmented into 10,011 pieces, our approach limited the performance drop to 3.5%, while the conventional system experienced a 40% decline.

Open Access Media

USENIX is committed to Open Access to the research presented at our events. Papers and proceedings are freely available to everyone once the event begins. Any video, audio, and/or slides that are posted after the event are also free and open to everyone. Support USENIX and our commitment to Open Access.

BibTeX
@inproceedings {294797,
author = {Yuhun Jun and Shinhyun Park and Jeong-Uk Kang and Sang-Hoon Kim and Euiseong Seo},
title = {We Ain{\textquoteright}t Afraid of No File Fragmentation: Causes and Prevention of Its Performance Impact on Modern Flash {SSDs}},
booktitle = {22nd USENIX Conference on File and Storage Technologies (FAST 24)},
year = {2024},
isbn = {978-1-939133-38-0},
address = {Santa Clara, CA},
pages = {193--208},
url = {https://www.usenix.org/conference/fast24/presentation/jun},
publisher = {USENIX Association},
month = feb
}