Also note what a consumer SSDs can be made even with a single flash chip. A more performant ones are made of bunch of chips internally (essentially a RAID0 with some magic) so they can do a parallel operations if the data resides on the different flash blocks. Still, if your thread is only doing one operation a time with blocks < flash rewrite block size you will hit the write amplification anyway.
I think if you do the same test but without a memory mapped file (ie let the OS and disk subsystem do their thing) you will get much more speed.
This [0] comment is totally on point.
Also note what a consumer SSDs can be made even with a single flash chip. A more performant ones are made of bunch of chips internally (essentially a RAID0 with some magic) so they can do a parallel operations if the data resides on the different flash blocks. Still, if your thread is only doing one operation a time with blocks < flash rewrite block size you will hit the write amplification anyway.
I think if you do the same test but without a memory mapped file (ie let the OS and disk subsystem do their thing) you will get much more speed.
[0] https://news.ycombinator.com/item?id=33751973