Xe :verified: @cadey@pony.social
replies: #2 #5 #9 #14 #16 #17

SRE graph scrying test:

I'm backing up a bunch of files to S3, there's moments where the throughput goes down and moments where it goes up.

Assuming no network activity in a way that matters, why would the throughput number be going up and down like this?

Daniel L @glitchf@octodon.social
in reply to #1 - replies: #3

@cadey The numbers going down mean lots of little files. Going up means big, long transfers.

Cal Paterson @calpaterson@fosstodon.org

@cadey @glitchf some of the better s3 tooling (eg rclone) has ways to mitigate this via more concurrency
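[A sketch of what that looks like with rclone: the flags below are real rclone options, but the source path, remote name, and values are hypothetical and should be tuned to the workload.]

```shell
# Keep many small objects in flight at once instead of one at a time.
# --transfers: number of files uploaded concurrently (default 4)
# --checkers: number of files checked in parallel (default 8)
# --s3-upload-concurrency: parallel multipart chunks per large file
rclone copy /tank/backup remote:bucket \
  --transfers 64 --checkers 128 --s3-upload-concurrency 8
```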

luciano @luciano@parens.social
in reply to #1 - replies: #6

@cadey disk go brr vs disk go 🥶?

Xe :verified: @cadey@pony.social
in reply to #5 - replies: #7

@luciano What makes you think that?

Mikle_Bond @Mikle_Bond@pony.social
in reply to #6 - replies: #8

@cadey @luciano
on a side note, it might be close to being correct. A drive can show this access pattern at some point of its life, usually before death.

Here's your unscheduled reminder to check if smartd is configured properly)

Xe :verified: @cadey@pony.social

@Mikle_Bond @luciano I mean I'm also doing a ZFS resilvering because a drive died lol

Xe :verified: @cadey@pony.social

@Schneems Nope! The remote service (which is like S3 but not S3) is able to accept data at my NAS' line rate, and my drives are able to supply data also at line rate.

Richard Schneeman @Schneems@ruby.social

@cadey super weird.

Xe :verified: @cadey@pony.social
Spoilers

@Schneems Biblical amounts of small files. There's more overhead in creating lots of little files than there is in creating a few big ones.
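[A toy model of why the graph dips: every object upload pays a roughly fixed per-request cost (TLS/HTTP round trip, object metadata) on top of the wire time for its bytes. The line rate and the ~50 ms per-object overhead below are assumed, illustrative numbers, not measurements from this thread.]

```python
LINE_RATE = 125_000_000      # bytes/sec, ~1 Gbit/s (assumed)
PER_OBJECT_OVERHEAD = 0.05   # seconds of fixed cost per PUT (assumed)

def effective_throughput(total_bytes: int, n_files: int) -> float:
    """Bytes/sec achieved once per-file overhead is counted."""
    wire_time = total_bytes / LINE_RATE          # time to move the bytes
    overhead = n_files * PER_OBJECT_OVERHEAD     # time burned on requests
    return total_bytes / (wire_time + overhead)

# The same gibibyte, as one big object vs. a million 1 KiB objects:
one_big = effective_throughput(2**30, 1)
many_small = effective_throughput(2**30, 1_000_000)
```

Under these assumptions the single big object runs at nearly line rate, while the million tiny objects spend almost all their time on request overhead, which is exactly the sawtooth you see as the backup alternates between big and small files.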

Mikle_Bond @Mikle_Bond@pony.social
Spoilers

@cadey @Schneems
The difference in throughput becomes even more drastic with SMB on windows and even more educational.

fogs :pride: @andrew@montagne.uk

@cadey lots of small files vs small number of big files?

@lillian

@cadey a LLM is generating `aws s3 cp` commands for each file and different filenames (presumably that parse into more tokens) take longer for the model to generate commands for

AMS @AMS@infosec.exchange

@cadey Copying kernel source or some similar pile of tiny files?