Oldest - Comments - selfhosted - Sprites kbin instance

This magazine is from a federated server and may be incomplete. Browse more on the original instance.

TCB13, 1 year ago in Is this Seagate Exos drive too good to be true?

It depends. They’re simply the most annoying drives out there because Seagate on their wisdom decided to remove half of the SMART data from reports and they won’t let you change the power settings like other drives. Those drives will never spin down, they’ll even report to the system they’re spun down while in fact they’ll be still running at a lower speed. They also make a LOT of noise.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

hperrin, 1 year ago

Aren’t they meant to go in data centers? You wouldn’t want a drive in a data center to spin down. That introduces latency in getting the data off of them.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

TCB13, 1 year ago

That should be a choice of the OS / controller card not of the drive itself. Also what datacenter wants to run drives that don’t report half of the SMART data just because they felt like it?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

lemmyvore, 1 year ago

Data centers replace drives when they fail and that’s about it. They don’t care much about SMART data.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

fruitycoder, 1 year ago

We used to use smart data to predict when to order new drives and on really bad looking days increase our redundancy. Nothing like getting a bad series of drives for PB of data to make you paranoid I guess.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

lemmyvore, 1 year ago

What kind of attributes did you find relevant? I imagine the 19x codes…

I’ve read the Blackblaze statistics and I’m using a tool (Scrutiny) that takes those stats into account for computing failure probability, but at the end of the day the most reliable tell is when a drive gets kicked out of an array (and/or can’t pass the long smart test anymore).

Meanwhile, I have drives with “lesser” attributes sitting on warning values (like command timeout) and ofc I monitor them and have good drives on standby, but they still seem to chug along fine for now.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ScreaminOctopus, 1 year ago

I got a set off ebay, Jesus christ they’re loud. I ended up returning them cause I could hear the grinding through my whole house

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Lem453, 1 year ago

I have 3 14tb exos drives. I have them in a Roswell 4u hotseap chassis. Running unraid.

It’s nearly inaudible over the very reasonable case fans. No grinding noises. I can hear the heads moving a bit but it’s quite subtle. Not sure why people have such different experiences with these

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

TCB13, 1 year ago

I’m questioning your auditory acuity :P

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

czardestructo, 1 year ago

I noticed when they first spin up on boot they do some sub routine and they’re pretty loud and chatty. First time I heard it I was spooked but it worked fine and I just use it for backup so I just moved on. Once it’s on and in normal operation it’s like any other disk I’ve used over the decades. Nothing as loud as an old scsci disk or a quantum fireball.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

TCB13, 1 year ago

Ahaha that’s about what they do.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

czardestructo, 1 year ago (edited 1 year ago)

I have an Exos x16 and x18 drive and they both spin down fine in Debian using hdparm. I use them for cold storage and they’re perfectly adequate.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

TCB13, 1 year ago

Care you share your hdparm config then?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

czardestructo, 1 year ago

It’s really boring, Debian 12: /dev/disk/by-uuid/8f041da5-6f7a-4ff5-befa-2d3cc61a382c { spindown_time = 241 write_cache = off }

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

TCB13, 1 year ago

Tried that and doesn’t seem to work. :(

Relevant documentation for others about -S / spindown_time:

Values from 1 to 240 specify multiples of 5 seconds, yielding timeouts from 5 seconds to 20 minutes. Values from 241 to 251 specify from 1 to 11 units of 30 minutes, yielding timeouts from 30 minutes to 5.5 hours. A value of 252 signifies a timeout of 21 minutes.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Dark_Arc, 1 year ago in Suggestions for Short Rack Mount Case

www.amazon.com/gp/aw/d/B09227RQV2?psc=1&ref=p…

This is my favorite rack mount chassis I’ve worked with … and it coincidentally is in that ballpark.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Lettuceeatlettuce, 1 year ago

Interesting design, I’ll look at it, thanks!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

towerful, 1 year ago (edited 1 year ago) in How would you build a GPU-heavy node?

If you are doing high bandwidth GPU work, then PCIe lanes of consumer CPUs are going to be the bottleneck, as they generally only support 16 lanes.
Then there are the threadrippers, xeons and all the server/professional class CPUs that will do 40+ lanes of PCIe.

A lane of PCIe3.0 is about 1GBps (Byte not bit).
So, if you know your workload and bandwidth requirements, then you can work from that.
If you don’t need full 16 lanes per GPU, then a motherboard that supports bifurcation will allow you to run 4 GPUs with 4 lanes each from a CPU that has 16 lanes if PCIe. That’s 4GBps per GPU, or 32Gbps.
If it’s just for transcoding, and you are running into limitations of consumer GPUs (which I think are limited to 3 simultaneous streams), you could get a pro/server GPU like the Nvidia quadros, which have a certain amount of resources but are unlimited in the number of streams it can process (so, it might be able to do 300 FPS of 1080p. If your content is 1080p 30fps, that’s 10 streams). From that, you can work out bandwidth requirements, and see if you need more than 4 lanes per GPU.

I’m not sure what’s required for AI. I feel like it is similar to crypto mining, massive compute but relatively small amounts of data.

Ultimately, if you think your workload can consume more than 4 lanes per GPU, then you have to think about where that data is coming from. If it’s coming from disk, then you are going to need raid0 NVMe storage which will take up additional PCIe lanes.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ielisa, 1 year ago

Nvidia transcode limit is 5 for consumer GPUs these days, and its very easy to lift that limit if you need with github.com/keylase/nvidia-patch

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

towerful, 1 year ago

5? Holy heck, that’s amazing. I remember helping people that had built streaming rigs to use during the pandemic, and wondering why their production was stuttering and having issues with a bunch remote callers. Some of that work ended up being CPU bound.
Although, looks like that patch is for Linux? Not much use if your running vmix or some other windows-only software.
In OPs case, however, that’s not a problem

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

ielisa, 1 year ago

I think you can get it to work with windows somehow , but I’ve never needed to try: github.com/keylase/nvidia-patch/issues/520

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

grue, 1 year ago

I’m not sure what’s required for AI. I feel like it is similar to crypto mining, massive compute but relatively small amounts of data.

If you’re talking about training models, I think it requires both massive compute and massive amounts of data.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

shnizmuffin, 1 year ago in Hardware question

Probably not!

What models of GPU and Motherboard are you using?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

AimlessNameless, 1 year ago

I’ve got an Nvidia Tesla P40 and haven’t purchased a motherboard yet. It’s currently sitting and doing nothing in my DL380.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

bigredgiraffe, 1 year ago

Do you want to not use your DL380? IF no it might make a good moonlight host!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

AimlessNameless, 1 year ago

My DL380 draws about 200W idle so I’m trying to downscale

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

subtext, 1 year ago in Hardware question

Without specific experience, my assumption would be no. Much like when plugging into a desktop computer’s motherboard HDMI port instead of the GPU HDMI port.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

OpticalMoose, 1 year ago in Hardware question

I just did a quick bing chat search (“does DRI_PRIME work on systems without a cpu with integrated graphics?”) and it says it will work. I can’t check for you because my CPUs all have graphics.

I CAN tell you that some motherboards will support it (my ASUS does) and some don’t (my MSI).

BTW, I’m talking about Linux. If you’re using Windows, there’s a whole series of hoops you have to jump through. LTT did a video a while back.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

tal, 1 year ago

While it might work in the OS, setting the OS up may be a pain (the installer may or may not work like that) and I strongly suspect that the BIOS can’t handle it.

I suspect that an easier route would be to use a cheap, maybe older, low-end graphics card for the video output and then using DRI_PRIME with that.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

OpticalMoose, 1 year ago (edited 1 year ago)

It’s probably a pain to set up in Windows. In Linux, it just works, there’s nothing to set up. I’m using it right now.

OP really should have mentioned their OS.

Edit: Actually, nevermind both my posts. I know DRI_PRIME works by using my APU for regular desktop activity, and routing discrete GPU output in whenever a game is being played. But I don’t know if it’s possible to make it use the dGPU all the time.

Even if it did, it would only work inside the OS, so if you had to boot into the BIOS for anything, you wouldn’t have a display. So for all intents and purposes, it wouldn’t really work.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

fuckwit_mcbumcrumble, 1 year ago in Hardware question

No. The video card is only wired to send video out through it’s ports (which don’t exist) and the ports on the motherboard are wired to go to the nonexistent iGPU on the CPU.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Appoxo, 1 year ago

Depends. You can send the signal in Windows through another port.
But if it works without an iGPU…

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

fuckwit_mcbumcrumble, 1 year ago

In windows you’re not sending the signal directly through another port. You’re sending the dGPU’s signal through the iGPU to get to the port.

On a laptop with nvidia optimus or AMD’s equivalent you can see the increased iGPU usage even though the dGPU is doing the heavy lifting. it’s about 30% usage on my 11th gen i9’s iGPU routing the 3080s video out to my 4k display.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Appoxo, 1 year ago

In that case nevermind.
Carry on.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

martini1992, 1 year ago in Question - ZFS and rsync

The drives in the zpool, are they SMR drives? Slow write speed and disks dropping out are a symptom of I remember correctly

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

isles, 1 year ago

They’re Seagate Exos, www.seagate.com/products/cmr-smr-list/ and appear to be CMR

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

martini1992, 1 year ago

So next I’d be checking logs for sata errors, pcie errors and zfs kernel module errors. Anything that could shed light on what’s happening. If the system is locking up could it be some other part of the server with a hardware error, bad ram, out of memory, bad or full boot disk, etc.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

bdonvr, 1 year ago

I don’t think they make SMR drives that big

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Valmond, 1 year ago (edited 1 year ago) in No posts when surfing through my i stance

On the support community on lemmy.world I get:

Socket timeout has expired [the url link, socket_timeout=10000]

Maybe I should just reboot oe something but I’d rather understand an eventual underlying problem…

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

key, 1 year ago in No posts when surfing through my i stance

19 has federation bugs. Mainly outgoing but I’ve also seen incoming federation gradually fail. Restart the docker container routinely (cron job) until fixes come out.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Valmond, 1 year ago (edited 1 year ago)

Ouch, thank you 🥲!

How often do you restart it/whats it doing/any idea what’s no longer working or why?

Good luck to the developers!

And thank you obviously!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Cyber, 1 year ago in Question - ZFS and rsync

I don’t have practical experience with ZFS, but my understanding is that it uses RAM a lot… if that’s new, it might be worth checking the RAM by booting up memtest (for example) and just ruling that out.

Maybe also worth watching the system with nmon or htop (running in another tmux / screen pane) at the beginning of the next session, then when you think it’s jammed up, see what looks different…

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

isles, 1 year ago (edited 1 year ago)

Awesome, thanks for giving some clues. It’s a new build, but I didn’t focus hugely on RAM, I think it’s only 32GB. I’ll try this out.

Edit: I did some reading about L2ARC, so pending some of these tests, I’m planning to get up to 64gb ram and then extend with an l2arc SSD, assuming no other hardware errors.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

sonstwas, 1 year ago (edited 1 year ago)

Based on this thread it’s the deduplication that requires a lot of RAM.

See also: wiki.freebsd.org/ZFSTuningGuide

Edit: from my understand the pool shouldn’t become inaccessible tho and only get slow. So there might be another issue.

Edit2: here’s a guide to check whether your system is limited by zfs’ memory consumption: github.com/openzfs/zfs/issues/10251

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Cyber, 1 year ago

Just another thought… Maybe just format the drives as a massive EXT4 JBOD (just for a temp test) and copy the data again - just to see if ZFS is the problem… maybe it’s something else altogether? Maybe - and I hope not - the USB source drive is failing after long reads?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

isles, 1 year ago

I believe there’s another issue. ZFS has been using nearly all RAM (which is fine, I only need RAM for system and ZFS anyway, there’s nothing else running on this box), but I was pretty convinced while I was looking that I don’t have dedup turned on. Thanks for your suggestions and links!

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

s38b35M5, 1 year ago in Question - ZFS and rsync

If you’re running TrueNAS, the replication feature was the smoothest and easiest way to move large amounts of data when I did it 18 months back. Once the destination location was accessible from the sending host, it was as simple as kicking off a snapshot, resulting in a fully usable replica on the receiving host. IIRC, IXsystems staff told me rsync can be problematic compared to the replication/snapshot system, as permissions and other metadata can be lost.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

BlueEther, 1 year ago in No posts when surfing through my i stance

Most of yhe subscribed communities seem to be working on your instance. When did you subscribe to permacomputing@lemmy.sdf.org? was it after the upgrade to 0.19.x?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Valmond, 1 year ago

Good question! I think it was after.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

loganb, 1 year ago in Question - ZFS and rsync

Just to make sure. Are you copying to your ZFS pool directory or a dataset? Check to male sure your paths are correct.

Push vs pull shouldn’t matter but I’ve always done push.

If your zpool is not accessible anymore after a transfer then there is a low-level problem here as it shouldn’t just disappear.

I would installe tmux on your ZFS system and have a window with htop running, dmesg, and zpool status running to check your system while you copy files. Something that severe should become self evedent pretty quickly.

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

grue, 1 year ago in Hardware question

This might be an X/Y problem. Why do you think you need HDMI output on a server?

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

AimlessNameless, 1 year ago

Because installing an OS without iLo, serial or video output would be a bit of a hassle

reply

report

activity

copy /kbin url

copy original url

open original url

Loading...

Federation

Status:

On | Off

Instances:

/m/selfhosted@lemmy.world

Threads (222)

Microblog (0)

All Content

People

Magazines

Collections

Magazine

selfhosted

@selfhosted@lemmy.world

A place to share alternatives to popular online services that can be self-hosted without giving up privacy or locking you into a service you don’t control.

Rules:

Be civil: we’re here to support and learn from one another. Insults won’t be tolerated. Flame wars are frowned upon.
No spam posting.
Don’t duplicate the full text of your blog or github here. Just post the link for folks to click.
Submission headline should match the article title (don’t cherry-pick information from the title to fit your agenda).
No trolling.

Resources:

> Any issues on the community? Report it using the report flag.

> Questions? DM the mods!

Created: 1 year ago
Owner: Sprite_tm
Subscribers: 1
Online: -

Moderators

Sprite_tm