WIP

Similar todos

Valerian Saliou

PRO

@valerian

experience 6 hours GPU total downtime at Vultr, knocking down all

#mirage AI services, will migrate to Scaleway cause Vultr are 🤡

2024-08-08 06:23:45 UTC

See similar todos

No replies yet

Valerian Saliou

PRO

@valerian

spend 1h debugging some k8s complexity-induced issue after a node failure at cloud provider for

#crisp mirage gpus

2024-02-15 01:18:28 UTC

See similar todos

No replies yet

Valerian Saliou

PRO

@valerian

fix broken nvidia a100 gpu server at vultr which has put

#crisp mirage down for whole night due to being out-of-stock and no replacement physical node could be allocated

2023-11-15 08:02:56 UTC

See similar todos

No replies yet

Scott Robinson

@ScottWRobinson

Fixing downed servers for the 3rd time in a week and tweet about it x.com/ScottWRobinson/status/1…

#blocksender

2024-03-22 16:12:15 UTC

See similar todos

Valerian Saliou

PRO

@valerian

try to upgrade

#crisp Mirage AI Kuberbetes cluster version, miserably fail at it cause of broken GPU image NVIDIA drivers from the cloud provider which stalled the upgrade process, destroy cluster and rebuild all infrastructure from scratch all evening 🥲

2024-02-10 13:42:42 UTC

See similar todos

Lis D

PRO

@lis

Contact cloudways for a WTF on a small site... apparently hit another glitch. Hitting a lot of bugs this week - the amount of time support admin is taking me lately is out of control#admin

2024-03-02 11:13:13 UTC

See similar todos

No replies yet

Valerian Saliou

PRO

@valerian

wake up from mayhem with my NVIDIA A16s, 70% of them were down all night and only fixed this morning by Vultr. A40 for large LLM still up fortunately

#crisp

2024-04-04 07:13:49 UTC

See similar todos

No replies yet

Bjarn Bronsveld

@bjarn

get cloudflare incident acknowledged that plagued me for 4 days

#spectate

2023-08-28 19:23:33 UTC

See similar todos

No replies yet

Johnny

@johnnymakes

appear to have fixed a production latency by tweaking resources + procfile... but it has been a day of tinkering 💀

#wfhland

2020-08-04 22:13:12 UTC

See similar todos

No replies yet

Matthias

@Matthias

finally fix bug that caused trouble for 1-2 days :)

#indie

2023-07-20 11:02:53 UTC

See similar todos

No replies yet

Rik Schennink

PRO

@rikschennink

#pintura fix issue with background flickering on certain GPUs

2019-08-27 10:58:12 UTC

See similar todos

No replies yet

Bjarn Bronsveld

@bjarn

Experience hetzner network issues again, still down even after they acknowledged it on phone. 6 hours and counting.

2023-11-15 22:05:20 UTC

See similar todos

No replies yet

Cat Stickler

PRO

@cat

Client invoice issues ~for 13 hours~ 🫠

2023-08-07 19:51:09 UTC

See similar todos

No replies yet

levelsio

PRO

@levelsio

debug with the new GPU API for hours

#avatarai

2022-11-10 20:15:07 UTC

See similar todos

No replies yet

Joao Aguiam

@joao

Spent big part of the day with a production issue that couldn't be hot-fixed quickly while an event with 350+ people was happening. Javascript could be more permissive sometimes 🤦‍♂️