There have been multiple accounts created with the sole purpose of posting advertisement posts or replies containing unsolicited advertising.

Accounts which solely post advertisements, or persistently post them may be terminated.

Major IT outage affecting banks, airlines, media outlets across the world

All our servers and company laptops went down at pretty much the same time. Laptops have been bootlooping to blue screen of death. It’s all very exciting, personally, as someone not responsible for fixing it.

Apparently caused by a bad CrowdStrike update.

Edit: now being told we (who almost all generally work from home) need to come into the office Monday as they can only apply the fix in-person. We’ll see if that changes over the weekend…

jedibob5 ,

Reading into the updates some more… I’m starting to think this might just destroy CloudStrike as a company altogether. Between the mountain of lawsuits almost certainly incoming and the total destruction of any public trust in the company, I don’t see how they survive this. Just absolutely catastrophic on all fronts.

NaibofTabr ,

If all the computers stuck in boot loop can’t be recovered… yeah, that’s a lot of cost for a lot of businesses. Add to that all the immediate impact of missed flights and who knows what happening at the hospitals. Nightmare scenario if you’re responsible for it.

This sort of thing is exactly why you push updates to groups in stages, not to everything all at once.

rxxrc OP ,

Looks like the laptops are able to be recovered with a bit of finagling, so fortunately they haven’t bricked everything.

And yeah staged updates or even just… some testing? Not sure how this one slipped through.

dactylotheca ,
@dactylotheca@suppo.fi avatar

Not sure how this one slipped through.

I’d bet my ass this was caused by terrible practices brought on by suits demanding more “efficient” releases.

“Why do we do so much testing before releases? Have we ever had any problems before? We’re wasting so much time that I might not even be able to buy another yacht this year”

GoofSchmoofer ,
@GoofSchmoofer@lemmy.world avatar

At least nothing like this happens in the airline industry

dactylotheca ,
@dactylotheca@suppo.fi avatar

Certainly not! Or other industries for that matter. It’s a good thing executives everywhere aren’t just concentrating on squeezing the maximum amount of money out of their companies and funneling it to themselves and their buddies on the board.

Sure, let’s “rightsize” the company by firing 20% of our workforce (but not management!) and raise prices 30%, and demand that the remaining employees maintain productivity at the level it used to be before we fucked things up. Oh and no raises for the plebs, we can’t afford it. Maybe a pizza party? One slice per employee though.

Munkisquisher ,

Yeah saw that several steel mills have been bricked by this, that’s months and millions to restart

gazter ,

Got a link? I find it hard to believe that a process like that would stop because of a few windows machines not booting.

TheBat ,
@TheBat@lemmy.world avatar

a few windows machines with controller application installed

That’s the real kicker.

drspod ,

Those machines should be airgapped and no need to run Crowdstrike on them. If the process controller machines of a steel mill are connected to the internet and installing auto updates then there really is no hope for this world.

TheBat ,
@TheBat@lemmy.world avatar

But daddy microshoft says i gotta connect the system to the internet uwu

IsThisAnAI ,

What lawsuits do you think are going to happen?

Nachorella ,

They can have all the clauses they like but pulling something like this off requires a certain amount of gross negligence that they can almost certainly be held liable for.

IsThisAnAI ,

Whatever you say my man. It’s not like they go through very specific SLA conversations and negotiations to cover this or anything like that.

Cryophilia ,

Forget lawsuits, they’re going to be in front of congress for this one

IsThisAnAI ,

For what? At best it would be a hearing on the challenges of national security with industry.

RegalPotoo ,
@RegalPotoo@lemmy.world avatar

Agreed, this will probably kill them over the next few years unless they can really magic up something.

They probably don’t get sued - their contracts will have indemnity clauses against exactly this kind of thing, so unless they seriously misrepresented what their product does, this probably isn’t a contract breach.

If you are running crowdstrike, it’s probably because you have some regulatory obligations and an auditor to appease - you aren’t going to be able to just turn it off overnight, but I’m sure there are going to be some pretty awkward meetings when it comes to contract renewals in the next year, and I can’t imagine them seeing much growth

Skydancer ,

Nah. This has happened with every major corporate antivirus product. Multiple times. And the top IT people advising on purchasing decisions know this.

SupraMario ,

Yep. This is just uninformed people thinking this doesn’t happen. It’s been happening since av was born. It’s not new and this will not kill CS they’re still king.

jedibob5 ,

Don’t most indemnity clauses have exceptions for gross negligence? Pushing out an update this destructive without it getting caught by any quality control checks sure seems grossly negligent.

Wooki ,

Testing is production will do that

TheBat ,
@TheBat@lemmy.world avatar

Not everyone is fortunate enough to have a seperate testing environment, you know? Manglement has to cut cost somewhere.

Blisterexe ,

Manglement is the good term lmao

ThrowawaySobriquet ,

I think you’re on the nose, here. I laughed at the headline, but the more I read the more I see how fucked they are. Airlines. Industrial plants. Fucking governments. This one is big in a way that will likely get used as a case study.

Cryophilia ,

The London Stock Exchange went down. They’re fukd.

rozodru ,
@rozodru@lemmy.ca avatar

It’s just amatuer hour across the board. Were they testing in production? no code review or even a peer review? they roll out for a Friday? It’s like basic level start up company “here’s what not to do” type shit that a junior dev fresh out of university would know. It’s like “explain to the project manager with crayons why you shouldn’t do this” type of shit.

It just boggles my mind that if you’re rolling out an update to production that there was clearly no testing. There was no review of code cause experts are saying it was the result of poorly written code.

Regardless if you’re low level security then apparently you can just boot into safe and rename the crowdstrike folder and that should fix it. higher level not so much cause you’re likely on bitlocker which…yeah don’t get me started no that bullshit.

regardless I called out of work today. no point. it’s friday, generally nothing gets done on fridays (cause we know better) and especially today nothing is going to get done.

Revan343 ,

explain to the project manager with crayons why you shouldn’t do this

Can’t; the project manager ate all the crayons

candybrie ,

Why is it bad to do on a Friday? Based on your last paragraph, I would have thought Friday is probably the best week day to do it.

Lightor ,

Most companies, money included, try to roll out updates during the middle of start of a week. That way if there are issues the full team is available to address them.

Bell ,

Don’t we blame MS at least as much? How does MS let an update like this push through their Windows Update system? How does an application update make the whole OS unable to boot? Blue screens on Windows have been around for decades, why don’t we have a better recovery system?

sandalbucket ,

Crowdstrike runs at ring 0, effectively as part of the kernel. Like a device driver. There are no safeguards at that level. Extreme testing and diligence is required, because these are the consequences for getting it wrong. This is entirely on crowdstrike.

bdonvr ,

The amount of servers running Windows out there is depressing to me

franklin ,
@franklin@lemmy.world avatar

The four multinational corporations I worked at were almost entirely Windows servers with the exception of vendor specific stuff running Linux. Companies REALLY want that support clause in their infrastructure agreement.

Avatar_of_Self ,

I’ve worked as an IT architect at various companies in my career and you can definitely get support contracts for engineering support of RHEL, Ubuntu, SUSE, etc. That isn’t the issue. The issue is that there are a lot of system administrators with “15 years experience in Linux” that have no real experience in Linux. They have experience googling for guides and tutorials while having cobbled together documents of doing various things without understanding what they are really doing.

I can’t tell you how many times I’ve seen an enterprise patch their Linux solutions (if they patched them at all with some ridiculous rubberstamped PO&AM) manually without deploying a repo and updating the repo treating it as you would a WSUS. Hell, I’m pleasantly surprised if I see them joined to a Windows domain (a few times) or an LDAP (once but they didn’t have a trust with the Domain Forest or use sudoer rules…sigh).

Blackmist ,

I’ve had my PC shut down for updates three times now, while using it as a Jellyfin server from another room. And I’ve only been using it for this purpose for six months or so.

I can’t imagine running anything critical on it.

ccdfa ,

Windows server, the OS, runs differently from desktop windows. So if you’re using desktop windows and expecting it to run like a server, well, that’s on you. However, I ran windows server 2016 and then 2019 for quite a few years just doing general homelab stuff and it is really a pain compared to Linux which I switched to on my server about a year ago. Server stuff is just way easier on Linux in my experience.

conciselyverbose ,

It doesn’t have to, though. Linux manages to do both just fine, with relatively minor compromises.

Expecting an OS to handle keeping software running is not a big ask.

sugar_in_your_tea ,

Yup, I use Linux to run a Jellyfin server, as well as a few others things. The only problem is that the CPU I’m using (Ryzen 1st gen) will crash every couple weeks or so (known hardware fault, I never bothered to RMA), but that’s honestly not that bad since I can just walk over and restart it. Before that, it ran happily on an old Phenom II from 2009 for something like 10 years (old PC), and I mostly replaced it because the Ryzen uses a bit less electricity (enough that I used to turn the old PC off at night; this one runs 24/7 as is way more convenient).

So aside from this hardware issue, Linux has been extremely solid. I have a VPS that tunnels traffic into my Jellyfin and other services from outside, and it pretty much never goes down (I guess the host reboots it once a year or something for hardware maintenance). I run updates when I want to (when I remember, which is about monthly), and it only goes down for like 30 sec to reboot after updates are applied.

So yeah, Linux FTW, once it’s set up, it just runs.

0xD ,

Well with your level of expertise you should probably not be running anything, to be honest :)

ji17br ,

Wow dude you’re so cool. I bet that made you feel so superior. Everyone on here thinks you are so badass.

Rinox ,

I dunno, but doesn’t like a quarter of the internet kinda run on Azure?

atocci ,
Rinox ,

I guess Spotify was running on the other 40%, as many other services

pewgar_seemsimandroid ,

so 40% of azure crashes a quarter of the internet…

unconfirmedsourcesDOTgov ,

Where did you think Microsoft was getting all (hyperbole) of their money from?

sasquash ,

never do updates on a Friday.

rozodru ,
@rozodru@lemmy.ca avatar

yeah someone fucked up here. I mean I know you’re joking but I’ve been in tech for like 20+ years at this point and it was always, always, ALWAYS, drilled into me to never do updates on Friday, never roll anything out to production on Friday. Fridays were generally meant for code reviews, refactoring in test, work on personal projects, raid the company fridge for beer, play CS at the office, whatever just don’t push anything live or update anything.

And especially now the work week has slimmed down where no one works on Friday anymore so you 100% don’t roll anything out, hell it’s getting to the point now where you just don’t roll anything out on a Thursday afternoon.

0x0 ,

And especially now the work week has slimmed down where no one works on Friday anymore

Excuse me, what now? I didn’t get that memo.

meanmon13 ,

Yeah it’s great :-) 4 10hr shifts and every weekend is a 3 day weekend

rozodru ,
@rozodru@lemmy.ca avatar

sorry :( yeah I, at most, do 3 days in the office now. Fridays are a day off and Mondays mostly everyone just works from home if at all. downtown Toronto on Mondays and Fridays is pretty much dead.

Blackmist ,

Yep, anything done on Friday can enter the world on a Monday.

I don’t really have any plans most weekends, but I sure as shit don’t plan on spending it fixing Friday’s fuckups.

sugar_in_your_tea ,

And honestly, anything that can be done Monday is probably better done on Tuesday. Why start off your week by screwing stuff up?

We have a team policy to never do externally facing updates on Fridays, and we generally avoid Mondays as well unless it’s urgent. Here’s roughly what each day is for:

  • Monday - urgent patches that were ready on Friday; everyone WFH
  • Tuesday - most releases; work in-office
  • Wed - fixing stuff we broke on Tuesday/planning the next release; work in-office
  • Thu - fixing stuff we broke on Tuesday, closing things out for the week; WFH
  • Fri - documentation, reviews, etc; WFH

If things go sideways, we come in on Thu to straighten it out, but that almost never happens.

spyd3r ,
@spyd3r@sh.itjust.works avatar

Never update unless something is broken.

Toribor ,
@Toribor@corndog.social avatar

This is fine as long as you politely ask everyone on the Internet to slow down and stop exploiting new vulnerabilities.

Ookami38 ,

I think vulnerabilities found count as “something broken” and chap you replied to simply did not think that far ahead hahah

huginn ,

For real - A cyber security company should basically always be pushing out updates.

sugar_in_your_tea ,

Exactly. You don’t know what the vulnerabilities are, but the vendors pushing out updates typically do. So stay on top of updates to limit the attack surface.

Major releases can wait, security updates should be pushed as soon as they can be proven to not break prod.

iknowitwheniseeit ,

BTW, I use Arch.

Nachorella ,

If it was Arch you’d update once every 15 minutes whether anything’s broken or not.

sugar_in_your_tea ,

I use Tumbleweed, so I only get updates once/day, twice if something explodes. I used to use Arch, so my update cycle has lengthened from 1-2x/day to 1-2x/week, which is so much better.

Passerby6497 ,

That’s advice so smart you’re guaranteed to have massive security holes.

kadotux , (edited )

Here’s the fix: (or rather workaround, released by CrowdStrike) 1)Boot to safe mode/recovery 2)Go to C:\Windows\System32\drivers\CrowdStrike 3)Delete the file matching “C-00000291*.sys” 4)Boot the system normally

StV2 ,

It’s disappointing that the fix is so easy to perform and yet it’ll almost certainly keep a lot of infrastructure down for hours because a majority of people seem too scared to try to fix anything on their own machine (or aren’t trusted to so they can’t even if they know how)

r00ty Admin ,
r00ty avatar

It might not even be that. A lot of places have many servers (and even more virtual servers) running crowdstrike. Some places also seem to have it on endpoints too.

That's a lot of machines to manually fix.

HaleHirsute ,

They also gotta get the fix through a trusted channel and not randomly on the internet. (No offense to the person that gave the info, it’s maybe correct but you never know)

kadotux ,

Yeah, and it’s unknown if CS is active after the workaround or not (source: hackernews commentator)

letsgo ,

True, but knowing what the fix might be means you can Google it and see what comes back. It was on StackOverflow for example, but at the time of this comment has been taken offline for moderation - whatever that means.

huginn ,

Yeah and a lot of corpo VPNs are gonna be down from this too.

ColeSloth ,

Meh. Even if it bricked crowdstrike instead of helping, you can just restore the file you deleted. A file in that folder can’t brick a windows system.

NaibofTabr ,

This sort of fix might not be accessible to a lot of employees who don’t have admin access on their company laptops, and if the laptop can’t be accessed remotely by IT then the options are very limited. Trying to walk a lot of nontechnical users through this over the phone won’t go very well.

AccountMaker ,

Yup, that’s me. We booted into safe mode, tried navigating into the CrowdStrike folder and boom: permission denied.

Cryophilia ,

Half our shit can’t even boot into safe mode because it’s encrypted and we don’t have the keys rofl

Munkisquisher ,

And people need to travel to remote machines to do this in person

thehatfox ,
@thehatfox@lemmy.world avatar

Might seem easy to someone with a technical background. But the last thing businesses want to be doing is telling average end users to boot into safe mode and start deleting system files.

If that started happening en masse we would quickly end up with far more problems than we started with. Plenty of users would end up deleting system32 entirely or something else equally damaging.

Ookami38 ,

I do IT for some stores. My team lead briefly suggested having store managers try to do this fix. I HARD vetoed that. That’s only going to do more damage.

nyarla ,

Yes but the recovery menu may have been configured to ask for administrative credentials, to prevent unwanted access to the computer, and then fixing the problem would take way longer.

CaptainBasculin ,

A driver failure, yeesh. It always sucks to deal with it.

cheeseburger ,
@cheeseburger@lemmy.ca avatar

I’m on a bridge still while we wait for Bitlocker recovery keys, so we can actually boot into safemode, but the Bitkocker key server is down as well…

gnutrino ,

Gonna be a nice test of proper backups and disaster recovery protocols for some organisations

huginn ,

Chaos Monkey test

WagnasT ,

Man, it sure would suck if you could still get to safe mode from pressing f8. Can you imagine how terrible that’d be?

a_postmodern_hat ,

You hold down Shift while restarting or booting and you get a recovery menu. I don’t know why they changed this behaviour.

Ookami38 ,

That was the dumbest thing to learn this morning.

resin85 ,

Not that easy when it’s a fleet of servers in multiple remote data centers. Lots of IT folks will be spending their weekend sitting in data center cages.

richtellyard ,

This is going to be a Big Deal for a whole lot of people. I don’t know all the companies and industries that use Crowdstrike but I might guess it will result in airline delays, banking outages, and hospital computer systems failing. Hopefully nobody gets hurt because of it.

RegalPotoo ,
@RegalPotoo@lemmy.world avatar

Big chunk of New Zealands banks apparently run it, cos 3 of the big ones can’t do credit card transactions right now

oderus ,

It was mayhem at PakNSave a bit ago.

emmanuel_car ,

In my experience it’s always mayhem at PakNSave.

deadbeef79000 ,

If anything, it’s probably calmed P’n’S down a bit…

index ,

cos 3 of the big ones can’t do credit card transactions right now

Bitcoin still up and running perhaps people can use that

I_Miss_Daniel ,

Bitcoin Cash maybe. Didn’t they bork Bitcoin (Core) so you have to wait for confirmations in the next block?

whotookkarl ,
@whotookkarl@lemmy.world avatar

Several 911 systems were affected or completely down too

Damage ,

The thought of a local computer being unable to boot because some remote server somewhere is unavailable makes me laugh and sad at the same time.

Munkisquisher ,

A remote server that you pay some serious money to that pushes a garbage driver that prevents yours from booting

lanolinoil ,
@lanolinoil@lemmy.world avatar

yeah so you can’t get Chinese government spyware installed.

Passerby6497 ,

Not only does it (possibly) prevent booting, but it will also bsod it first so you’ll have to see how lucky you get.

Goddamn I hate crowdstrike. Between this and them fucking up and letting malware back into a system, I have nothing nice to say about them.

Cryophilia , (edited )

It’s bsod on boot

And anything encrypted with bitlocker can’t even go into safe mode to fix it

Passerby6497 ,

It doesn’t consistently bsod on boot, about half of affected machines did in our environment, but all of them did experience a bsod while running. A good amount of ours just took the bad update, bsod’d and came back up.

rxxrc OP ,

I don’t think that’s what’s happening here. As far as I know it’s an issue with a driver installed on the computers, not with anything trying to reach out to an external server. If that were the case you’d expect it to fail to boot any time you don’t have an Internet connection.

Windows is bad but it’s not that bad yet.

__init__ ,

It’s just a fun coincidence that the azure outage was around the same time.

NaibofTabr ,

Wow, I didn’t realize CrowdStrike was widespread enough to be a single point of failure for so much infrastructure. Lot of airports and hospitals offline.

The Federal Aviation Administration (FAA) imposed the global ground stop for airlines including United, Delta, American, and Frontier.

Flights grounded in the US.

The System is Down

EncryptKeeper ,

Yeah my plans of going to sleep last night were thoroughly dashed as every single windows server across every datacenter I manage between two countries all cried out at the same time lmao

TheBat ,
@TheBat@lemmy.world avatar

How many coffee cups have you drank in the last 12 hours?

Cryophilia ,

I work in a data center

I lost count

TheBat ,
@TheBat@lemmy.world avatar

What was Dracula doing in your data centre?

KingThrillgore ,
@KingThrillgore@lemmy.ml avatar

Because he’s Dracula. He’s twelve million years old.

THE WORMS

jj4211 ,

I work in a datacenter, but no Windows. I slept so well.

Though a couple years back some ransomware that also impacted Linux ran through, but I got to sleep well because it only bit people with easily guessed root passwords. It bit a lot of other departments at the company though.

This time even the Windows folks were spared, because CrowdStrike wasn’t the solution they infested themselves with (they use other providers, who I fully expect to screw up the same way one day).

szczuroarturo ,

I always wondered who even used windows server given how marginal its marketshare is. Now i know from the news.

rottingleaf ,

Well, I’ve seen some, but they usually don’t have automatic updates and generally do not have access to the Internet.

Mjpasta710 ,

This is a crowdstrike issue specifically related to the falcon sensor. Happens to affect only windows hosts.

Eril ,

My current company does and I hate it so much. Who even got that idea in the first place? Linux always dominated server-side stuff, no?

Pringles ,

Marginal? You must be joking. A vast amount of servers run on Windows Server. Where I work alone we have several hundred and many companies have a similar setup. Statista put the Windows Server OS market share over 70% in 2019. While I find it hard to believe it would be that high, it does clearly indicate it’s most certainly not a marginal percentage.

jj4211 ,

I’m not getting an account on Statista, and I agree that its marketshare isn’t “marginal” in practice, but something is up with those figures, since overwhelmingly internet hosted services are on top of Linux. Internal servers may be a bit different, but “servers” I’d expect to count internet servers…

Delta_V ,

Not too long ago, a lot of Customer Relationship Management (CRM) software ran on MS SQL Server. Businesses made significant investments in software and training, and some of them don’t have the technical, financial, or logistical resources to adapt - momentum keeps them using Windows Server.

For example, small businesses that are physically located in rural areas can’t use cloud based services because rural internet is too slow and unreliable. Its not quite the case that there’s no amount of money you can pay for a good internet connection in rural America, but last time I looked into it, Verizon wanted to charge me $20,000 per mile to run a fiber optic cable from the nearest town to my client’s farm.

Semi_Hemi_Demigod ,
@Semi_Hemi_Demigod@lemmy.world avatar

Did you feel a great disturbance in the force?

rottingleaf ,

How’s it going, Obi-Wan?

jayandp ,
iAvicenna ,
@iAvicenna@lemmy.world avatar
Sylence ,
@Sylence@lemmy.dbzer0.com avatar

Yep, stuck at the airport currently. All flights grounded. All major grocery store chains and banks also impacted. Bad day to be a crowdstrike employee!

iknowitwheniseeit ,

My flight was canceled. Luckily that was a partner airline. My actual airline rebooked me on a direct flight. Leaves 3 hours later and arrives earlier. Lower carbon footprint. So, except that I’m standing in queue so someone can inspect my documents it’s basically a win for me. 😆

AnUnusualRelic ,
@AnUnusualRelic@lemmy.world avatar

An offline server is a secure server!

CanadaPlus ,

Honestly my philosophy these days, when it comes to anything proprietary. They just can’t keep their grubby little fingers off of working software.

At least this time it was an accident.

invisiblegorilla ,

Ironic. They did what they are there to protect against. Fucking up everyone’s shit

Telorand ,

Maybe centralizing everything onto one company’s shoulders wasn’t such a great idea after all…

Excrubulent ,
@Excrubulent@slrpnk.net avatar

Wait, monopolies are bad? This is the first I’ve ever heard of this concept. So much so that I actually coined the term “monopoly” just now to describe it.

joostjakob ,

Someone should invent a game, that while playing demonstrates how much monopolies suck for everyone involved (except the monopolist)

KingJalopy ,

And make it so you lose friends and family over the course of the 4+ hour game. Also make a thimble to fight over, that would be dope.

Telorand ,

I mean, I’m sure those companies that have them don’t think so—when they aren’t the cause of muti-industry collapses.

jaybone ,

Yes, it’s almost as if there should be laws to prevent that sort of thing. Hmm

jaybone ,

The too big to fail philosophy at its finest.

StaySquared ,

CrowdStrike has a new meaning… literally Crowd Strike.

nintendiator ,

Since when has any antivirus ever had the intent of actually protecting against viruses? The entire antivirus market is a scam.

recapitated ,

Clownstrike

lando55 ,

Crowdshite haha gotem

WhatAmLemmy ,

CrowdCollapse

misk ,
@misk@sopuli.xyz avatar

My work PC is affected. Nice!

wreckedcarzz ,
@wreckedcarzz@lemmy.world avatar

Plot twist: you’re head of IT

R00bot ,
@R00bot@lemmy.blahaj.zone avatar

Same! Got to log off early 😎

Munkisquisher ,

Dammit, hit us at 5pm on Friday in NZ

BigRedUndead ,

4:00PM here in Aus. Absolutely perfect for an early Friday knockoff.

Magnolia_ ,

Noice!

ililiililiililiilili ,

My dad needed a CT scan this evening and the local ER’s system for reading the images was down. So they sent him via ambulance to a different hospital 40 miles away. Now I’m reading tonight that CrowdStrike may be to blame.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • [email protected]
  • random
  • lifeLocal
  • goranko
  • All magazines