What are the long-term consequences of the many massive password leakages which have occurred. The upshot? Hackers are getting MUCH better at cracking passwords, and "clever" techniques can no longer be regarded as safe.
There has never been an article that received so much tweeting to my attention as a recent article in Ars Technica. Their cybersecurity guy, Dan Goodin, did a very nice, very comprehensive, four-page piece about sort of a snapshot on where we stand with password hacking.
Now, he didn't really draw any conclusions. We will,
Well, it turns out having access to more than 100 million actual in-use passwords, which is what are now available, freely downloadable over the Internet, having those actual passwords has changed the complexion of password-cracking. All of those things that we sort of "wink, wink" about doing, like changing alphabetic characters into the numbers that they resemble, or those sorts of things - and we'll talk about what those are because they've all been analyzed now. What's happened is, and this is another thing you would expect, over time there's evolution of the technology. The cracking is really getting better.
If anybody still thinks that they're being cute with the way they're designing passwords, I hope to be able to increase their security further by putting them off of those habits because they're just not working any longer.
I started, after reading article, more religiously using the generator built into LastPass, and I set it for 12 characters and special characters mixed and everything. Although I'm a little disappointed. I opened an account at a new bank the other day, and I was actually quite disappointed. First of all, I could only use, I can't remember what it was, 12 or 13. After that it stopped. I couldn't use more characters, which I know means that they're not hashing passwords, or it wouldn't matter. And second, that they wouldn't allow me to use special characters.
So Dan's article, or his security blog posting, was "Passwords Under Assault." Anyone who wants to read the entire four-page piece can just Google "Passwords Under Assault," and it's the first link that comes up. And he titled it, "Why passwords have never been weaker and crackers have never been stronger." Which sort of reminds us of the famous Bruce Schneier quote, where he noted years ago that attacks never get weaker, they only get better. And Dan said, "Thanks to real-world data, the keys to your digital kingdom are under assault."
So essentially what's happened is there have been consequences, there's evolutionary effects that we would expect, that is, passwords are very tasty fruit for hackers to try to grab. And, unfortunately, websites have proven themselves surprisingly inept at managing user logon credentials. We're routinely, actually, covering the major breaches in passwords. It was just a couple months ago, in June, that LinkedIn famously lost control of 6.5 million passwords. What's happened is, as a consequence of those and other breaches - there was another major gaming site that lost, I think it was 32 million of their user passwords all at once. And so what's happened is it's moved the hackers' understanding of what passwords people are using from theoretical, like the planets of the Klingon universe, to the actual. And we've learned weird things, like "monkey" is used unusually often.
For some bizarre reason, lots of people chose the word "monkey." Well, nobody would guess that. So it's only by looking, doing statistical analysis of actual password databases, that these sorts of things come out. Another thing that is often occurring is that people capitalize words, instead of them being all uppercase or all lowercase. They tend to - first character is capital, then the rest of them are lowercase. Many times people create passwords which are word followed by four numbers, like their date of birth, for example, or 1492, something that is memorable to them, but they think, oh, this is clever.
So the problem with patterns, like the idea of eight characters where the first one is uppercase and the other ones are lowercase and then, for example, a four-digit number, if you made it five digits, that is, if you broke the pattern, then you get security. If you don't, what analysis of databases have shown hackers is that, in the same way that for some bizarre reason the password "monkey" gets chosen way more often than randomly, people are using eight-character alphabetic words followed by four-character numbers, I mean, exactly that pattern. And so what happens is, if that's known, or even just believed, that is, if it's tried for, then it completely changes the math.
For example, say that you didn't know what a 12-character password was, and that it could use the full alphabet and special characters and numbers. Well, any one character, as we've talked about many times, could have approximately 96 different possibilities. So 12 of those would be 96^12, since it's 96 for the first character, 96 for the second character, 96 for the third. But we also know that that really only applies if the 12 characters are really random. They could be anything. And 96 raised to the power of 12 is 612.7 times 10^21. Huge number. That's 612,700 billion billion possibilities for 12 characters.
But people don't choose their 12 characters randomly. And what statistical analysis of these captured online databases have shown hackers is that, as I was saying, for example, there's a huge preponderance of first letter is capitalized, the next seven are lowercase alpha, and then they're followed by four digits that is, like, a year. It's something generally in the 20th Century. So what that does is that dramatically changes the math. Now that means you only have 26^8 power since you have only - you know you're going to have capital A through capital Z, then lowercase A through Z for the next seven characters. Then say that you didn't even constrain it to a modern-era year, but you just did 0000 to 9999, so now you're at 26^8 times 10,000. Well, that's only 2.08 million possibilities, compared to 612,700 billion billion possibilities.
So the point is that, what hackers have done is, by analyzing the actual databases of captured passwords, they have found all of these tendencies. It is absolutely no longer the case that we can do anything clever. We cannot use, like, "Prince$$," where we change the S's into dollar signs. They got that. You can't use...
You can't turn your E's into 3's. They got that, too. I mean, all of the kinds of things that people typically do, thinking that they're being clever, trying to sort of - essentially we're trying to compromise. We're trying to come up with something that's sort of ours and that we think nobody else is going to do. Well, surprisingly, because we're all human, and we have similar experience, we're generally doing the same things, it turns out. When you statistically look at 100 million passwords, there aren't that many possible things that people can do that meet these criteria. And of course there's certainly some communication among people. Not everyone is coming up with these things on their own. They're talking to their friends about, oh, what do you do, how do you make passwords? And so they share some of their ideas.
Oh, the site was RockYou.com which, in 2009, through a SQL injection attack, lost their 32 million plaintext passwords, which all went into this huge 100 million-plus hopper for statistical analysis.
So the other thing that has happened is, and this is the evolutionary part, not only are hackers really focusing on this, but as we know, there's been huge movement in technology over time. We've talked about how GPUs, the graphic processing units that are now powering our graphics cards in order to give us the 3D realism and high frame rate performance that we want for gaming, those can be repurposed to create essentially cryptographic pipelines which are able to run cryptographic algorithms at very high speed.