0 - 2005

Yay, ddrescue saves the day

It turns out GNU ddrescue did just the trick, and I was able to get 98% of the data off that dead hard drive. Now to set up a new Windows system for these poor bastards (why, oh why, can’t everyone run *nix?). I need a cup of coffee first.

Sadness and remorse for the worst acts of human history

Wow, I worked myself up at this late hour thinking about issues related to the morality of warfare (or lack thereof, as it were), and in particular to Hiroshima and Nagasaki. A particularly naive /. poster (is the adjective “naive” redundant here?) pointed out how we can often forget that “civilians can be enemy combatants,” and he mentions a Mitsubishi plant in Nagasaki, as if that were most casulaties occured in Nagasaki (nonsense of course, since over 100,000 deaths occured in that unfortunate city). He then compares America to a police station and Japan to a “man who runs at the station with a bat,” and concludes that it is therefore “all the man with the bat’s fault.” If that reasoning weren’t pathetic enough, he provides another justification for dropping the bombs: that Japan would have done the same, but to New York! Ah, the things I could teach the average /. writer about argumentation. I really hope these aren’t the same folks I meet in the workplace of my future. Continue reading Sadness and remorse for the worst acts of human history

Gangs of America: The History of Corporate Power

I am totally engrossed in this book at the moment. My Dad gave it to me to read, and I flew through about 100 pages today while allowing the aforementioned backup processes to run.

Among other such gems you discover in this book are these facts:

The Boston Tea Party wasn’t so much about taxation without representation or hatred for the British crown as it was about anticorporatism. Colonialists were worried about the East Indies Trade Company moving into the colonies and taking their business. Colonists used to see “globalization” for what it was even back then, calling the East Indies Trade Company a vile institution which “enslaves one half of the human race to enrich the other half.”
The founding fathers were thoroughly against the idea of the corporation, and thought that large monied enterprises were the greatest threat to democracy, as they could subvert the political system if they were not placed in check.
Even Abraham Lincoln and Thomas Jefferson saw these threats, and they were themselves supported by Adam Smith, the economist whose theories are nowadays oft-used in justifying corporate existence.
During the days of robber barons, one man essentially created the modern corporation by lobbying the government for the right to intercompany ownership, namely one corporation owning stock in another. Through this law, he established “holding companies,” whose only purpose was to hold stock in other companies. And via holding companies, he was able to take over other corporations and place his corporations outside of any regulation by the state governments. Furthermore, this same man whose foresight gave him great wealth, also provides a nice historical example of corporate greed that is unchecked by government power: he managed to buy up newspapers to fire editors who didn’t print what he liked, and he managed to buy politicians by offering them posts on the board of his major corporate entities.
Corporations were not always this way. Corporations do not have to be separate legal entities, completely unaccountable to any of its investors, able to integrate across industries by gobbling up other corporations, able to subvert democracy through political contributions, and able to ruin people’s lives through “externalities.” Once upon a time, American society and American government knew corporations were dangerous, and knew they needed to be carefully monitored and controlled. What happened?

I hope this book answers that last question.

GNU ddrescue and dd_rescue and dd_rhelp, what the?

Wow. I hate when shit like this happens.

Apparently there are three tools out there to help with the same thing. First, there’s dd_rescue, the tool I was using earlier (which ships with Ubuntu in a debian package called… ddrescue). Then, there’s dd_rhelp, a shell script which is a frontend to ddrescue and which implements a rough algorithm to minimize the amount of time waiting on bad block reads.

Then, there’s GNU ddrescue, which is a C++ implementation of dd_rescue plus dd_rhelp.

I only just realized this and so now I’ve compiled a version of GNU ddrescue to pick up my recovery effort. It’ll probably help with one of the partitions that seems particularly messed up.

So far the nice thing about GNU ddrescue is that it seems faster, and more responsive. Plus, it has a real logging feature, such that if you enable it and then CTRL+C the app, you can restart it and it’ll automatically pick up where it left off.

UPDATE: wow, good thing I switched. GNU ddrescue is significantly faster just in terms of raw I/O performance. I jumped from 4GB of this partition being rescued (which took 30 minutes with dd_rescue) to 6GB in the last ten minutes. It seems at least 3x faster. I also like that the GNU info page describes the algorithmic approach in-depth.

Fried hard disk ruins weekend

So, one of my employers ended up with a fried hard disk, for the second time in a row. The main reason is that the PC this HD is contained in sits in a corner with little-to-no airflow.

In order to recover the drive, I am actually taking a different approach from my last recovery effort, mainly by necessity. This disk is seriously damaged–lots of bad sectors, and its partitions are not readable by any NTFS driver, be it Microsoft’s or the open source one. This makes simply using the wonderful R-Studio tool I used last time currently impossible, due to the fact that it won’t even see the drive properly within Windows, and will hang all over the place.

Indeed, what I needed to do is drop down a layer of abstraction: away from filesystems, and into blocks and sectors. Unfortunately, in the Windows world this drop down is difficult, so I had to use my Linux laptop to make this jump.

I found a wonderful tool to help me out called dd_rescue, which is basically a dd with the added features of continuing on error, allowing one to specify a starting position in the in/out files, and the ability to run a copy in reverse. These features allow one to really work around bad sectors and even damaged disk hardware to get as much data as possible out.

Unfortunately, the use of this tool was encumbered by my laptop’s relatively simple bus design. Apparently, if I stuck two devices on my USB bus (like two HDs I was using for this process), the bus would slow to a crawl, and the copy would move along at an unbearble 100kB/sec. I tried utilizing firewire and USB together, but got only marginal improvements. What befuddles me is that in the end, the fastest combination I could come up with is reading from the Firewire enclosure with my laptop and writing to the firewire enclosure of my desktop across the LAN utilizing Samba. Very strange indeed. Now my performance is more like 6MB/sec, factoring in all the breaks dd_rescue takes when it encounters errors. I have 6GB of the more critical partition written, but it’ll probably take a couple hours to have a big enough chunk that I can test R-Studio’s recovery of it.

The only reason I’m even writing about this is because I find it hilarious how many layers of abstraction I am breaking through to do a relatively low-level operation. Think about it:

My broken IDE drive is converted to Firewire by a Firewire-IDE bridge.
My Firewire PCMCIA adapter is allowing my notebook to take in that connection.
The Linux kernel is allowing firewire to be accessed via various ieee1394 ohci drivers.
The Linux kernel is abstracting the firewire disk as a SCSI disk, using emulation.
The SCSI disk is being read by dd_rescue and written to a file, which exists in the path /mnt/smb/image/sdb5.
That path seems local, but is actually a mount point. That mount point seems physical but is actually handled by a Samba driver.
The writes by dd_rescue to that image file are being sent through the kernel’s TCP/IP stack, and flying through my switch, and being accepted by Windows XP’s network stack.
Windows XP is writing that data to an NTFS drive, which is itself connected by a Firewire-IDE bridge (and therefore all the above steps’ equivalents for Windows apply).

I am surprised with that many layers, that this copy is even working. I really should have just taken a machine apart and connected these drives directly by IDE, to save myself a few layers.

Cindy Sheehan smeared by O’Reilly

I really would expect nothing less of my unfortunate neighbor, Bill O’Reilly. Apparently on last night’s show he smeared Cindy Sheehan, the grieving mother that’s been glavanizing the Left as of late, on his wonderful show, the O’Liar Factor. Apparently we still live under McCarthy, where it’s not who you are, but with whom you associate, that determines whether you are a “radical,” or “commie bastard.”

How do people still watch his show?

On the security of an e-mail address

I was just looking at my strange contact page, where I list my e-mail address using a sort of obfuscated string with _ and * characters mixed in. And then I saw someone’s e-mail address listed on the web with the following format:

user () domain ! com.

At that point, I started to think about all the other variations of this spam-protection trend I’ve seen, like user ///at\\\ domain ///dot\\\ com, and I realized that many of us are taking the wrong approach. Myself included. For example, the one above could easily be found by knowing the common TLDs and working backwards from there. If I find a “com”, “org” or “net,” and then look at the string tokens which occur before, I can assume any string of valid characters (say, alphanumeric characters) which is followed by whitespace or invalid chars (like parentheses and exclamation points) can be taken as a valid part of the address. From there, we can easily split user () domain ! com into its proper parts, and construct the e-mail. This same approach works for say, user ///at\\\ domain ///dot\\\ com.

So what I realized is perhaps it would be better to insert other e-mail addresses in there that might get picked up as part of an e-mail address, even in a heuristic scan. For example,

user __at__ domain :: NOT [email protected] :: __dot__ com

That seems more secure to me 😉 Another approach is just to prevent the TLD from being a complete token. This is the approach I took. Turn com into c_o__m or something, and you’re less likely to get picked up in a scan that is searching for “com”.

Microsoft’s anti-competitive behavior

This /. article has responses from Microsoft Linux Lab manager Bill Hilf. I responded to this post from a Microsoft employee. My response follows. Continue reading Microsoft’s anti-competitive behavior

N-way parallel mail retrieval with getmail and bash

I wrote a pretty sweet script tonight. It parallelizes the getmail retrieval process, while still printing prefixes so I know which accounts download which messages. This means that instead of my mail fetching process taking sum(i₁,…,i_n), where i is the length of time for a given mail retrieval, my fetching process now takes max(i₁,…,i_n).


#!/bin/sh
GETMAIL='python2.3 -Wignore /usr/bin/getmail'
unwanted() {
  grep -E -v '(Copyright|getmail|Simple)';
}

echo "N-WAY GETMAIL RETRIEVER SCRIPT:"
$GETMAIL \
  --rcfile=/etc/getmail/account1 \
  2>&1 | sed -e \
  's/.*/account1................: &/g' \
  | unwanted &

$GETMAIL \
  --rcfile=/etc/getmail/account2 \
  2>&1 | sed -e \
  's/.*/account2................: &/g' \
  | unwanted &

...

$GETMAIL \
  --rcfile=/etc/getmail/accountN \
  2>&1 | sed -e \
  's/.*/accountN................: &/g' \
  | unwanted &

wait

Server “secured”

I stopped by my web host, Peer1, in order to check out my server and see if I could come up with an explanation to yesterday’s downtime. Nothing looked fishy, but it seems likely it was the stupid power cable again. So, to completely eliminate that variable, I hooked two metal ties into the grill of the nearby fans and wrapped them around the power cable’s plug. Now when I yank on the cable, instead of it coming out, it pulls the whole fucking server across the racking slide. That’s right, sysadmin soup du jour: metal ties as power cable securer.