I am amazed that programming languages (well, the typical ones, at least) don’t make it easier to manipulate files.

A common way files are read in C is to create a struct that matches the file format and to call fread to read the file into it. Isn’t that easy enough?

Not really. This approach is fine in isolation, but it’s non-portable:

  • Different architectures or compilers may lay out structs differently. Your compiler sometimes can choose to add padding bytes to guarantee alignment requirements. Luckily compilers aren’t allowed to do it willy-nilly, and some compilers offer #pragmas to control this.
  • Different architectures have different integer sizes. Appropriate typedefs often can mitigate this, but it’s still imperfect since it requires a small porting effort.
  • Different architectures use different endianness. If a file format is defined to store integers in big-endian byte order but your architecture is little-endian, then if you read the bytes out of the struct without first swapping the bytes you’ll end up with the wrong value.

The typical way to solve these problems is to read a file a byte at a time, copying each byte into the appropriate location within the struct. This is tedious.

Programming languages should provide a mechanism for programmers to declare a struct that must conform to some external format requirement. Programmers should be able to attribute the struct, prohibiting implicit padding bytes and specifying what the size and endian requirements are for each field. For example:

file_struct myFileFormat
{
    uint8 version;
    uint8[3]; // Reserved.
    uint32BE numElements;
    uint32BE dataOffset;
};

When retrieving fields from such a struct, the compiler should generate code that automatically performs the necessary byte swaps and internal type promotions.

One-dimensional Cube

January 4, 2005 at 3:21 am (PT) in Rants/Raves, Reviews

On the other side of the movie spectrum, I watched the rather disappointing Cube last night.

I try not to write about the nitpicks I have with movies, especially with non-mainstream movies like Cube, because who really wants to read someone complaining? But I just can’t help myself.

The bad math soured a lot of the experience. For example (spoilers ahead):

(more…)

Brad Bird is not a gun

January 4, 2005 at 3:12 am (PT) in Rants/Raves

Well, I finally watched The Incredibles. Even though I found it to be totally predictable, it managed to be really fun along the way. Who cares if it’s full of clichés when they’re done that well?

I think it’s probably the best movie I’ve seen all year (okay, that’s not saying much… best movie I’ve seen in the past year, then), and possibly it might turn out to be one of my all-time favorites.

Brad Bird is my new hero. (He made one of my other all-time favorite movies, The Iron Giant.) He’s not a gun—he’s Superman!

Software piracy

September 17, 2004 at 11:49 pm (PT) in Rants/Raves

Apparently some software developers think it’s a good anti-piracy measure to sabotage a user’s computer. There’s one scheme that makes pirated games buggier, and there are some programs that delete your data if pirated.

That’s just retarded.

  • You can’t lose something you never had. Anti-piracy advocates always love to claim that the software industry loses billions of dollars each year to piracy. Did developers actually lose money? Did nasty pirates break into their bank accounts and withdraw all their cash? The only thing that companies can claim to lose are potential sales. Do they really believe that every pirate would have purchased their software if it were uncrackable?

    (Or maybe what the advocates really mean is that the software industry wastes oodles of money each year developing new and ultimately futile anti-piracy schemes. That I’d believe.)

  • Who says there’s no such thing as bad publicity? If you want your software to act buggy if it thinks it’s pirated, go ahead, but don’t expect word-of-mouth reviews to be positive. If anything, the presence of bugs, intentional or not, will make people less likely to give you money. Will people be able to distinguish real bugs from the deliberate copy-prevention bugs? Will people think your game is fun if its physics are amiss and their bullets keep missing?

    And if you destroy people’s data, forget it. You’ve just guaranteed that you never, ever will see a dime from them or from anyone within complaining distance. Congratulations, you’ve alienated potential customers.

  • Free advertising is better than bad publicity. To some degree, piracy can help software companies. Piracy can help build name recognition. It can lead to de facto standards. Would Microsoft Windows and Microsoft Office be as widespread if users didn’t copy them from their workplaces? Piracy levels the cost of Windows down to $0, and at that price, it’s hard for Windows not to outcompete Linux. Microsoft doesn’t gain money directly from this, but it does gain market- and mind-share. As other examples, look at Adobe Photoshop, 3DS Max, and Maya.

    Even if you could make your software uncrackable, are your competitors’ products uncrackable too? Do you really want to drive your users—paying or not—to them?

    Software developers should be happy that people are using their products at all.

In the end, I think the best defense is a strong offense. The best anti-piracy measure is to make a good product that’s worth buying. People take pride in doing things that are worthwhile. (And for goodness’ sake, make sure your product is easy to buy!)

Boy, do I hate Outlook.

August 28, 2004 at 3:20 pm (PT) in Rants/Raves, Usability

When I left Sony, one of the little things that pleased me was that I no longer had to deal with Microsoft Outlook and a finicky Microsoft Exchange server.

Well, it turns out VMware also uses Outlook and Exchange, and I am reminded how much I hate them.

  • Inflexible rules. The rules don’t allow boolean expressions, and what constructs they do allow are crippled. Want to match email that contains both “foo” and “bar” in the subject line? Forget it.

  • Rules aren’t filters. Outlook’s rules don’t act as filters. In a sane system, each rule is applied in order to each received email message; if a message can be matched to multiple rules, the first rule gets precedence. Not so in Outlook. Outlook displays an error message informing the user of the rule conflict, and then silently disables the conflicting rule. A single message triggering a rule conflict warrants turning off the rule completely? How helpful.

    I’ve been told that I’m supposed to add a “Stop processing more rules” command to each of my Outlook rules to avoid this stupidity, but apparently Outlook 2000 has a bug where “Stop processing more rules” really means “stop processing rules for email, period.”

  • User-friendly search gone awry. Microsoft tried to make Outlook’s Find function user-friendly by making it task-centered, but they made it too task-centered. They focused only on some common tasks and made the less common (but not rare!) tasks hard. The Find dialog box offers the ability to search for email from specific time periods, such as this week, last week, the past 7 days, this month, last month. Some of the “user-friendly” options require a bit of cognitive thought (does “this week” start on Monday or Sunday? does “last month” mean the past 30 days or since the first day of the previous month?), and if you want to search any other time periods (the past two weeks, the past two months, the month of March, from the last week of January through the first week of February, …), forget it, you have to use Outlook’s bewildering Advanced Find dialog. To specify a date range, you must use the freeform text field that provides no clues how the input should look. (Outlook actually is surprisingly intelligent about parsing the input into this field, but it’s still intimidating and error-prone. How can you really tell that Outlook parsed the input correctly?)

  • Network and fault intolerant. Outlook is so poorly designed that if the Exchange server goes down (which seems to happen fairly frequently), its entire UI is unusable. You can’t even access mail stored on your local disk. Outlook eventually will figure out that the Exchange server isn’t responding—if you’re willing to wait 10 minutes. Even better, when the Exchange server comes online again, as far as I can tell, you can’t tell Outlook to reconnect; you must quit Outlook and restart it.

  • No source for you! Outlook inexplicably makes it really hard to view the message headers to Internet email.

These are all issues I’ve had with Outlook 2000. In Microsoft’s defense, maybe they’ve fixed some or all of them in later versions. On the other hand, it’s sad that they were issues in the first place, because Microsoft’s free, light-weight (and independently-developed) email client—Outlook Express—has had friendlier search and filtering capabilities for years before!

Verizon Witless (Part 2)

May 28, 2004 at 10:50 pm (PT) in Personal, Rants/Raves

Two weeks of arguing with Verizon over their early termination fee hasn’t gotten me anywhere. They refuse to waive it.

“It follows the terms of your contract, so it’s a valid charge,” they say.

“I’m not saying it’s not valid; I’m saying it’s not right. So you’d rather have $175 from me now than have me ever do business with you again? You are aware I’ve paid Verizon over $800 during the past two years and used barely any of my minutes, right?”

“It’s a valid charge.”

It seems that more and more people these days are drones who can’t think for themselves and who are unwilling to stick their necks out.

Verizon did indicate, however, that I could get the fee waived if I reactivated my old account for the remaining duration of my contract with no additional commitment. The catch? To reactivate my old account, they’d need to use my original cell phone number, which I already had ported to Sprint. Of course, it’d be too easy if they just let me pay them directly for my remaining two weeks, as I had offered originally. No, their highly advanced computer system required an active account with the original number.

I relented and decided to play their stupid game. Not wanting to get an early termination fee on my Sprint account too, I figured I could change my Sprint number, free up my original, and then reactivate my Verizon account.

After I changed my Sprint number, Verizon informed me that although my original number was now available, it was Sprint’s property and that Verizon couldn’t touch it. To transfer it back to Verizon, they said, I’d need to have a Sprint account using it. Unyielding computer systems win again over common sense.

To add insult to injury, Sprint won’t even let me have my original number back. (Another victory for the computers.) So now I have Verizon’s $175 charge, and I’ve lost my original, four-year old number for nothing. (Not that anyone calls me anyway.) Had I not ported my number in the first place, I probably could have reactivated my Verizon account and had that fee waived. Who knew number portability would bite me in the ass like this?

And did I mention that the Sprint phone was a gift?

I give up. I suppose I’ll pay the stupid fee.

Verizon Witless

May 16, 2004 at 1:34 pm (PT) in Personal, Rants/Raves

A few weeks ago, I got a new cell phone and switched from Verizon Wireless to Sprint.

I didn’t remember when I purchased my old phone or what the terms of my service agreement were, so I went to Verizon’s website to look up my account information first.

Current calling plan:
America’s Choice 300 General .00 Long Distance $35.00 1y 0502

I interpreted this information to mean that I had a one-year commitment starting in May 2002. I assumed I was free and clear, so I switched carriers.

I then got slapped with a $175 early termination fee.

I called Verizon. It turns out that I had a two-year commitment and that the “1y” is an internal code not meant to be decipherable by users. To add insult to injury, I missed my end date by only two weeks.

There are a couple of things wrong here:

  • Verizon’s website provides no clear information about what the contract duration is.
  • Verizon’s website provides external access to internal information. If something is for internal use only, keep it internal. Otherwise people will assume that information is supposed to be relevant to them and is meant to be decipherable.

Had Verizon not generously provided useless but misleading information, I would have called them to find out my service agreement details. At best, Verizon is totally clueless and is too lazy to make their website usable. At worst, Verizon’s committing fraud. “1y” is readable enough to be misinterpreted but vague enough for Verizon to avoid accountability and to claim that they didn’t lie outright.

Of course, it’s not in Verizon’s best interests to let people know when their agreements expire, so what do I really expect?

One of the main reasons I switched away from Verizon at all was because I was throwing money away; each month I paid $35 for less than 10 minutes of actual usage. Sigh.

Programming responsibility

April 12, 2004 at 8:08 pm (PT) in Programming, Rants/Raves

I’ve recently spent some time looking at the source code to several open-source projects.

I am appalled.

Here’s a sampling of what I’ve encountered:

  • Buffer overflows. I cannot understand how C programmers continue to use gets, strcpy, and strcat when fgets, strncpy, and strncat are available. This isn’t that hard. (The C standard library shares a lot of blame; it never should have provided the unchecked versions, and it at least should have given gets/fgets and strcpy/strncpy consistent behaviors. strncpy and strncat aren’t as easy to use as they ought to be, either.)
  • Unchecked errors. Writing error-checking code is hard and a lot of work, so apparently some people decide to forgo it altogether. Woe upon the user.
  • Illegal language usage. Apparently a lot of people think it’s perfectly legal in C or C++ to name identifiers with a leading underscore. Often it’s not. Although in practice a namespace clash is unlikely, identifiers with leading underscores can intrude into the compiler’s namespace.

    I’ve also argued (unsuccessfully) with some people who recklessly invoke undefined behavior.

  • Unmaintainable code. I suppose that I naively hoped that How to Write Unmaintainable Code was a work of fiction, but yes, Virginia, there are magic numbers. It’s odd that so much open-source code has poor documentation, sparse comments, and cryptic function and variable names. What’s the point of publishing your code if it’s unreadable?
  • Inconsistent styles. More readability and maintenance problems stem from projects that lack strong core leadership.

What’s the big deal? The obvious problems are software vulnerabilities and broken software. I think that there might be a deeper problem, however.

Code begets code. Many people learn how to program by looking through other people’s code, and publishing bad code makes it too easy to propagate bad habits and to produce lousy de facto standards. Worse, people might copy-and-paste bad code outright. Yes, it can work the other way too; good code can be a paradigm for others to follow. Unfortunately, the sheer quantity of bad code available makes good code a needle in a haystack. Saturating the planet with immature software projects is not a step forward.

As potential pedagogues, open-source developers ought to be held to a higher standard. Unfortunately, reality offers a contrary opinion. A recent study indicates that email is more likely to be ignored when there are more recipients; are similar factors at play in the programming world? Does the nature of open-source code—where there’s a lack of accountability and where anyone can look at and change the source code—encourage its developers to shirk their responsibilities in the hopes that someone else will clean up their mess? Or are programmers in general just too lazy, too unskilled, too ignorant, or too apathetic? (I’m admittedly not a very experienced programmer; I lack design experience in particular. If nothing else, though, I’m meticulous (so Jeff Wong says) and consistent, and I make a conscious effort to be responsible with what I write!)

(Before anyone gets the wrong idea, I am not an opponent of open-source software. I’m picking on open-source code mostly because closed-source code isn’t available to be criticized.)

Overloaded buttons

March 25, 2004 at 4:21 pm (PT) in Rants/Raves, Usability

A lot of audio players like to combine Play and Pause buttons. If a song is playing, pressing the Play button again pauses the track; if a song is paused, pressing the Play button resumes playing. It makes sense and reduces the number of buttons, right?

Winamp does not take this approach, and I commend its developers for it.

On the surface, combining these two buttons seems like a great idea. However, let’s suppose you want to replay the current track. Maybe you’re not paying attention and suddenly realize that the song you’re listening to is a favorite that you haven’t heard in a while, or maybe you’re just in the mood to listen to a particular song over and over.

In Winamp’s separate button design, if a song is playing, pressing the Play button again re-plays the current track. In the combined button design, pressing the Previous Track button typically returns to the track’s beginning. Seems like a sound design so far.

Now let’s suppose you want to skip to the previous track. In the separate button design, the Previous Track button always selects the previous track. In the combined button design, however, the Previous Track button only sometimes selects the previous track, since it’s also used to restart the current track. Typically the button skips to the previous track only if it’s already at the beginning to another track—the “beginning” defined as some arbitrary number of seconds.

Thus, overloading the Play button with pausing functions means we need to overload the Previous Track button too. Furthermore, overloading the Previous Track button requires introducing either time-dependent behavior or button modality. That’s a significant increase in interface complexity to save a single button. (The Previous Track and Next Track buttons typically have another behavior too; they seek within tracks when the buttons are held down.)

Increasing interface complexity increases error rates. My car stereo uses the combined button approach, and it makes it difficult for me to navigate backwards through CD tracks when I want to find a particular track. I need to listen for a few seconds to determine which track is which, and depending on how long I wait to listen, I sometimes need to press the Previous Track button once and sometimes twice. This is even more problematic when I’m driving and not looking at the track number.

Makers of software-based audio players have nothing to lose by providing separate buttons; virtual buttons have no monetary cost. Makers of car stereos should provide separate buttons because it’s a little safer.

(Although I prefer separate buttons, if space is at a premium, a respectable compromise might be to combine the Pause and Stop buttons instead. If pressed briefly, pause; if held down, stop. If the Stop button has been done away with too, then perhaps holding down the Play button can replay the current track.)

Internet Explorer’s history is useless.

March 23, 2004 at 12:13 am (PT) in Rants/Raves, Usability

The history pane in Internet Explorer 5 and 6 tries to be intelligent and user-friendly, but in practice it seems almost completely useless.

Consider this scenario: You surf the web and follow a lot of random links. The next day, you try to find a specific page again.

I do not believe this is an uncommon task. How can you accomplish this in IE? IE’s history has a “View » By Date” option, where it separates history items by day or by week. Unfortunately, within each day, items are sorted alphabetically by site. This is counter-productive; if users remembered the address of the site, why would they need to look for it in the history pane?

(IE also has a “View » By Order Visited Today” option, where it lists each page you’ve visited during the past day in chronological order. Unfortunately, when Microsoft says “today”, they really mean today, not the past 24 hours. IE is not friendly to late-night web surfers; once the clock on your PC rolls past midnight, say goodbye to history items from 11:59 PM; you now have to remember the site address if you want to find them again. Oh, and if it just turned Monday, your history items from Sunday are now lost amidst all other items from “Last Week”.)

The only alternatives are to try to retrace the original steps or to try to find it with a search engine, both tedious chores.

Why should a seemingly simple task be so incredibly hard?

Netscape/Mozilla have done it right for ages. They allow you to view all history items in a single, flat list. Superficially, it might not appear to be as organized as IE’s hierarchical history pane, but it’s infinitely more useful, and it’s simpler too.

Unfortunately, years of using IE have weaned me away from using browsers’ valuable history features. Ugh.