Saturday, November 26, 2011

load01 11/26/2011

  • You can't parse [X]HTML with regex. Because HTML can't be parsed by regex. Regex is not a tool that can be used to correctly parse HTML. As I have answered in HTML-and-regex questions here so many times before, the use of regex will not allow you to consume HTML. Regular expressions are a tool that is insufficiently sophisticated to understand the constructs employed by HTML. HTML is not a regular language and hence cannot be parsed by regular expressions. Regex queries are not equipped to break down HTML into its meaningful parts. so many times but it is not getting to me. Even enhanced irregular regular expressions as used by Perl are not up to the task of parsing HTML. You will never make me crack. HTML is a language of sufficient complexity that it cannot be parsed by regular expressions. Even Jon Skeet cannot parse HTML using regular expressions. Every time you attempt to parse HTML with regular expressions, the unholy child weeps the blood of virgins, and Russian hackers pwn your webapp. Parsing HTML with regex summons tainted souls into the realm of the living. HTML and regex go together like love, marriage, and ritual infanticide. The

    cannot hold it is too late. The force of regex and HTML together in the same conceptual space will destroy your mind like so much watery putty. If you parse HTML with regex you are giving in to Them and their blasphemous ways which doom us all to inhuman toil for the One whose Name cannot be expressed in the Basic Multilingual Plane, he comes. HTML-plus-regexp will liquify the n​erves of the sentient whilst you observe, your psyche withering in the onslaught of horror. Rege̿̔̉x-based HTML parsers are

    tags: programming humor

  • We sill start off by walking though the different parts of a GIF file. (The information the page is primarily drawn from the W3C GIF89a specification.) A GIF file is made up of a bunch of different "blocks" of data. The following diagram shows all of the different types of blocks and where they belong in the file. The file starts at the left and works it's way right. At each branch you may go one way or the other. The large "middle" section can be repeated as many times as needed. (Technically, it may also be omitted completely but i can't imagine what good a GIF file with no image data would be.)

    tags: programming

  • On one sunny day I inserted some pretty innocent std::map into a struct, added a couple of one-liners, made sure it does what’s supposed to, committed and pushed. In some minutes I hear a scream. The damn thing crashes on a Linux. Why? It’s dead simple, it surely worked for me, there just cannot be anything wrong with it, it must work. Right?

    It just didn’t. After a quick look it was pretty clear – it’s a GCC bug. Perhaps the version is too recent and it has to be crawling with bugs. I started trying other versions but the problem wouldn’t go away. Well, it’s pretty clear now – it’s a GCC family bug! That would be a glorious discovery.

    We sit down together and start to minimize the example, trying to reproduce it with fewer lines of code. We are trying this and that, replacing types with simpler ones, making all kinds of changes, renaming classes, moving them to different files, rearranging the lines, etc., etc. Other people come and join us, we are three, four, five and people keep coming. Everyone’s interested, everyone’s got a theory.

    tags: programming

  • The emergency room at White Memorial Medical Center on Los Angeles' Eastside was buzzing when paramedics arrived on a Friday night with an elderly man slurring his words and complaining of aching bones.

    The nurse in the receiving bay immediately ran through standard triage questions: "Are you diabetic? Do you have high blood pressure? Are you allergic to any medications?" Each drew the same response: "I don't know."

    The hospital and doctors had no record of the man or his medical history. And with their only guide a piece of crumpled paper they found tucked into the man's pants that seemed to indicate he might have had cancer, doctors had to order a full diagnostic work-up, including blood tests and an EKG to check his heart.

    tags: news

Posted from Diigo. The rest of my favorite links are here.

No comments:

Post a Comment