Dennis Ritchie Day

Tim O’Reilly declared today, October 30, as Dennis Ritchie Day. That was five years ago but nonetheless we should all pause and consider how much Ritchie affects our life and work. He was a giant and we should give thanks that he spent some time with us.

Posted in General | Tagged | Leave a comment

Zamansky Discusses Miscellaneous Packages

Mike Zamansky has a new video up. This time it’s about a few of the miscellaneous packages he uses. He covers

  • Highlight line mode
  • Beacon mode
  • Hungry delete mode
  • Expand region

I use two of those and the others are compelling enough that I may try them out.

The video is just short of 11 and 3/4 minutes so it’s easy to watch during a coffee break. As always with Zamansky’s videos, you’re sure to learn something so it’s definitely worth your time.

Posted in General | Tagged | Leave a comment

How to Fix Swiper Search Failure

I woke up yesterday and started working only to become sad. As soon as I tried to do a swiper search I got the error

Wrong type argument: window-live-p

I’ve come to depend on swiper and hated to be without it. It’s all Zamansky’s fault; his Searching a Swiper video got me hooked and now I can’t live without it. I compensated by using isearch-backward, which is still bound to Ctrl+r, but I was bummed to be without the swiper functionality.

I thought that abo-abo would have it fixed pretty quickly but when a day went by without resolution I did a little investigation. It turns out that it’s easy to fix this. I deleted and reinstalled both swiper and counsel and everything started working again. It may be that one or the other of them is sufficient but the whole operation for the two of them at once was so fast I’d just reinstalled them both.

Posted in General | Tagged | 2 Comments

Consumers Win One

Yesterday, in my post about AT&T, Time Warner, and the Iron Law of Data Collection, I complained that in all whining about what an unfair advantage AT&T/Time Warner would have in advertising, little notice was taken of the fact that the data in question doesn’t belong to AT&T, it belongs to their users. In a rare victory for consumers (at least here in the U.S.) the FCC has agreed and passed a regulation to the effect that ISPs and carriers can not sell or use consumers’ viewing information without their permission.

Naturally, the media and communication companies are upset and we will doubtlessly see law suits and lobbying to try to get the ruling overturned. It doesn’t go into effect for a year so consumers may never see the benefits of the ruling. That would be a shame. The carriers are all in on the common carrier provisions that let them regard the information they’re carrying as an opaque bit stream for whose content they can’t be held responsible. But ask them to please not spy on that bit stream and suddenly it’s valuable proprietary information.

But I shouldn’t be peevish. At least for now consumers can expect to enjoy a little bit of privacy and the ownership of their viewing and communication habits.

UPDATE [2016-10-29 Sat 13:55]: The FCC has a fact sheet about the ruling.

Posted in General | Tagged | Leave a comment

The Iron Law Strikes Again

It’s been a while since I’ve written about the Iron Law of Data Collection. That’s the idea that when the Government collects data, regardless of the stated rationale, there will inevitably be mission creep that sees the data being used in new, unintended ways and that it always leads to abuse. It turns out that the law applies to organizations other than the government.

In the same post linked above, I noted that If the data is there, anyone with an interest in exploiting it will demand access. Those interests are not limited to law enforcement and public safety concerns. Media companies plan to ask U.S. regulators to force AT&T to share the huge store of customer use data that they’ve been collecting since 1987 through their Hemisphere program. Although the data is often shared with law enforcement, without warrant but for a fee, the purpose of the Hemisphere program is to monetize that data.

With AT&T’s proposed acquisition of Time Warner, media companies are whining that the data will give AT&T/Time Warner an unfair advantage in targeted advertisements and are demanding that regulators force AT&T/Time Warner to share that data with them. Notice how the notion that that data doesn’t really belong to ATT&T in the first place—it’s the viewing/texting/calling history of its customers—doesn’t come into play here. The data is there, it’s exploitable, and all the players are demanding access.

In a just world, it would be against the law to collect (or at least to store indefinitely) that information but then law enforcement would have to deal with getting those pesky warrants when it wanted to check on whom we’ve been communicating with. Worse, the other carriers, who do periodically purge their records, will see their customer data as exploitable and also begin storing it long term.

It’s the iron law. Once that data is collected it will be used in ways that we find objectionable. The only solution, as I say in every iron law post, is to prevent that collection in the first place. The carriers should collect that data only for billing purposes and purge it as soon as the bill is paid. Of course, that’s not going to happen because exploitable.

Posted in General | Tagged | Leave a comment

A Literate Programming Example with Org Mode

Arne Babenhauserheide has posted a nice example of literate programming with Org mode. It’s a bare-bones example but shows what you can do including stitching code blocks together.

Using Org mode for literate programming probably isn’t as powerful as something like Knuth’s CWEB but it does have the advantage of working with just about any language and of running the code directly from the (untangled) source file to insert results directly into the document.

Even if, like many people, you don’t care for the literate programming paradigm, the idea of combining code and narrative is still very powerful. The approach used by (for example) John Kitchen to embed the code used for calculations and figure generation is a huge aid in reusing results, calculations, and figures and is a significant step towards reproducible research.

In any event, take a look at Babenhauserheide’s example. It may give you some useful ideas and it certainly serves as another demonstration of the power of Org mode and Babel.

Posted in General | Tagged , | 1 Comment

Google Reneges on Their Privacy Pledge

If you were surprised by this

you haven’t been paying attention.

Google is in the business of collecting information about you and selling it to advertisers. That imperative informs just about everything they do. You can think of “Don’t be evil” as a sunny Norman Rockwell painting of a world that probably never existed and certainly doesn’t now.

I long ago switched to DuckDuckGo for my web searching and you’d be wise to do likewise. Likewise, I use Gmail only as a forwarder for technical mailing lists so Google probably knows I’m a geek. I try to limit their knowledge to that.

Posted in General | Tagged | Leave a comment

Reproducible Research Webinar

Back in June I wrote about A Complete Example of Reproducible Research with Org. One of the authors of that paper was Arnaud Legrand who has an excellent webinar on reproducible research. Sadly, the video of the talk is in Flash, which makes it unwatchable for many of us. Happily they also have an MPEG4 version but it took a long time to download. Nonetheless, it was worth the wait. It’s a really great talk on why you should be doing reproducible research and a demonstration of some of the tools you can use to help you organize and produce it.

The first half of the talk considers the problems that ordinary research papers can have and how reproducible research can help solve them. Legrand also talks about some problems with implementing reproducible research. In the second half of the talk, Legrand demonstrates a few of the tools that a researcher can use to help with producing reproducible research.

One of these, reprozip, helps you capture the environment that you used to run your software. He also looks at Rstudio and Ipython and shows how they can help produce papers while embedding code and results directly into the final result. His favorite tool, though, is Emacs/Org Mode, which he says is more powerful and versatile. He describes how he and his students keep journals (in Org mode) that record everything they did and the results they got. These journals are invaluable when it comes time to write the paper. If you read Legrand’s complete example paper, you saw an example of this is in action.

The talk is just over 2 hours 45 minutes so you’re going to have to block out time (or several times) to watch. It’s worth it though, so if you have any interest at all in the subject matter, I urge you to take the time. Don’t worry too much about reading everything on his slides because they are also available.

Finally, I’m happy to report that this is the first in a series of Webinars on reproducible research. There a three more currently available and another is planned for some time this month. The others webinars discuss

  • Controlling your environment
  • Numerical reproducibility
  • Logging and backing up your work

If you follow the link to the other videos, you will find a list of topics they plan to cover in the future.

Posted in General | Tagged , , | 1 Comment

Org Mode for PhD Students

It’s pretty much conventional wisdom that Org Mode is the killer app for Emacs. Even people who don’t use Emacs seem to be aware of that and, of course, there are various efforts underway to port Org to other editors.

Matthieu Caneill considers how Org Mode can be leveraged by PhD students to organize their work and research. He discusses his file setup and how he uses each one in his workflow. It’s a nice way of organizing things and exactly what you’d expect an Org mode beginner to have. Lots of people, including me, use similar organizations but it’s not the only strategy available. Another possibility—and one that is arguably more Org like—is to have a single file for all your long term activities. The use of tags and advanced searching functionality make this an ideal solution.

My journal file is pretty much like that but I have a few others for things like managing my blog and recording medical/exercise data. If I were starting over, I’d consider combining everything—or almost everything—into a single journal file. In any event, if you’re starting your PhD—regardless of your area—you should consider organizing things in Org. It’s all plain text so you’re not stuck and can move to something else if you find it suits your needs better.

Over and over again, you hear people saying how much Org mode increased their efficiency and productivity. If you’re starting work on a PhD, your going to need all the help you can get so it’s worth spending a little time coming up to speed with Emacs and Org mode.

Posted in General | Tagged , | Leave a comment

Public Service Announcement

Posted in General | Tagged | Leave a comment