If you’re a journalist, understanding how the Internet works is probably the most important thing you can do to benefit your career right now.
No, I don’t necessarily mean learning how to code, although that would boost your career immensely. (See the January/February Quill cover story on coding.) I mean the actual mechanics of the Internet: how information gets from a place and turns into a website, or an email gets sent, or how a Skype chat session works.
Before you scoff, let’s do a little experiment. Think back over your day. Count how many times you did one of these tasks:
• Did you check your email?
• Did you look at anything in a Web browser (Chrome, Firefox, Internet Explorer, etc.)?
• Did you check social media?
• Did you buy anything anywhere today, whether online or in person?
• Did you call, text or Snapchat anyone?
• Did you read or hear any news stories today?
Yeah, you probably did a lot, right? Almost every part of everything you do today as a journalist involves the Internet.
You may think, “But my newsroom said social media was what I had to learn! Or data journalism! Or multimedia storytelling! Or insert buzzword here!” OK, sure. But what do all of those things live on? That’s right: the Internet.
And for those of you who might say, “I’m just not a computer person,” you’re not allowed that excuse anymore. It’s 2014. If you started in journalism anywhere in the past 30 years, a computer has probably been on your desk the majority of your career. It’s like a cop saying, “I’m not a radio communication person.” It’s a cop out (yeah, I made that joke), and we got into journalism to learn new things every day.
“But Andy, I don’t plan on ever becoming a Web developer,” you might rightly say. That’s cool. But if you want to commit any sort of journalism in the future, you’ll probably need to chat with nerds like me. And if you better understand the vocabulary, we’ll be able to communicate more effectively and churn out better journalism that can have a greater effect on and increase engagement with our audiences.
If that’s not good enough for you, ask yourself if you’d like a job in five years. How about in 10 years? 30? Because as hard as curmudgeons in journalism may try to make it so, the Internet isn’t going away. It will continue to permeate every part of the industry, not to mention become a larger chunk of how everything we cover as journalists operates. I’m talking education, business, governments, military, the arts.
Just like if you were expected to know how an engine works if you reviewed cars for a living, you should know about the main thing — the Internet — that fuels, shapes and disrupts almost every industry and coverage area journalists report on.
That still not enough to convince you? How about this: Did you hear about Edward Snowden and the NSA? Did you hear about the Heartbleed bug? Or about data breaches at any major retailers lately? The mechanics of the Internet — and general computer technology — are at play. Being ignorant of them makes you more susceptible to compromising your sources, your security and more. If you care about protecting your sources from eavesdropping in the digital world, you need to learn how to protect them.
So, commence with the learning …
INTERNET BASICS
First, let’s start with basics. I’m going to leave out some of the higher-level technical explanations to make it easier. So if you’re smarter than me and saying to yourself, “Andy, you forgot about (very in-depth thing that isn’t necessary to understand Internet basics),” yeah, I know. But this is a magazine article, and space is limited.
So what happens when you visit a Web page? To put it in terms many journalists can understand: It’s like going to a bar and ordering a beer. And how does one order a drink? You go to your local watering hole and ask a bartender for a tasty brew. She walks to a specific fridge, grabs your drink and brings it to you. Then you drink it and maybe peel off the label, for kicks.
Let’s try to use that same analogy with its more technical components. Let’s say you want to visit my website, andyboyle.com. When you type a website into your browser (such as Chrome, Firefox, Safari, Internet Explorer, etc.), that’s like you asking your bartender for a drink. Your bartender then needs to figure out which fridge your drink lives in. In technical terms, your browser is using what are called DNS servers — which are basically the phonebooks of the Internet — to figure out the IP address — a string of numbers that points to a specific server — of what you’re trying to fetch.
In our case, the DNS servers tell you that andyboyle.com lives on something called Amazon S3 (I’ll explain this later), and then Amazon’s servers give your browser all the items (HTML, CSS and JavaScript; again, I’ll explain this more later) it needs to assemble a totally rad website. So just like getting a beer from a fridge, the IP address hits a server and gives you back the information you want.
And now you enjoy your beer, which is like looking at a Web page. But the way you, personally, experience a beer is different from how the person sitting next to you at the bar may experience it. She may pick up on the flavor of the hops, you may only notice the beer’s bitterness. This is like the difference between viewing websites in different browsers.
Put another way, your browser translates HTML, CSS and JavaScript and displays it the way that browsers’ programmers decided it should look. It’s kind of like if you give five different bands the sheet music to a song by The Replacements. They’re not all going to cover it the same way, even though they’ve been given the same instructions.
INTERNET’S BASIC VOCABULARY
Let’s learn the Internet’s basic vocabulary. The building block of the Internet is HTML, which stands for Hypertext Markup Language. These are the basic tags, such as These are the basic tags, such as < p > or < strong > or < h1 >, that tell your browser a basic way to recognize what to do with the text inside them. Think of these as the frame, walls and floor of a house.
Next come CSS, which stands for cascading stylesheets. This allows you to say, “Hey, every time we have a paragraph tag on an article page, make it size 14, use this font type and make it this color.” This way you don’t need to specifically repeat yourself over and over. Think of this as being able to decide what the frame, walls and floor of a house are going to be colored, what material they’ll be made out of and how they will be different on each floor.
Last comes JavaScript. This is something that actually “runs” in your browser. This allows you to manipulate the CSS or the HTML on your page, dynamically changing them. For instance, on many news sites, when you hover over your navigation bar, a dropdown menu pops up. That’s generally powered by JavaScript. Think of this as a contractor who you can tell to run through your house, change one wall to purple, write “Andy rules” on the floor and add a garage.
The way most websites work is by contacting a database and displaying information. If you’ve ever used a spreadsheet, that’s basically a simpler way of thinking how a database stores information, in fields and rows. When you go to an article on a news website, that’s talking to a server and saying, “Hey, send me the information stored in your database about this article.” It sends you the byline, the photo if there is one, the article copy, maybe links to similar stories and more.
Most modern news organizations have content management systems — that’s the fancy way of saying the place you store and manage your website content. These systems are backed by databases so they can quickly update stories or other pieces of content. Think of when you make a Facebook status and then update it; it works in a similar way.
WEB DEVELOPMENT COSTS
The way we make the Internet is now cheaper than ever before. And it’s only getting cheaper. To store one terabyte of data — about 500,000 photos from your smartphone, for instance — the hardware in 1992 cost about $400,000. In 2012? On Amazon Web Services’ S3, which is a system Amazon uses to store static content (images, HTML, etc.) it costs $128 a month.
To build something on the Internet, you used to need to buy expensive servers in the tens of thousands of dollars and have pretty brainy staff support. Now servers like the ones Amazon and others provide allow you to basically “rent” portions of servers for much lower costs, sometimes around $50 to $100 a month. And because they’ve made it easier to administrate your servers, this can also lower your personnel costs because you don’t need someone sitting in a hot room making sure servers don’t literally catch on fire.
At the Chicago Tribune, we try to use a way of making Web projects where we “bake” them out into static assets — the barebones HTML, CSS and JavaScript (and also images). This means we don’t even necessarily need to have a real server running, so we just pay for storage on Amazon S3, which has relatively cheap costs for storage and traffic. Our users can’t tell the difference between these sites and something being served by a fancy database. And these websites don’t go down because there’s no server to go down.
NOW GO FORTH
There. Now you know everything about the Internet. Just kidding; that’s not even remotely true. But now you have a bit more knowledge of how things work and how information gets from one spot to another, at least on a website. If you’d like to know more, take the folks who work on your website out to lunch or beers (or beers at lunch). If you don’t have those people in your newsroom, Google for your answers.
I previously did a presentation on this topic at SPJ’s 2012 JournCamp in Chicago. You can get the presentation materials here.
The Internet is always changing, but the fundamentals have stayed relatively static for a few decades. And if you start understanding how it works now, it will not only help better inform your coverage, but it will open your eyes to new ways of displaying that coverage online, make your work more appealing to your audience and hopefully be of more value to them in years to come, as our online audiences continue to grow.
Andy Boyle is a developer for the Chicago Tribune’s TribApps team, a journalism graduate of the University of Nebraska and an SPJ trainer. Reach him at andymboyle@gmail.com or interact on Twitter: @andymboyle