Hadoop, for those of you not in the know, is a scalable framework for performing data-intensive distributed applications. If you’re reading this article, however, you probably already know that: you’re intrigued by Hadoop’s performance potential and its proclaimed ability to run on commodity hardware. In this article, we’ll talk a bit about the hardware required to run Hadoop, and what the best configuration would be to get the most bang for your buck! First things first, however: don’t think you can just run out there and grab hundreds of bargain bin PCs and call your job done. Hadoop’s big draw is… View full post »
Hadoop Installation Tutorial
August 5th, 2011 Leave a comment 2 commentsJust for posterity, Apache Hadoop is a software framework that supports data-intensive distributed applications under a free license. It enables applications to work with thousands of nodes and petabytes of data. Hadoop was inspired by Google’s MapReduce and Google File System (GFS) papers. Since you arrived to this page, I’ll assume that you have some idea of what Hadoop is and what it is used for. This tutorial will walk you through an installation of Hadoop on your workstation so you can begin exploring some of its powerful features. Hadoop has traditionally been a royal pain to setup and configure properly…. View full post »
Using memcached to Cache Content
July 26th, 2011 Leave a commentWeb servers in the modern era are expected to do a great deal of processing and serving of all different types of content; the amount and type of content now, in fact, is far greater and more varied than it was even ten years ago. In response to these demands placed on web servers and database servers, caching systems have come into existence in order to speed up performance and help with database and web performance. memcached is a distributed memory caching system developed for use in LiveJournal’s web and database service. It is now a general-purpose, open caching server for… View full post »
Running Hadoop: Your First Single-Node Cluster
July 22nd, 2011 Leave a commentHadoop can be a very powerful resource, if used correctly; it’s a system designed to work with a vast amount of data while simultaneously taking advantage of the hardware it’s using, low-end or not. It can be a bit difficult to set up, however; as a result, many people don’t take advantage of the system. Let’s take a look at how to set up your first single-node cluster to give you a look at how Hadoop can help your business! This tutoral uses the Apache distribution, but you can just as easily use it with Cloudera’s. NOTE: This tutorial assumes you’re… View full post »
Installing Android NDK
July 20th, 2011 Leave a commentWhat is Android NDK? Chances are, if you’ve found this article, you know already what the Android NDK is. If you’ve stumbled upon here by chance, however, here’s a quick explanation: The Android NDK is a set of tools provided by Google to allow Android developers to code portions of their applications in native C, which then gets compiled into the application to run just the same as any other Android application. The advantage to this is that there is a performance increase for some processes with native code (though not automatically) and some developers can harness it to see performance… View full post »
Amazon Associates Affiliate Alternatives
July 18th, 2011 Leave a comment 4 commentsFor those Amazon affiliates living in California, a rather crushing blow has been dealt recently by California’s government. A new bill, ABX1 28, has forced all online retailers with a physical presence in the state of California to pay taxes in the state of California, and this bill rules that affiliates, such as the ones in Amazon’s affiliate program, count as a physical presence. In response, Amazon shuttered its affiliate program practically overnight and left some 25000+ affiliates floundering for a way to supplement their affiliate income. Like many California webmasters out there, we’ve had a few articles on our… View full post »
Android Malware: Should You Be Concerned?
July 8th, 2011 Leave a commentThere have been, recently, a fair number of apps pulled from the Android market because they were, in fact, malware; they would dial pay numbers and do other shady things, including stealing contacts. Though Google has since pulled these apps from the market, the lingering, uncomfortable question remains: Should we be worried about Android malware? The question is difficult to answer because of the layers of meaning in that question, but for the most part we can say this: though no computing platform is ever truly safe, Android users who do not stray from the official Android markets will, for the most… View full post »
PHP Tutorial: An Introduction to PDO
July 5th, 2011 Leave a commentSince its inception, PHP has moved from a procedural language to an object-oriented one; some of its procedural roots, however, still linger. One of these procedural remnants is an antique method of accessing SQL databases; many programmers still use the older mysql and postgresql procedural modules to access databases, and even those who have switched to newer methods, like mysqli, might still be hampered by the fact that such a module only works with mysqli. PHP has since implemented better methods, but many procedural PHP developers are unsure of how to use newer, object-oriented MySQL connection methods. Let’s delve… View full post »
Should You Be a Hybrid? 5 Design Skills for Developers
June 24th, 2011 Leave a commentIf you watch a lot of science fiction, you might think that I’m talking about some sort of part-human, part-alien hybrid. What I’m really talking about is a hybrid of a web developer and web designer. For a long time, the two roles have been somewhat separate. Web designers often had a background in graphic design or art and focused on the front end look and feel of a web site. On the other hand, web developers had a background in computer science or software development and focused their skills on the backend development. Today’s… View full post »
Intro to Hadoop
June 10th, 2011 Leave a commentHadoop. A seemingly nonsensical word that keeps getting thrown around whenever you’re in a meeting. What does it mean? What does it do? Let’s read on and find out! What is Hadoop? Hadoop is a project by the Apache Foundation in order to handle large data processing jobs. It was originally conceived by Doug Cutting, the creator of Apache Lucene (Who based the name on his son’s stuffed pet elephant, incidentally). He was inspired to do so after hearing about Google’s MapReduce and GFS projects, which were Google’s way of handling very large amounts of data at once; Hadoop is an… View full post »