Data Basics

It’s good to have an understanding of the jargon of data.

First of all, a bit (which is short for “binary digit”) is the smallest form of data in a computer. Think of it as the atom. A bit can either be 0 or 1, which is binary. It is also generally used to measure the amount of data that is being transferred (say within a network or the Internet).

A byte, on the other hand, is mostly for storage. Of course, the numbers of bytes can get large very fast. Let’s see how in Table 2-1.

Table 2-1.

Types of data levels

UnitValueUse Case
Megabyte1,000 kilobytesA small book
Gigabyte1,000 megabytesAbout 230 songs
Terabyte1,000 gigabytes500 hours of movies
Petabyte1,000 terabytesFive years of the Earth Observing System (EOS)
Exabyte1,000 petabytesThe entire Library of Congress 3,000 times over
Zettabyte1,000 exabytes36,000 years of HD-TV video
Yottabytes1,000 zettabytesThis would require a data center the size of Delaware and Rhode Island combined

Data can also come from many different sources. Here is just a sampling:

  • Web/social (Facebook, Twitter, Instagram, YouTube)
  • Biometric data (fitness trackers, genetics tests)
  • Point of sale systems (from brick-and-mortar stores and e-commerce sites)
  • Internet of Things or IoT (ID tags and smart devices)
  • Cloud systems (business applications like Salesforce.com)
  • Corporate databases and spreadsheets

Comments

Leave a Reply

Your email address will not be published. Required fields are marked *