The Data reference article from the English Wikipedia on 24-Apr-2004
(provided by Fixed Reference: snapshots of Wikipedia from wikipedia.org)

Data

A datum is a statement accepted at face value. Data is the plural of datum. A large class of practically important statements are measurements or observations of a variable. Such statements may comprise numbers, words, or images.

The word data is the plural of Latin datum, neuter past participle of dare, "to give", hence "something given". The past participle of "to give" has been used for millennia, in the sense of a statement accepted at face value; one of the works of Euclid, circa 300 BC, was the Dedomena (in Latin, Data). In discussions of problems in geometry, mathematics, engineering, and so on, the terms givens and data are used interchangeably. Such usage is the origin of data as a concept in computer science: data are numbers, words, images, etc., accepted as they stand.

In English, the word datum is still used in the general sense of "something given", and more specifically in cartography, geography, and geology to mean a reference point, reference line, or reference surface. The Latin plural data is also used as a plural in English, but it is commonly treated as a mass noun and used in the singular, as in "This is all the data from the experiment". This usage is inconsistent with the rules of Latin grammar, which would suggest "These are the data ...", each measurement or result being a single datum. However, given the variety and irregularity of English plural constructions, there seem to be no grounds for arguing that data is incorrect as a singular mass noun in English.

Raw data are numbers, characters, images, or other outputs from devices that convert physical quantities into symbols, in a very broad sense. Such data are in a form that can be further processed by a human or (especially) input into a computer, stored and processed there, or transmitted (output) to another human or information processor. Computers nearly always represent data in binary. Raw data is a relative term; data processing commonly occurs in stages, and the "processed data" from one data processing system may be considered the "raw data" of the next.
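
The idea of staged processing and binary representation can be illustrated with a minimal Python sketch. Everything named here (the sample readings and the functions parse_stage, summarize_stage, and to_binary) is hypothetical and chosen only for illustration; it is not drawn from any particular data processing system.

```python
# Hypothetical raw output from a measuring device: readings as text symbols.
raw_readings = ["21.5", "22.0", "22.5"]


def parse_stage(lines):
    """Stage 1: convert raw text symbols into numbers."""
    return [float(line) for line in lines]


def summarize_stage(values):
    """Stage 2: take stage 1's output as its own raw data and average it."""
    return sum(values) / len(values)


def to_binary(text):
    """Return the bit pattern of each byte in the UTF-8 encoding of text."""
    return " ".join(format(byte, "08b") for byte in text.encode("utf-8"))


# The list below is "processed data" for stage 1 and "raw data" for stage 2.
parsed = parse_stage(raw_readings)
mean_reading = summarize_stage(parsed)

print(parsed)
print(mean_reading)
print(to_binary("A"))
```

The same list of numbers plays both roles: it is the result of one stage and the starting point of the next, which is the sense in which "raw" is relative.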

Data on its own has no meaning; only when interpreted by some kind of data processing system does it take on meaning and become information. People or computers can find patterns in data to perceive information, and information can be used to enhance knowledge. Since knowledge is a prerequisite to wisdom, we always want more data and information. But as modern societies verge on information overload, we especially need better ways to find patterns.
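
As a rough illustration of data taking on meaning only through interpretation, the following Python sketch reads the same four bytes in three different ways. The variable names and the choice of the bytes b"data" are assumptions made for the example; the three interpretations are just a few of many possible ones.

```python
import struct

# Four bytes with no inherent meaning until an interpretation is chosen.
payload = b"data"

# Interpretation 1: ASCII text.
as_text = payload.decode("ascii")

# Interpretation 2: an unsigned big-endian integer.
as_integer = int.from_bytes(payload, "big")

# Interpretation 3: a big-endian 32-bit floating-point number.
(as_float,) = struct.unpack(">f", payload)

print(as_text)
print(as_integer)
print(as_float)
```

The bytes never change; only the rule applied to them does, and each rule yields different information.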


See also: data processing -- data mining -- data warehouse -- datasheet -- computer program

Other uses of this term include: Data (Star Trek) (fictional android). Data is also the name of the character played by actor Ke Huy Quan in the film "The Goonies."


This article (or an earlier version of it) contains material from FOLDOC, used with permission.