Effective Database Design for Geoscience Professionals

Much of the confusion and many misconceptions about data management stem from an incomplete or incorrect understanding of key terms and concepts. This chapter introduces the reader to the most important data management concepts and essential terms and illustrates many of the features common to virtually all data management systems. Readers familiar with these concepts will be certain that the definitions are essentially the same as those they are familiar with, thus providing a common ground for later chapters.
Traditional data management texts tend to be jargon-laden and unduly complex for the novice. A complete summary of the technical terms, acronyms, and abbreviations common to geotechnical data management can be found in the glossary. However, some of the more important terms are defined here to provide a basis for further discussions without requiring the reader to repeatedly refer to the glossary.
A database is any collection of information that provides a centralized, comprehensive reference source for users of that information. Ideally, a database is digital, is easily accessed by all users, and contains only the most accurate and complete data available. However, in the real world, databases can and do include paper documents, photographs, maps, notes, books, physical rock samples, and digital files on various incompatible computers with different formats. The goal for the data management geoscientist is to develop a database that attempts to balance practicality with the ideal database goal (i.e., all the relevant data are in one place in one consistent format).