Published on Data Management Plan Tutorial (https://www.e-education.psu.edu/dmpt)

Home > Course Outline > Part 1: Data and Data Collection > 1.6 Summary

1.6 Summary

Your DMP should describe briefly the research methodology for your project - i.e., how you will collect data for your project and what the goals of such collection will be. Will there be secondary data from a previous or existing project that you'll use? Be sure to integrate any additional information that will help reviewers understand clearly your project and the techniques you'll be implementing.

Be sure to note the types of data you'll be collecting - e.g., specimen, observational, experimental, simulation, derived, etc. The DMP should also state what formats your data will be in - will they be text files, numerical data, modeling data, software code? Use, where possible, open-source (i.e., non-proprietary) formats - or, at the very least, formats in heavy use by your research community. For example, many researchers use Excel to keep track of data they're collecting; though a proprietary format, it's almost ubiquitous usage is providing assurances that it will subsist for some time.

In addition, the DMP should describe the tools or software you'll rely upon to make sense of the data and perform analyses on them.

Finally, in writing about your research methodology and data collection practices in a DMP, try to estimate how fast (or slowly) your data will grow. Where will the data be kept? How much storage is it possible to anticipate to cover the expected rate of data growth? How often will you need to access the data you're collecting? If there are data from an existing project, then it might prove a relevant exercise to revisit data from a previous project and see if the rate of growth could be tracked over a certain period of time.

banner

Check Your Understanding

In each example below, choose the preferred format to ensure accessibility for the long term:

(a) .doc
(b) .txt

test-bulbClick for answer.

ANSWER: (b) .txt. When working with file formats, it is best to use open, non-proprietary formats, such as .txt, which does not depend on any particular software in order to be read; any program is able to open and read a .txt file. A .doc file, on the other hand, is a MicroSoft (MS) Word format and thus proprietary. It is true that MS Word is widely used and, as a result, will likely continue to be supported by MicroSoft for some time. (the same with MS Excel.) There are also open-source word processing products, like Apache OpenOffice and LibreOffice that work across operating systems (Linux, Windows, and Mac).

(a) TIFF
(b) GIF

test-bulbClick for answer.

ANSWER: (a) TIFF. Many organizations that preserve and archive media such as images use TIFF (Tagged Image File Format) for preservation of master files and reproduction master files. TIFF file format is considered stable, has broad adoption, is well specified, and is uncompressed. TIFFs are not recommended for use copies, or for opening in web browsers. GIF (Graphics Interchange Format), a bitmap image format, is widely popular for displaying images on the Internet, which means that, unlike TIFF, the GIF format can be rendered in web browsers easily. It is probably most often used for animated images. GIF should not be used for preserving images.

 


Source URL: https://www.e-education.psu.edu/dmpt/node/658