 |
La Selva Biological Station
|
|
INSTRUCTION GUIDE FOR COMPLETION OF METADATA FIELDS
This guide describes the different fields included in the metadata
description of the datasets.
When creating or modifying a metadata record, it should be
kept in mind that it is not
necessarily the person who creates the metadata record who will
be most benefitted by this information (metadata), but rather
users all over the Internet who search in the
RDMCNFS database.
Unless otherwise noted, all fields are free-format text. There are no
unreasonable restrictions on the length of the text (number of lines or
characters). Upper and lower case should be used naturally; there are no
particular requirements in this regard for storing or indexing the information.
Fields marked with a "+" are required; those marked with a "-" are
optional, though in general it is desirable to include as much
information as possible.
- Reference
- Dataset Properties
- Dataset Access
- Metadata Control
- Additional Information
TITLE
The name of the dataset. It must be specific enough to distinguish between
datasets on the same subject.
Examples:
- Historical and seasonal variation in the composition of mixed-species
bird flocks at a lowland tropical forest.
- Antiparasitoid Mechanisms in Tropical Lepidopteran Larvae
SUBJECT AND KEYWORDS
The subject area(s) or topic(s) addressed by the dataset, including
additional words that may be helpful in cross-referencing or cataloging
the data.
Examples:
- Insect behavior. Hymenoptera. Vespidae. Parasitism.
- Forest dynamics. Quercus. Nuclear radiation.
- Bird banding.
- Mammal count.
- Meteorological data.
- Tree growth.
- Inventory of arthropods.
- Research projects.
- Birds. Aves. Old growth. Second growth. Monitoring. Census. Ant-following
birds. Euphonia. Tachyphonous. Greenlet. Honeycreeper. Canopy. Understory.
Resident species. Migratory species.
AUTHOR(S)
The person(s) primarily responsible for the intellectual content of
the dataset, including their electronic mail and web page, postal addresses
and institutional affiliation, if appropriate.
Examples:
- Craig Dodson (dodson@wpogate.mesa.colorado.edu
School of Natural Sciences and Mathematics
Mesa State College
Grand Junction, CO 81502
- Bruce Young (young@sloth.ots.ac.cr)
Organization for Tropical Studies (OTS/OET).
http://www.ots.ac.cr
OTHER AGENT(S)
The list of other people who have made significant intellectual contributions
to the work. These might include
primary investigators (PIs), research assistants (RAs) or technicians
(Techs) who were also involved in the data collection or compilation,
as well as editors and transcribers.
Examples:
- Data entry was done by Fermina Daza and Artemio Cruz.
- Santiago Nassar and Forrest Wood provided technical support for the conversion from Wordstar to SGML.
FUNDING SOURCE(S)
The funding sources which supported the project that generated the dataset.
Examples:
- Consejo Nacional para el Conocimiento y uso de la Biodiversidad (CONABIO),
México.
- National Science Foundation (NSF), USA.
ABSTRACT
A short, accurate description of the dataset and the project that created
it.
Also include a description of the methods used to collect the data. (Do
not include literature references pertaining to the methods. They should
be included in the ADDITIONAL INFORMATION field.)
DATA DESCRIPTION
Description of the variables (data columns) that are included in the
dataset, and the methods used for their collection.
For datasets that will not be available through the RDMCNFS
system, this field should describe in general terms the variables
therein. The description may be narrative or tabular, reflecting
the conceptual content rather than the actual names of the variables.
If appropriate, the names of the actual data files may be included.
Examples:
-
We performed censuses along existing trails. We walked 1.25 km/hr during dry
weather and recorded all flocks we encountered, noting leader (songwren,
antwren, _Tachyphonus_ sp., Scarlet-rumped cacique, Lesser Greenlet, Social
Flycatcher, Ant-following, _Chlorothraupis carmioli_, miscelaneous, or group of
big birds) and attendant species, number of individuals of each species, height
of flock (Canopy, subcanopy, midlevel, understory, or army-ant following). All
censuses took place between 0700-1600. We performed approximately one census
per week during the study period.
Data on location, species composition, height, flock type, habitat, and time of
discovery for each of 359 flocks found are located in a Filemaker Pro data
base.
Summaries of the data are in a Microsoft Excel workbook. For each of the 50
census days, there are summaries of the number of flocks found at each height
and in each flock type in each habitat type, together with the number of km
walked and minutes spent in each habitat. Also, a summary of the number of
flocks found during each hour of the day in old and second growth for the
entire study.
- These seedlings were grown from local seeds in a nursery for 3-6 months, then
transplanted into study sites. The dataset includes 580 seedlings: 240 in
second-growth, 240 in tree plantations, and 90 in a forest-pasture edge
transect. There are also 23 records for seedlings which were measured and
harvested at the start of the experiment to obtain baseline data on
characteristics of transplants. Growth variables collected included: seedling
height (height to meristem), number of leaves, number of leaflets, diameter at
10 cm, variance of the diameter measurements. These data were collected
bimonthly for each seedling. Photosynthetic variables include: chlorophyll
content, specific leaf area, photosynthetic light response curves (Li-Cor 6200
Photosynthesis system), whole plant carbon gain on a daily basis estimated
using the YPLANT model. Light variable include: percent diffuse transmittance
(Decagon Sunfleck Ceptometer used with Li-Cor Quantum sensor in an open site),
fisheye photographs, GaAsP photodiodes.
NOTE: For datasets that will be available through the RDMCNFS
system, the document guidelines for describing
datasets in detail describes how the variables, file format, etc. should
be described.
LANGUAGE
The two-character code (from the ISO 639 standard) and full name for the language used in the dataset. (e.g., ES-Spanish,
EN-English, PT-Portuguese).
In the event that multiple languages are used (e.g., in different files)
then each must be specified.
Examples:
- ES-Español/Spanish.
- EN-English/Inglés.
- PT-Portuguese/Portugués.
COVERAGE
The spatial locations and temporal duration characteristics of the dataset.
Include the names of the sampling locations: trails, plots, rivers. When
available, use map coordinates, latitude and longitude. Specify the time
interval of the study and the frequency of the studies.
Examples:
- Spatial coverage: La Selva Biological Station. OTS. Second-growth sites located
within the 1 ha plots of the BOSQUES project (PI-Robin Chazdon) located off the
LEP trail and the LSUR trail. Tree plantations sites located in the Vochysia
ferruginea plots (Block 3 and 4) of the TRIALS project in the Peje Annex.
Temporal coverage: June 1997-present (as of Sept 1997)
- First 300 meters of the "Las Cercas" trail. Data collected monthly from 1996-07-01 to 1997-01-15.
FORM AND SIZE
The data format and, if known, the size of the dataset or the files.
The user should be able to determine
which tools will be needed in order to access the data, as
well as have an estimate of the storage space required
if the dataset is to be copied. For commercial
software formats, the version number used should be included.
For all datasets (even if not online), it is desirable for the
number of records in the collection should be included.
Examples:
- Comma-separated tabular data (CSV), 1204 records (64 KB)
- The "50ha.xls" file is an Excel 5.0 spreadsheet; the "species.dbf"
file is a DBase (DBF) table.
- DBase / XBase (DBF), 3.4 MB, 23879 records
- Microsoft Access, 600 records.
- Microsoft Word 2.0 for Windows, approx 30 printed pages
- HTML 2.0
- Paper, 760 cards
- ASCII text file (unstructured), 3 MB
- Other (e.g. paper, microfilm, photographs). (Specify.)
TROPICAL FIELD STATION NAME
The name of the Tropical Field Station where the data was
primarily collected. (Taken from a pull-down list of options.)
- ECOSUR - San Cristóbal de las Casas. México.
- OTS - La Selva. Costa Rica.
- STRI - BCI. Panamá.
CONTACT(S)
The name of the person(s) or institution(s) to be contacted for matters
regarding the data or metadata, including whatever information necessary
(e.g., postal address, e-mail address, fax, etc.) Whenever possible, a
position (e.g., "Data Manager") rather than an individual's name
should be used. If these are the same as the Authors (see above), then
it suffices to indicate so. For transient (i.e., non-resident) researchers
Example:
- Data Manager, La Selva Station, OTS. (email: dmanager@sloth.ots.ac.cr)
- (See "Authors")
- Jim Jones, STRI/NAOS. Panamá. (email: jjones@stri.si.edu)
AVAILABILITY
The conditions for data availability:
- Public: data available to the general public
- Restricted: access requires approval of dataset administrator
- Confidential: data available only to the primary investigators
- Other (specify)
Whether or not the data is available in a computer-readable form:
- Online
- Magnetic media but not available online
- Not on magnetic media (e.g., on paper, etc.)
The tools needed to access the data, and the appropriate references:
- Electronic mail
- Ftp
- Web
- Specialized search interface (e.g., via "telnet")
- Phone / Fax
- Postal mail
Examples:
- Public access. Online. http://www.ots.ac.cr/rdmcnfs/datasets/binabitrop
- Public access. Online. ftp://ftp.stri.si.edu/pub/datasets/trees95
- Restricted access. (See "Contacts")
RELATION TO OTHER DATASETS
Relationship between this and other datasets, metadata records,
works by other PIs, that may be of interest
when viewing the data set.
Examples:
- See also the record for the dataset "Understory air temperature as
a factor in nest construction in Vespid wasps."
METADATA LOG
Log records to track the operations that have been applied to the metadata
record (not the dataset). The metadata entry and modification system
will automatically create log entries with the date, time, and name
of the user who performed them operation. What is entered manually is
the description of the operation.
Examples:
- Updated "Avaliability" now that the data is available on-line.
- Added the "species.dbf" file to the Data Description field.
- Changed "Coverage" and "Form and Size" to reflect additional records in dataset.
- Changed "Form and Size" and "Availability" to reflect conversion to HTML.
ADDITIONAL INFORMATION
Comments referring to the whole dataset, including explanations of any
problems with data collection or the dataset, or reasons for missing data.
This should refer to the entire dataset or about data columns
(variables) in the dataset. Do not include information about particular data
points.
When applicable, include any relevant literature references (including
author, title, journal and page numbers) pertaining to the methods used
to collect the data.
For datasets derived from other collections of data, describe the original
collections and the derivation process.
Any other information considered relevant, but not covered by
the other metadata fields, should be included here.