The best way to show what we would ideally like to receive is with a small section of the database. In this format, adding an entry is typically just a copy-paste process.
Here are the first two entries from the database...
# The date format below is the preferred way and is shown as an example on the next line
date: January 14, 2015
authors: Aisha Abdelhamid
source: Clean Technica
title: Why New York Banned Fracking
paragraphs: New York State voters approve of Gov. Andrew Cuomo’s recent decision to ban hydraulic fracturing, or fracking, by a margin of 55 to 25 percent. According to a recent Quinnipiac University poll, no political party, gender, age group, or regional interest group disapproves of the ban against fracking in New York State. A recent article in Wall Street Journal does not carry the same favorable opinion, however. Implying that newly re-elected Governor Cuomo is an unknowledgeable wimp,...
EOF
# There are a couple of blank lines after this line
webpages: in_the_news, health
date: January 14, 2015
authors: Jamison Cocklin
source: NGI's Shale Daily
title: West Virginia University to Study Drilling Impacts at Experimental Shale Site
paragraphs: The West Virginia University School of Public Health will join a federal project announced in November to conduct air, noise, occupational safety and health monitoring over the next five years at an experimental shale gas drilling site in Morgantown.
EOF
There are six keywords, each followed by a colon and a tab, plus one EOF tag, all starting in column 1. The record for a link starts with the "webpages:" keyword and ends with "EOF". The EOF tag makes it simpler to tell records apart, because the value of the last keyword, "paragraphs:", can span any number of lines, each terminated by a newline/carriage return character.
The keywords are pretty self-explanatory. The preferred date format is as shown, but there are times when the day or even the month is not available. In those cases, just put down what you know.
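For anyone who wants to check their dates before submitting, here is a minimal Python sketch of how a full or partial date could be read. The function name `parse_entry_date` and the list of fallback formats are assumptions made for illustration; this is not the code the site actually runs.

```python
from datetime import datetime

# Formats tried in order: full date, month and year, year only.
# These fallbacks are an assumption; only the first one is the
# preferred format shown in the sample entries.
# Note: %B matches English month names only under an English locale.
FORMATS = ("%B %d, %Y", "%B %Y", "%Y")

def parse_entry_date(text):
    """Return (datetime, matched_format), or (None, None) if nothing fits."""
    text = text.strip()
    for fmt in FORMATS:
        try:
            return datetime.strptime(text, fmt), fmt
        except ValueError:
            continue  # try the next, less specific format
    return None, None

if __name__ == "__main__":
    for value in ("January 14, 2015", "January 2015", "2015"):
        print(value, "->", parse_entry_date(value))
```

When the day or month is missing, `strptime` fills in 1 for it, so the returned format string is what tells you how much of the date was really given.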
You can have multiple authors, even with their affiliation if you think it's important. For the "source:" provide the name of the organization that the link pointed you to.
In the "paragraphs:" section the preferred thing to do is to copy-paste material from the website, preferably the abstract. Copy a few sentences, or partial sentences, to reflect the intent of the article. If you copy only part of a sentence or paragraph, please end it with three sequential periods (...) to signify the omission.
One last thing. You can intersperse as many blank lines as you want; they are ignored. Also, any line that starts with the pound sign (#) is considered a comment and is also ignored.
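If you would like to test a file before sending it, the rules above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the parser the site uses: the function name `parse_records` is made up, the keyword spellings are taken from the sample entries, and any whitespace after the colon is accepted in place of a tab.

```python
# The six keywords, spelled as in the sample entries (an assumption).
KEYWORDS = ("webpages", "date", "authors", "source", "title", "paragraphs")

def parse_records(text):
    """Split the text into records, one dict per EOF-terminated entry."""
    records = []
    current = {}
    last_key = None
    for raw in text.splitlines():
        line = raw.rstrip()
        # Blank lines and comment lines (# in column 1) are ignored.
        if not line or line.startswith("#"):
            continue
        # An EOF tag closes the current record.
        if line == "EOF":
            if current:
                if "webpages" in current:
                    names = current["webpages"].split(",")
                    current["webpages"] = [n.strip() for n in names]
                records.append(current)
            current, last_key = {}, None
            continue
        key, sep, value = line.partition(":")
        if sep and key in KEYWORDS:
            # A keyword in column 1 starts a new field.
            current[key] = value.strip()
            last_key = key
        elif last_key is not None:
            # Anything else continues the previous field,
            # which is normally "paragraphs".
            current[last_key] += "\n" + line
    # A trailing record with no EOF tag is dropped in this sketch.
    return records
```

Calling `parse_records(open("links.txt").read())` (the file name is hypothetical) returns a list of dictionaries, with the "webpages" value split into a list of names.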
If you have any questions, send us a message via the "Contact" form from the menu above. We greatly appreciate your interest.
Each link can be assigned to one or more webpages; list the webpage names separated by commas. The names currently in the system are as follows (suggestions for additional ones are welcome; NOTE: this set is recreated every time new links are added):