Commit fa2421ce authored by Castillo's avatar Castillo

csX

parent 19996302
# CiteSeerX : doc for sqlite format
Authors have no ID. Each set of authors per publication could have duplicated authors. Harvesting results:
- 6.915.748 publications, where:
- 6.257.801 publications with authors
- 1.360.655 publications with keywords
- 1.261.447 publications with authors AND keywords
![alt tag](https://dl.dropboxusercontent.com/u/9975992/phylo/citeseerx.png)
### SQL TABLES DESCRIPTION
#### __publication__
- __doi__
- __oai_id__ : OAI-PMH url + doi.
- __url__ : citeseerx direct link.
- __name__ : the title.
- __abstract__
- __authors__ : authors fullnames concatenated by "|" in a string (they could be repeated).
- __keywords__ : keywords concatenated by "|" in a string.
- __publisher__
- __format__ : of the type "*application/x*".
- __fulltext__ : direct link to the full publication.
*I'm downloading fulltexts in other side.*
Markdown is supported
0% or
You are about to add 0 people to the discussion. Proceed with caution.
Finish editing this message first!
Please register or to comment