Biodiversity Information Service in China: The Architecture and Techniques Experiences from the Construction of NSII
Zheping Xu (xuzp@ibcas.ac.cn), Keping Ma, Haining Qin, Jinzhong Cui
Institute of Botany, Chinese Academy of Sciences
TDWG 2012 Conference, Oct. 25, 2012
NSII (National Specimen Information Infrastructure)
Outline: Data, Architecture, Service
What is Specimen (Occurrence) Data?
Time Scale: Fossil => Current living life
Geospatial Scale: Local => Regional => Global => Outside the planet (meteorite)
New Concept: EventGIS = Who + When + Where + What.
Primary Biodiversity Data (The GBIF Network): Biodiversity & Geo-diversity
Taxonomy: Class: Insecta; Order: Lepidoptera; Family: Pyralidae; Genus: Ostrinia Hübner, 1825; Species: Ostrinia nubilalis (Hübner, 1796); Synonym: Pyralis nubilalis Hübner, 1796
Vernacular names: European Corn-borer (EN); Maiszünsler (DE); Piral del maíz (ES); Pyrale du maïs (FR)
DNA Sequence: Locus: AAL35331; Definition: acyl-coa Z/E11 desaturase; mvpyattadg hpekdecfed...
Foodplant: Family: Gramineae; Zea mays L. 1753
Image 1
Description (Cultural, Usage): Diagnosis: Wingspan 26-30 mm; sexually dimorphic; male: forewings ochreous to dark brown; female: forewings pale yellow
Literature: Pheromones of Ostrinia, http://www.nysaes.cornell.edu/fst/faculty/acree/pheronet/phlist/ostrinia.html
Distribution (Event, History): Collection: DGH Lepidoptera; Record id: DGHEUR_003217; Country: France; Coordinates: 03.047 E 48.730 N; Date: 28 June 2003; Collector: Donald Hobern
Climate: Average Rainfall; Location: 48.82 N 2.29 E; Jan Feb Mar Apr... 182.3 120.6 158.1 204.9...
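The EventGIS idea (Who + When + Where + What) can be illustrated with the distribution record on this slide. The following is only a minimal sketch: the class name `OccurrenceEvent` and its field names are hypothetical and are not taken from NSII or GBIF; only the values come from the slide.

```python
from dataclasses import dataclass
from datetime import date


@dataclass(frozen=True)
class OccurrenceEvent:
    """One occurrence record expressed as Who + When + Where + What.

    Hypothetical structure for illustration; not the NSII schema.
    """
    who: str     # collector
    when: date   # collection date
    lat: float   # decimal degrees, north positive
    lon: float   # decimal degrees, east positive
    what: str    # scientific name

    def summary(self) -> str:
        # One-line rendering: What | Who | When | Where
        return (f"{self.what} | {self.who} | {self.when.isoformat()} | "
                f"{self.lat:.3f}, {self.lon:.3f}")


# Values copied from the distribution record on the slide.
event = OccurrenceEvent(
    who="Donald Hobern",
    when=date(2003, 6, 28),
    lat=48.730,
    lon=3.047,
    what="Ostrinia nubilalis (Hübner, 1796)",
)
print(event.summary())
```

Keeping the four dimensions as separate typed fields makes it straightforward to filter or group records by any one of them.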
Names, Specimen, Literature, Images, Observatory, Community
Trend: high-quality integration
Data Status (NSII)
Sub-platform | Data types | Main records
Plant platform | Plant specimens, names, books, distributions, authors | 3.31 million specimens, 256 volumes of books
Educational platform | Plant specimens, animal specimens, some color photos | 2.52 million specimens
Animal platform | Animal specimens | 2.13 million specimens
Protected Area platform | Plant specimens, animal specimens | 0.55 million
Mineral & Rock platform | Rocks, fossils, ores, minerals | 0.12 million
Polar Sample platform | Polar biology specimens, ice samples | 6,000
Other cooperative websites | BHL China, Species 2000 China Node, several photo sites, DNA barcode | > 5 million self-built records, 15,000 registered users
Plant Platform
Animal Platform
Educational Platform: Universities
Protected Area
Mineral & Rock
Polar Sample
Name Literature Photos Photos
Evolution of Data
Small => Big => Simple & Visualization
Scattered => Central => Thematic
Static and single => Online and Linkage
Managed by small team => By virtual professional team => High-Quality, Feedback, Crowdsourcing
Metadata: none => Single Standard => Mashup
Storage: ACCESS/EXCEL => MYSQL, SQL SERVER => SOLR => NoSQL
Architecture: Cache, LSID, Evaluation, High-Quality and the Control
All-in-one Design: NSII (National Specimen Information Infrastructure)
Biodiversity AND Geodiversity: >9 million specimens, 100+ institutes & universities
Scale: Global, Regional, National, Thematic, Local
Platforms: Plant, Rock, Education, Animal, Protected Area, Polar
Supporting Database: DNA Barcode, Literature, Distribution, Ecology (Habit)
User & Communities
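The slide lists LSID (Life Science Identifier) as one architectural component. An LSID is a URN of the form `urn:lsid:<authority>:<namespace>:<object>[:<revision>]`. The sketch below shows how such an identifier could be split into its parts; the function name `parse_lsid` and the example identifier are hypothetical and do not come from NSII.

```python
from typing import NamedTuple, Optional


class LSID(NamedTuple):
    authority: str
    namespace: str
    object_id: str
    revision: Optional[str]


def parse_lsid(text: str) -> LSID:
    """Split an LSID URN into authority, namespace, object and revision."""
    parts = text.split(":")
    # Expect "urn", "lsid", three mandatory parts, and an optional revision.
    if len(parts) not in (5, 6):
        raise ValueError(f"not an LSID: {text!r}")
    if parts[0].lower() != "urn" or parts[1].lower() != "lsid":
        raise ValueError(f"not an LSID: {text!r}")
    authority, namespace, object_id = parts[2:5]
    if not (authority and namespace and object_id):
        raise ValueError(f"empty LSID component: {text!r}")
    revision = parts[5] if len(parts) == 6 else None
    return LSID(authority, namespace, object_id, revision)


# Hypothetical identifier, for illustration only.
example = parse_lsid("urn:lsid:example.org:specimen:PE-0001")
print(example.authority, example.namespace, example.object_id)
```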
Service
DAAS: Data-As-A-Service
SAAS: Software-As-A-Service
KAAS: Knowledge-As-A-Service
DAAS: Data-As-A-Service
Query and download online
Submit an application
One type => multiple types
Data cleaning on request
Different ways to display data
DAAS: Data-As-A-Service
Description, Name, Status, Distribution, Geological, Taxon, DNA Barcode, References, Habit (Ecology), Cultural & Usage
DAAS: Data-As-A-Service Achievement
SAAS: Software-As-A-Service
Use Tools and Functions
Integrated Species Page
Multimedia Analysis (When, Where, What)
CBIR: Query and Identification (photos)
Video: Filter, Analysis and Notification (video frames)
Literature data process
Stages: Hard Copy => PDF (b/w) => OCRed Text => Structured Text
Tools and sources: Internet Archive, ABBYY SDK, Annotation & Term & Dictionary, User upload, Citation
Structured fields: Literature, Name, Status, Description, Distribution, Habit, Geological
Collection History of Dendrobium (Genus) from 1900-2009
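A collection-history chart like this one can be derived by binning specimen collection years. The sketch below assumes binning by decade, which is only one plausible choice and is not stated on the slide; the function name `count_by_decade` and the sample years are made up for illustration and are not NSII data.

```python
from collections import Counter
from typing import Dict, Iterable


def count_by_decade(years: Iterable[int], start: int = 1900,
                    end: int = 2009) -> Dict[int, int]:
    """Count records per decade, keeping empty decades as zero."""
    # Map each in-range year to the first year of its decade.
    counts = Counter((y // 10) * 10 for y in years if start <= y <= end)
    first_decade = (start // 10) * 10
    return {d: counts[d] for d in range(first_decade, end + 1, 10)}


# Made-up collection years, for illustration only.
sample_years = [1905, 1911, 1958, 1959, 1983, 2004, 2009, 2012]
history = count_by_decade(sample_years)
for decade, n in history.items():
    print(f"{decade}s: {n}")
```

Years outside the requested range (here 2012) are ignored, and decades without records still appear with a count of zero so that the timeline has no gaps.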
Group Query Results on Map
Who + When + Where + What visual System
Modules of Drupal 7
Boost: caches most static pages
Entity:
Flag: add to your favorites
Og: groups for users interested in particular fields
Pathauto: URLs friendly to search-engine spiders and humans
Subscriptions: subscribe to almost anything
Token:
User Points: ranks and permissions
Group & Permission
Subscription
Querying from Database vs. SOLR
Geological Time
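To make the Solr side of this comparison concrete, the sketch below builds a request URL for Solr's standard `select` handler using the `q`, `fq`, `rows` and `wt` parameters. The host, the core name `specimens` and the field names `genus` and `year` are assumptions for illustration; the actual NSII index layout is not given in the slides. No request is sent.

```python
from urllib.parse import parse_qs, urlencode, urlsplit


def build_solr_query(base_url: str, core: str, genus: str,
                     year_from: int, year_to: int, rows: int = 10) -> str:
    """Build a Solr select URL: genus match plus a year-range filter."""
    params = [
        ("q", f"genus:{genus}"),                     # main query (hypothetical field)
        ("fq", f"year:[{year_from} TO {year_to}]"),  # range filter (hypothetical field)
        ("rows", str(rows)),                         # number of documents to return
        ("wt", "json"),                              # response format
    ]
    return f"{base_url.rstrip('/')}/{core}/select?{urlencode(params)}"


# Hypothetical host and core name.
url = build_solr_query("http://localhost:8983/solr", "specimens",
                       "Dendrobium", 1900, 2009)
print(url)
# Decode the query string again to show the parameters Solr would receive.
print(parse_qs(urlsplit(url).query))
```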
KAAS: Knowledge-As-A-Service
Identification
Pilot project in biodiversity research and informatics
http://darwintree.cn/index.htm
Image Annotation and Identification Knowledge Base (more expert experience is still needed)
NSII (National Specimen Information Infrastructure): www.nsii.org.cn (under development)