This HTML5 document contains 28 embedded RDF statements represented using HTML+Microdata notation.

The embedded RDF content will be recognized by any processor of HTML5 Microdata.

Prefix   Namespace IRI
dcterms  http://purl.org/dc/terms/
atom     http://atomowl.org/ontologies/atomrdf#
foaf     http://xmlns.com/foaf/0.1/
n17      http://vos.openlinksw.com/dataspace/services/wiki/
opl      http://www.openlinksw.com/schema/attribution#
n2       http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/
dc       http://purl.org/dc/elements/1.1/
n20      http://vos.openlinksw.com/dataspace/dav#
rdfs     http://www.w3.org/2000/01/rdf-schema#
n16      http://rdfs.org/sioc/services#
n13      http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/VirtSetCrawlerJobsGuide/sioc.
sioct    http://rdfs.org/sioc/types#
n8       http://vos.openlinksw.com/dataspace/person/dav#
n6       http://vos.openlinksw.com/dataspace/owiki/wiki/
rdf      http://www.w3.org/1999/02/22-rdf-syntax-ns#
n10      http://vos.openlinksw.com/dataspace/owiki#
n15      http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#
xsdh     http://www.w3.org/2001/XMLSchema#
n7       http://vos.openlinksw.com/dataspace/%28NULL%29/wiki/VOS/
n11      http://vos.openlinksw.com/dataspace/person/owiki#
sioc     http://rdfs.org/sioc/ns#
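The prefixed names used in the statements that follow (for example n2:VirtSetCrawlerJobsGuide) abbreviate full IRIs: the prefix is swapped for its namespace IRI from the table above and the local part is appended. A minimal sketch of that expansion follows; the `expand` helper and the `PREFIXES` dictionary are hypothetical names introduced for illustration, and only a subset of the table is copied in.

```python
# Subset of the prefix table above, copied by hand for illustration.
PREFIXES = {
    "dcterms": "http://purl.org/dc/terms/",
    "atom": "http://atomowl.org/ontologies/atomrdf#",
    "sioc": "http://rdfs.org/sioc/ns#",
    "sioct": "http://rdfs.org/sioc/types#",
    "n2": "http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/",
    "n13": "http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/VirtSetCrawlerJobsGuide/sioc.",
}


def expand(name, prefixes=PREFIXES):
    """Expand a 'prefix:local' name into a full IRI (hypothetical helper)."""
    prefix, sep, local = name.partition(":")
    if not sep or prefix not in prefixes:
        raise KeyError("unknown prefix in %r" % name)
    # Plain string concatenation: namespace IRI followed by the local part.
    return prefixes[prefix] + local


if __name__ == "__main__":
    for name in ("n2:VirtSetCrawlerJobsGuide", "sioct:Comment", "n13:rdf"):
        print(name, "->", expand(name))
```

Concatenation is all that is needed here because each namespace IRI already ends in the separator it requires (`/`, `#`, or, for n13, the trailing `sioc.`).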
Subject Item
n2:VirtSetCrawlerJobsGuide
rdf:type
sioct:Comment atom:Entry
dcterms:created
2017-06-13T05:47:49.689032
dcterms:modified
2017-06-13T05:47:49.689032
rdfs:label
VirtSetCrawlerJobsGuide
foaf:maker
n8:this n11:this
dc:title
VirtSetCrawlerJobsGuide
opl:isDescribedUsing
n13:rdf
sioc:has_creator
n10:this n20:this
sioc:content
%META:TOPICPARENT{name="VOSIndex"}%
---+ Quad Store Data Loading via Virtuoso's In-built Content Crawler
This guide covers the use of Virtuoso's in-built content crawler as a mechanism for scheduled or one-off data loading operations for its native quad store.
---++ Why is this important?
Transforming external data sources into Linked Data "on the fly" (e.g., via the 'Sponger') is sufficient for many use cases, but there are times when the volume or nature of a data source makes batch-loading necessary. For example, Freebase offers RDF representations of its data, but it doesn't publish RDF dumps; even if it did, such dumps would usually be outdated by the time they were loaded. Thus, a scheduled crawl of that resource collection offers a viable alternative.
---++ How to Set Up the Content Crawler for Linked Data Generation and Import
The Virtuoso Conductor can be used to set up various Content Crawler Jobs:
   * [[http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#rdfinsertmethodvirtuosocrawler][Setting up a Content Crawler Job to Import Linked Data into the Virtuoso Quad Store]]
   * [[VirtSetCrawlerJobsGuideSitemaps][Setting up a Content Crawler Job to Retrieve Sitemaps]] (when the source includes RDFa)
   * [[VirtSetCrawlerJobsGuideSemanticSitemaps][Setting up a Content Crawler Job to Retrieve Semantic Sitemaps]] (a variation of the standard sitemap)
   * [[VirtSetCrawlerJobsGuideDirectories][Setting up a Content Crawler Job to Retrieve Content from Specific Directories]]
   * [[VirtCrawlerGuideAtom][Setting up a Content Crawler Job to Retrieve Content from ATOM feed]]
   * [[VirtCrawlerSPARQLEndpoints][Setting up a Content Crawler Job to Retrieve Content from SPARQL endpoint]]
sioc:id
a20a29b21e47327ea4337ace2622ba74
sioc:link
n2:VirtSetCrawlerJobsGuide
sioc:has_container
n6:VOS
n16:has_services
n17:item
atom:title
VirtSetCrawlerJobsGuide
sioc:links_to
n2:VirtSetCrawlerJobsGuideDirectories n7:VirtSetCrawlerJobsGuideSitemaps n2:VirtSetCrawlerJobsGuideSemanticSitemaps n7:VirtCrawlerSPARQLEndpoints n15:rdfinsertmethodvirtuosocrawler n2:VirtCrawlerGuideAtom
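The sioc:links_to values appear to correspond to the link targets in the sioc:content markup, where each link is written as `[[target][label]]`. A minimal sketch of pulling those pairs out of the markup follows; the `extract_links` helper and the regular expression are assumptions made for illustration and are not taken from Virtuoso or the wiki engine.

```python
import re

# Matches double-bracket links of the form [[target][label]].
# Neither part may itself contain square brackets.
LINK_RE = re.compile(r"\[\[([^\[\]]+)\]\[([^\[\]]+)\]\]")


def extract_links(markup):
    """Return a list of (target, label) tuples, in order of appearance."""
    return LINK_RE.findall(markup)


# Two list items copied from the sioc:content value above.
SAMPLE = (
    "   * [[VirtSetCrawlerJobsGuideSitemaps]"
    "[Setting up a Content Crawler Job to Retrieve Sitemaps]]"
    " (when the source includes RDFa)\n"
    "   * [[VirtSetCrawlerJobsGuideDirectories]"
    "[Setting up a Content Crawler Job to Retrieve Content from Specific Directories]]\n"
)

if __name__ == "__main__":
    for target, label in extract_links(SAMPLE):
        print(target, "|", label)
```

Bare page names such as VirtSetCrawlerJobsGuideSitemaps are relative targets; resolving them against a namespace IRI from the prefix table is a separate step that this sketch does not attempt.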
atom:source
n6:VOS
atom:author
n8:this
atom:published
2017-06-13T05:47:49Z
atom:updated
2017-06-13T05:47:49Z
sioc:topic
n6:VOS