About: VirtCrawlerSPARQLEndpoints

Not logged in : Login

Facets (new session)
Description
Metadata
Settings
- Rule:
- Inverse Functional Properties:
- "Same As":

About: VirtCrawlerSPARQLEndpoints Goto Sponge NotDistinct Permalink

An Entity of Type : atom:Entry, within Data Space : ods.openlinksw.com associated with source document(s)

Attributes	Values
type	Comment atom:Entry
Date Created	2017-06-13 05:49:29.939406 (xsd:dateTime)
Date Modified	2017-06-13 05:49:29.939406 (xsd:dateTime)
label	VirtCrawlerSPARQLEndpoints
maker	WebDAV System Administrator owiki
Title	VirtCrawlerSPARQLEndpoints
isDescribedUsing	http://vos.openlinksw.com/dataspace/owiki/wiki/VOS/VirtCrawlerSPARQLEndpoints/sioc.rdf
has creator	http://vos.openlinksw.com/dataspace/dav#this http://vos.openlinksw.com/dataspace/owiki#this
attachment
content	%META:TOPICPARENT{name="VirtSetCrawlerJobsGuide"}% ---+Setting up a Content Crawler Job to Retrieve Content from SPARQL endpoint The following step-by guide walks you through the process of: * Populating a Virtuoso Quad Store with data from a 3rd party SPARQL endpoint * Generating RDF dumps that are accessible to basic HTTP or WebDAV user agents. 1. Sample SPARQL query producing a list SPARQL endpoints: PREFIX rdf: PREFIX rdfs: PREFIX owl: PREFIX xsd: PREFIX foaf: PREFIX dcterms: PREFIX scovo: PREFIX void: PREFIX akt: SELECT DISTINCT ?endpoint WHERE { ?ds a void:Dataset . ?ds void:sparqlEndpoint ?endpoint } 1 Here is a sample SPARQL protocol URL constructed from one of the sparql endpoints in the result from the query above: http://void.rkbexplorer.com/sparql/?query=PREFIX+foaf%3A+%3Chttp%3A%2F%2Fxmlns.com%2Ffoaf%2F0.1%2F%3E+%0D%0APREFIX+void%3A+++++%3Chttp%3A%2F%2Frdfs.org%2Fns%2Fvoid%23%3E++%0D%0ASELECT+distinct+%3Furl++WHERE+%7B+%3Fds+a+void%3ADataset+%3B+foaf%3Ahomepage+%3Furl+%7D%0D%0A&format=sparql 1 Here is the cURL output showing a Virtuoso SPARQL URL that executes against a 3rd party SPARQL Endpoint URL: $ curl "http://void.rkbexplorer.com/sparql/?query=PREFIX+foaf%3A+%3Chttp%3A%2F%2Fxmlns.com%2Ffoaf%2F0.1%2F%3E+%0D%0APREFIX+void %3A+++++%3Chttp%3A%2F%2Frdfs.org%2Fns%2Fvoid%23%3E++%0D%0ASELECT+distinct+%3Furl++WHERE+%7B+%3Fds+a+void%3ADataset+%3B+foaf%3Ah omepage+%3Furl+%7D%0D%0A&format=sparql" http://kisti.rkbexplorer.com/ http://epsrc.rkbexplorer.com/ http://test2.rkbexplorer.com/ http://test.rkbexplorer.com/ ... ... ... 1 Go to Conductor UI. For ex. http://localhost:8890/conductor : %BR%%BR%%BR%%BR% 1 Enter dba credentials 1 Go to "Web Application Server"-> "Content Management" -> "Content Imports" %BR%%BR%%BR%%BR% 1 Click "New Target" %BR%%BR%%BR%%BR% 1 In the presented form enter for ex.: * "Crawl Job Name": voiD store * "Data Source Address (URL)": the url from above i.e.: http://void.rkbexplorer.com/sparql/?query=PREFIX+foaf%3A+%3Chttp%3A%2F%2Fxmlns.com%2Ffoaf%2F0.1%2F%3E+%0D%0APREFIX+void%3A+++++%3Chttp%3A%2F%2Frdfs.org%2Fns%2Fvoid%23%3E++%0D%0ASELECT+distinct+%3Furl++WHERE+%7B+%3Fds+a+void%3ADataset+%3B+foaf%3Ahomepage+%3Furl+%7D%0D%0A&format=sparql * "Local WebDAV Identifier": /DAV/void.rkbexplorer.com/content * "Follow links matching (delimited with ;)": % * Un-hatch "Use robots.txt" ; * "XPath expression for links extraction": //binding[@name="url"]/uri/text() * Hatch "Semantic Web Crawling"; * "If Graph IRI is unassigned use this Data Source URL:": enter for ex: http://void.collection * Hatch "Follow URLs outside of the target host"; * Hatch "Run "Sponger" and "Accept RDF" %BR%%BR% %BR%%BR%%BR% 1 Click "Create". 1 The target should be created and presented in the list of available targets: %BR%%BR%%BR%%BR% 1 Click "Import Queues": %BR%%BR%%BR%%BR% 1 Click "Run" for the imported target: %BR%%BR%%BR%%BR% 1 To check the retrieved content go to "Web Application Server"-> "Content Management" -> "Content Imports" -> "Retrieved Sites": %BR%%BR%%BR%%BR% 1 Click voiD store -> "Edit": %BR%%BR%%BR%%BR% 1 To check the imported URLs go to "Web Application Server"-> "Content Management" -> "Repository" path DAV/void.rkbexplorer.com/content: %BR%%BR%%BR%%BR% 1 To check the inserted into the RDF QUAD data go to http://cname/sparql and execute the following query: SELECT * FROM WHERE { ?s ?p ?o } %BR%%BR%%BR%%BR% %BR%%BR%%BR%%BR% ---++Related * [[http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#rdfinsertmethodvirtuosocrawler][Setting up a Content Crawler Job to Add RDF Data to the Quad Store]] * [[VirtSetCrawlerJobsGuideSitemaps][Setting up a Content Crawler Job to Retrieve Sitemaps]] (when the source includes RDFa) * [[VirtSetCrawlerJobsGuideSemanticSitemaps][Setting up a Content Crawler Job to Retrieve Semantic Sitemaps]] (a variation of the standard sitemap) * [[VirtSetCrawlerJobsGuideDirectories][Setting up a Content Crawler Job to Retrieve Content from Specific Directories]] * [[VirtCrawlerGuideAtom][Setting up a Content Crawler Job to Retrieve Content from ATOM feed]]
id	375585bb5d3b7b0fa04f28ce2a196565
link	VirtCrawlerSPARQLEndpoints
has container	owiki's Wiki
http://rdfs.org/si...ices#has_services	ODS Wiki item services
atom:title	VirtCrawlerSPARQLEndpoints
links to	http://docs.openlinksw.com/virtuoso/rdfinsertmethods.html#rdfinsertmethodvirtuosocrawler http://localhost:8890/conductor VirtSetCrawlerJobsGuideDirectories VirtSetCrawlerJobsGuideSemanticSitemaps VirtSetCrawlerJobsGuideSitemaps WebDAV VirtCrawlerGuideAtom http://cname/sparql
atom:source	owiki's Wiki
atom:author	WebDAV System Administrator
atom:published	2017-06-13T05:49:29Z
atom:updated	2017-06-13T05:49:29Z
topic	owiki's Wiki
is made of	WebDAV System Administrator
is container of of	owiki's Wiki
is link of	VirtCrawlerSPARQLEndpoints
is http://rdfs.org/si...vices#services_of of	ODS Wiki item services
is creator of of	http://vos.openlinksw.com/dataspace/dav#this http://vos.openlinksw.com/dataspace/owiki#this
is atom:entry of	owiki's Wiki
is atom:contains of	owiki's Wiki

Faceted Search & Find service v1.17_git150 as of Jan 20 2025

Alternative Linked Data Documents: iSPARQL | ODE Content Formats:

RDF

ODATA

Microdata

About

OpenLink Virtuoso version 08.03.3332 as of Feb 27 2025, on Linux (x86_64-generic-linux-glibc212), Single-Server Edition (15 GB total memory, 2 GB memory in use)
Data on this page belongs to its respective rights holders.
Virtuoso Faceted Browser Copyright © 2009-2025 OpenLink Software