GrETEL Search Engine for Querying Syntactic Constructions in Treebanks
<?xml version="1.0" encoding="UTF-8"?>
<cmd:CMD xmlns:cmd="http://www.clarin.eu/cmd/1"
xmlns:cmdp="http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1342181139640"
xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance"
CMDVersion="1.2"
xsi:schemaLocation="http://www.clarin.eu/cmd/1 https://infra.clarin.eu/CMDI/1.x/xsd/cmd-envelop.xsd http://www.clarin.eu/cmd/1/profiles/clarin.eu:cr1:p_1342181139640 https://catalog.clarin.eu/ds/ComponentRegistry/rest/registry/1.1/profiles/clarin.eu:cr1:p_1342181139640/1.2/xsd">
<cmd:Header>
<cmd:MdCreator>rogierkraf</cmd:MdCreator>
<cmd:MdCreationDate>2013-11-30+02:00</cmd:MdCreationDate>
<cmd:MdProfile>clarin.eu:cr1:p_1342181139640</cmd:MdProfile>
<cmd:MdCollectionDisplayName>CLARIN Flanders</cmd:MdCollectionDisplayName>
</cmd:Header>
<cmd:Resources>
<cmd:ResourceProxyList>
<cmd:ResourceProxy id="GrETEL001">
<cmd:ResourceType>Resource</cmd:ResourceType>
<cmd:ResourceRef>http://nederbooms.ccl.kuleuven.be/eng/gretel/</cmd:ResourceRef>
</cmd:ResourceProxy>
</cmd:ResourceProxyList>
<cmd:JournalFileProxyList/>
<cmd:ResourceRelationList/>
</cmd:Resources>
<cmd:Components>
<cmdp:ClarinSoftwareDescription>
<cmdp:GeneralInfo>
<cmdp:name xml:lang="eng">GrETEL 1.0</cmdp:name>
<cmdp:title xml:lang="eng">GrETEL Search Engine for Querying Syntactic Constructions in Treebanks</cmdp:title>
<cmdp:version>1.0</cmdp:version>
<cmdp:publicationYear>2012</cmdp:publicationYear>
<cmdp:url>http://nederbooms.ccl.kuleuven.be/eng/gretel/</cmdp:url>
<cmdp:CLARINCentre>none yet</cmdp:CLARINCentre>
<cmdp:OriginalSource>http://portal.clarin.nl/node/1967</cmdp:OriginalSource>
<cmdp:ReleaseStatus>
<cmdp:LifeCycleStatus>superseded</cmdp:LifeCycleStatus>
<cmdp:lastUpdate>2013-09-11</cmdp:lastUpdate>
<cmdp:version>1.0</cmdp:version>
</cmdp:ReleaseStatus>
<cmdp:NationalProjects>
<cmdp:Project>
<cmdp:name>CLARIN-DLU</cmdp:name>
<cmdp:title>CLARIN in Flanders</cmdp:title>
<cmdp:id/>
<cmdp:funder>EWI</cmdp:funder>
<cmdp:url/>
<cmdp:Contact>
<cmdp:Person>Ineke Schuurman</cmdp:Person>
<cmdp:Role>Coordinator</cmdp:Role>
<cmdp:Address>Leuven, Belgium</cmdp:Address>
<cmdp:Email>Ineke.Schuurman@ccl.kuleuven.be</cmdp:Email>
<cmdp:Department>CCL</cmdp:Department>
<cmdp:Organisation>KU Leuven</cmdp:Organisation>
</cmdp:Contact>
<cmdp:Duration>
<cmdp:StartYear>2010</cmdp:StartYear>
<cmdp:CompletionYear>2012</cmdp:CompletionYear>
</cmdp:Duration>
</cmdp:Project>
</cmdp:NationalProjects>
<cmdp:Country>
<cmdp:CountryName>Belgium</cmdp:CountryName>
<cmdp:CountryCoding>BE</cmdp:CountryCoding>
</cmdp:Country>
<cmdp:Description>
<cmdp:Description>
GrETEL is a query engine in which linguists can use a natural language example as a starting point for searching a treebank with limited knowledge
about tree representations and formal query languages. Instead of a formal search instruction, it takes a natural language example as input.
This provides a convenient way for novice and non-technical users to use treebanks with a limited knowledge of the underlying syntax and
formal query languages. By allowing linguists to search for constructions similar to the example they provide, it aims to bridge the gap
between descriptive-theoretical and computational linguistics.
The example-based query procedure consists of several steps.
In the first step the user enters an example of the construction he/she is interested in.
In the second step the example is returned in the form of a matrix, in which the user specifies which aspects of this example are essential
for the construction under investigation.
The third step provides an overview of the search instruction, i.e. the subpart of the parse tree that contains the elements relevant for the construction
under investigation. This query tree is automatically converted in an XPath query which can be used for the actual treebank search.
This query can be edited if desired. In the fourth step the query is executed on the selected corpus.
The matching constructions are presented to the user as a list of sentences, which can be downloaded.
The user can also click on the sentences in order to visualize the results as syntax trees.
GrETEL enables search in the LASSY-SMALL and the CGN (Spoken Dutch Corpus) Treebanks (1 million tokens each). GrETEL was created by CLARIN Dutch Language Union in Flanders in the context of the CLARIN-NL / CLARIN Flanders cooperation project.
</cmdp:Description>
</cmdp:Description>
</cmdp:GeneralInfo>
<cmdp:SoftwareFunction>
<cmdp:toolCategory>written language tool</cmdp:toolCategory>
<cmdp:ToolTasks>
<cmdp:toolTask>corpus searching</cmdp:toolTask>
<cmdp:toolTask>corpus exploration</cmdp:toolTask>
<cmdp:toolTask>querying</cmdp:toolTask>
</cmdp:ToolTasks>
<cmdp:ResearchPhases>
<cmdp:ResearchPhase>Browsing and Searching</cmdp:ResearchPhase>
</cmdp:ResearchPhases>
<cmdp:ResearchDomains>
<cmdp:researchDomain>Linguistics</cmdp:researchDomain>
</cmdp:ResearchDomains>
<cmdp:LinguisticsSubject>
<cmdp:linguisticsSubject>syntax</cmdp:linguisticsSubject>
<cmdp:Description>
<cmdp:Description/>
</cmdp:Description>
</cmdp:LinguisticsSubject>
<cmdp:LinguisticsSubject>
<cmdp:linguisticsSubject>morpho-syntax</cmdp:linguisticsSubject>
<cmdp:Description>
<cmdp:Description/>
</cmdp:Description>
</cmdp:LinguisticsSubject>
<cmdp:LinguisticsSubject>
<cmdp:linguisticsSubject>computational linguistics</cmdp:linguisticsSubject>
<cmdp:Description>
<cmdp:Description/>
</cmdp:Description>
</cmdp:LinguisticsSubject>
<cmdp:LanguageVariety>
<cmdp:languageDependent>yes</cmdp:languageDependent>
<cmdp:Language>
<cmdp:LanguageName>Dutch</cmdp:LanguageName>
<cmdp:ISO639>
<cmdp:iso-639-3-code>nld</cmdp:iso-639-3-code>
</cmdp:ISO639>
</cmdp:Language>
<cmdp:Centuries>
<cmdp:centuryDependent>yes</cmdp:centuryDependent>
<cmdp:CenturyInterval>
<cmdp:centuryFrom>20</cmdp:centuryFrom>
<cmdp:centuryThrough>20</cmdp:centuryThrough>
</cmdp:CenturyInterval>
</cmdp:Centuries>
</cmdp:LanguageVariety>
</cmdp:SoftwareFunction>
<cmdp:SoftwareImplementation>
<cmdp:distributionMedium>Online available</cmdp:distributionMedium>
<cmdp:UserInterface>
<cmdp:interfaceType>graphical user interface</cmdp:interfaceType>
<cmdp:applicationType>web application</cmdp:applicationType>
</cmdp:UserInterface>
<cmdp:Input>
<cmdp:inputType>text</cmdp:inputType>
</cmdp:Input>
<cmdp:Output>
<cmdp:outputType>text</cmdp:outputType>
<cmdp:characterEncoding>UTF8</cmdp:characterEncoding>
<cmdp:outputResource>query results</cmdp:outputResource>
<cmdp:Schema>
<cmdp:schemaname/>
</cmdp:Schema>
<cmdp:MimeType>
<cmdp:MimeType>text/html</cmdp:MimeType>
</cmdp:MimeType>
</cmdp:Output>
<cmdp:Output>
<cmdp:outputType>text</cmdp:outputType>
<cmdp:characterEncoding>UTF8</cmdp:characterEncoding>
<cmdp:outputResource>downloaded query results</cmdp:outputResource>
<cmdp:Schema>
<cmdp:schemaname>CSV, tab-separated</cmdp:schemaname>
</cmdp:Schema>
<cmdp:MimeType>
<cmdp:MimeType>text/csv</cmdp:MimeType>
</cmdp:MimeType>
</cmdp:Output>
</cmdp:SoftwareImplementation>
<cmdp:Access>
<cmdp:ResourceLicense>
<cmdp:license>CC-BY-NC-SA</cmdp:license>
<cmdp:version>4.0</cmdp:version>
<cmdp:distributionType>public</cmdp:distributionType>
<cmdp:Price>
<cmdp:amount>0</cmdp:amount>
<cmdp:ISO4217>
<cmdp:iso-4217-currency>EUR</cmdp:iso-4217-currency>
</cmdp:ISO4217>
</cmdp:Price>
</cmdp:ResourceLicense>
<cmdp:Contact>
<cmdp:Person>Liesbeth Augustinus</cmdp:Person>
<cmdp:Email>liesbeth.augustinus@kuleuven.be</cmdp:Email>
<cmdp:Organisation xml:lang="nld">KU Leuven</cmdp:Organisation>
</cmdp:Contact>
</cmdp:Access>
<cmdp:ResourceDocumentation>
<cmdp:Documentation>
<cmdp:title>GrETEL manual and documentation</cmdp:title>
<cmdp:documentationTarget>user</cmdp:documentationTarget>
<cmdp:url>http://nederbooms.ccl.kuleuven.be/eng/docgretel</cmdp:url>
<cmdp:ISO639>
<cmdp:iso-639-3-code>eng</cmdp:iso-639-3-code>
</cmdp:ISO639>
</cmdp:Documentation>
<cmdp:Publication>
<cmdp:publicationCategory>in proceedings</cmdp:publicationCategory>
<cmdp:publicationPurpose>scientific background</cmdp:publicationPurpose>
<cmdp:peerReviewStatus>yes</cmdp:peerReviewStatus>
<cmdp:Description>
<cmdp:Description LanguageID="eng">Liesbeth Augustinus, Vincent Vandeghinste, and Frank Van Eynde (2012). "Example-Based Treebank Querying" In: Proceedings of the 8th International Conference on Language Resources and Evaluation (LREC-2012). Istanbul, Turkey. pp. 3161-3167</cmdp:Description>
</cmdp:Description>
</cmdp:Publication>
<cmdp:Publication>
<cmdp:publicationCategory>in book</cmdp:publicationCategory>
<cmdp:publicationPurpose>scientific background</cmdp:publicationPurpose>
<cmdp:peerReviewStatus>yes</cmdp:peerReviewStatus>
<cmdp:Description>
<cmdp:Description LanguageID="eng">Augustinus, L, Vandeghinste, V, Schuurman, I and Van Eynde, F. 2017. GrETEL: A Tool for Example-Based Treebank Mining. In: Odijk, J and van Hessen, A. (eds.) CLARIN in the Low Countries, Pp. 269â280. London: Ubiquity Press. DOI: https://doi.org/10.5334/bbi.22. License: CC-BY 4.0</cmdp:Description>
</cmdp:Description>
</cmdp:Publication>
<cmdp:Publication>
<cmdp:publicationCategory>misc</cmdp:publicationCategory>
<cmdp:publicationPurpose>scientific background</cmdp:publicationPurpose>
<cmdp:peerReviewStatus>yes</cmdp:peerReviewStatus>
<cmdp:Description>
<cmdp:Description LanguageID="eng">http://gretel.ccl.kuleuven.be/project/publications.php</cmdp:Description>
</cmdp:Description>
</cmdp:Publication>
<cmdp:Pictures>
<cmdp:picture height="600" type="other" width="600">
http://dev.clarin.nl/sites/default/files/gretel.png
</cmdp:picture>
</cmdp:Pictures>
</cmdp:ResourceDocumentation>
<cmdp:SoftwareDevelopment>
<cmdp:Project>
<cmdp:name>Nederbooms</cmdp:name>
<cmdp:title>Exploitation of Dutch treebank for linguistic research</cmdp:title>
<cmdp:funder>EWI, Flanders</cmdp:funder>
<cmdp:url>nederbooms.ccl.kuleuven.be/eng/projects#nb</cmdp:url>
<cmdp:Contact>
<cmdp:Email>liesbeth.augustinus@kuleuven.be</cmdp:Email>
</cmdp:Contact>
<cmdp:Duration>
<cmdp:StartYear>2010-10</cmdp:StartYear>
<cmdp:CompletionYear>2012-02</cmdp:CompletionYear>
</cmdp:Duration>
</cmdp:Project>
<cmdp:Creator>
<cmdp:Contact>
<cmdp:Person>Liesbeth Augustinus</cmdp:Person>
<cmdp:Email>liesbeth.augustinus@kuleuven.be</cmdp:Email>
</cmdp:Contact>
</cmdp:Creator>
</cmdp:SoftwareDevelopment>
<cmdp:TechnicalInfo>
<cmdp:ImplementationLanguage>
<cmdp:implementationLanguage>Perl</cmdp:implementationLanguage>
<cmdp:version>5.10</cmdp:version>
</cmdp:ImplementationLanguage>
<cmdp:ImplementationLanguage>
<cmdp:implementationLanguage>PHP</cmdp:implementationLanguage>
<cmdp:version>5.3</cmdp:version>
</cmdp:ImplementationLanguage>
</cmdp:TechnicalInfo>
</cmdp:ClarinSoftwareDescription>
</cmd:Components>
</cmd:CMD>
Organisation:
- KU Leuven