This shows you the differences between two versions of the page.
Both sides previous revision Previous revision Next revision | Previous revision | ||
spokes_documentation [2017/02/11 17:32] pezik [Spokes PL] |
spokes_documentation [2017/05/29 09:07] (current) pezik |
||
---|---|---|---|
Line 6: | Line 6: | ||
- | Please make sure you cite Spokes properly when using it for your work or research: [[http://www.ep.liu.se/ecp/article.asp?issue=116&volume=&article=009| | + | Please make sure you cite Spokes properly: |
- | : Pęzik, Piotr. “Spokes – a Search and Exploration Service for Conversational Corpus Data.” In Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014, Soesterberg, The Netherlands, 99–109. Linköping Electronic Conference Proceedings. Linköping University Electronic Press, Linköpings universitet, 2015. | + | |
- | ]] . | + | [[http://www.ep.liu.se/ecp/article.asp?issue=116&volume=&article=009| |
+ | Pęzik, Piotr. “Spokes – a Search and Exploration Service for Conversational Corpus Data.” In Selected Papers from the CLARIN 2014 Conference, October 24-25, 2014, Soesterberg, The Netherlands, 99–109. Linköping Electronic Conference Proceedings. Linköping University Electronic Press, Linköpings universitet, 2015]]. | ||
+ | |||
+ | Here is a BibTeX record: | ||
+ | |||
+ | <code> | ||
+ | @inproceedings{pezik_spokes_2015, | ||
+ | series = {Linköping {Electronic} {Conference} {Proceedings}}, | ||
+ | title = {Spokes – a search and exploration service for conversational corpus data}, | ||
+ | copyright = {CC-BY-NC}, | ||
+ | isbn = {978-91-7685-954-4}, | ||
+ | url = {http://www.ep.liu.se/ecp_article/index.en.aspx?issue=116;article=009}, | ||
+ | abstract = {Spokes is an online service for conversational corpus data search and exploration, currently developed as part of CLARIN-PL – the Polish CLARIN infrastructure. This paper describes the data sets currently available through Spokes, the architecture of the service and the data and metadata search functionality it provides to its users. We also introduce some of the more experimental features which have been developed to facilitate more advanced research on multimodal conversational corpora.}, | ||
+ | booktitle = {Selected {Papers} from {CLARIN} 2014}, | ||
+ | publisher = {Linköping University Electronic Press, Linköpings universitet}, | ||
+ | author = {Pęzik, Piotr}, | ||
+ | year = {2015}, | ||
+ | pages = {99--109} | ||
+ | } | ||
+ | </code> | ||
=====SlopeQ syntax===== | =====SlopeQ syntax===== | ||
Line 639: | Line 658: | ||
=====REST API===== | =====REST API===== | ||
- | The REST API of Spokes makes it possible to search and extract | + | The REST API of Spokes PL makes it possible to search and extract the entire contents of the corpus. It's [[http://clarin.pelcra.pl/apidocs/spokes|documented here]]. The only missing bit there is the extraction of the media associated with the transcripts, which is why we illustrate it here: |
+ | - Here is [[http://pelcra.clarin-pl.eu/api/corpus/spokes/texts/MqBX|how you get the metadata of a conversation]]. | ||
+ | - If the transcript is time-aligned (see the relevant field in the response), you should be able to [[http://pelcra.clarin-pl.eu/api/corpus/spokes/texts/MqBX/utterances|download the "utterances" with their ids and time offsets]]. | ||
+ | - Finally, [[http://pelcra.clarin-pl.eu/api/corpus/spokes/media/text/range?text_id=MqBX&start=14311&stop=16241|you can get the audio snippet corresponding to the utterance]] using the time offsets for utterances provided in the previous response. |