This action may take several minutes for large corpora. Please wait.
Word list options
Corpus:
KIP
KIPTO
ParlaTO
Susanne
Subcorpus:
None (whole corpus)
Comunque>25
DAI
Ecco-lists
Per dire
centro
dai
esami
giovani
nord
sud
tipo
to
info
create new
Search attribute:
word
doc.doc_number
doc.full_conversation
doc.full_conversation_jefferson
conversation.type
conversation.point
conversation.participants_number
conversation.year
conversation.participants_relationship
conversation.moderator
conversation.topic
annotation.audio_file
annotation.participant_occupation
annotation.participant_sex
annotation.files_in_which_participant_appears
annotation.participant_school_region
annotation.participant_age_range
annotation.speaker_id
use n-grams
Value of n: from [2, 3, 4, 5, 6] to [2, 3, 4, 5, 6]
hide/nest sub-n-grams
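To make the "Value of n: from ... to ..." range concrete, here is a minimal sketch of counting n-grams over a token list for every n in the chosen range. The function name `ngram_counts` and its parameters are hypothetical, and the sketch does not attempt the hide/nest sub-n-grams behaviour; it only illustrates what a range of n means, not how the tool itself computes the list.

```python
from collections import Counter


def ngram_counts(tokens, n_from=2, n_to=3):
    """Count all contiguous n-grams for every n between n_from and n_to (inclusive).

    Hypothetical helper for illustration only.
    """
    if not 1 <= n_from <= n_to:
        raise ValueError("expected 1 <= n_from <= n_to")
    counts = Counter()
    for n in range(n_from, n_to + 1):
        # Slide a window of width n across the token sequence.
        for start in range(len(tokens) - n + 1):
            counts[tuple(tokens[start:start + n])] += 1
    return counts


if __name__ == "__main__":
    for gram, freq in ngram_counts("a b a b".split(), 2, 3).most_common():
        print(" ".join(gram), freq)
```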
Filter options:
Filter word list by:
Regular expression:
Minimum frequency:
Maximum frequency:
(0 = no maximum frequency)
Whitelist:
Blacklist:
format
Word list whitelists and blacklists must be plain text (.txt) files, encoded in UTF-8, with one item per line. The items must correspond to the selected attribute: for example, if 'lemma' is selected from the attribute menu, the list should be a list of lemmas. File input uses exact matching, not regular-expression matching.
Include non-words
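The filter options above can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the tool's implementation: `filter_word_list` and `load_list` are hypothetical names, and the regular expression is assumed to match the whole item. The parts taken from the form are that a maximum frequency of 0 means no maximum, and that whitelist and blacklist files are UTF-8 text with one item per line, compared by exact match.

```python
import re


def load_list(path):
    """Read a whitelist/blacklist file: UTF-8 plain text, one item per line."""
    with open(path, encoding="utf-8") as fh:
        return {line.strip() for line in fh if line.strip()}


def filter_word_list(counts, pattern=None, min_freq=0, max_freq=0,
                     whitelist=None, blacklist=None):
    """Filter a {item: frequency} mapping (hypothetical helper)."""
    regex = re.compile(pattern) if pattern else None
    kept = {}
    for item, freq in counts.items():
        # Assumption: the regular expression must match the whole item.
        if regex is not None and not regex.fullmatch(item):
            continue
        if freq < min_freq:
            continue
        # max_freq == 0 means "no maximum frequency".
        if max_freq and freq > max_freq:
            continue
        # List entries are compared by exact match, never as patterns.
        if whitelist is not None and item not in whitelist:
            continue
        if blacklist is not None and item in blacklist:
            continue
        kept[item] = freq
    return kept
```

Note that a blacklist entry such as `da.*` would only exclude the literal string `da.*`, because list files are matched exactly.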
Output options:
Frequency figures:
Hit counts
Document counts
ARF
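For readers unfamiliar with the third option: ARF is usually read as average reduced frequency, a frequency measure that discounts occurrences clustered in one part of the corpus. The sketch below follows the commonly cited definition (the sum of min(d_i, v) over all occurrences, divided by v, where v is the corpus size divided by the frequency and d_i is the cyclic distance to the previous occurrence). The function name `arf` is hypothetical, and whether the tool computes the figure exactly this way is an assumption.

```python
def arf(positions, corpus_size):
    """Average reduced frequency from token positions (0-based) in a corpus.

    Hypothetical helper; follows the commonly cited ARF definition.
    """
    f = len(positions)
    if f == 0:
        return 0.0
    pos = sorted(positions)
    v = corpus_size / f  # average distance between occurrences
    # Distance from each occurrence to the previous one; the first
    # occurrence wraps around to the last (the corpus is treated as a cycle).
    distances = [pos[0] + corpus_size - pos[-1]]
    distances += [b - a for a, b in zip(pos, pos[1:])]
    return sum(min(d, v) for d in distances) / v


if __name__ == "__main__":
    print(arf(range(0, 100, 10), 100))   # evenly spread occurrences
    print(arf(list(range(10)), 100))     # ten occurrences in one cluster
```

With ten evenly spread occurrences the value equals the plain hit count, while ten adjacent occurrences yield a value close to 2, which is why ARF is useful for spotting items concentrated in a few documents.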
Output type:
Simple
Keywords
Reference (sub)corpus
KIP
KIPTO
ParlaTO
Susanne
(whole corpus)
the rest of the corpus
Comunque>25
DAI
Ecco-lists
Per dire
centro
dai
esami
giovani
nord
sud
tipo
to
Prefer:
rare words
common words
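The "Prefer: rare words / common words" setting can be understood through a smoothed frequency ratio between the focus (sub)corpus and the reference (sub)corpus. The sketch below uses one widely described scoring scheme, often called "simple maths": normalised frequencies per million with a constant added to both sides. That this form uses exactly this score is an assumption, and `keyness` and `smoothing` are hypothetical names.

```python
def keyness(freq_focus, size_focus, freq_ref, size_ref, smoothing=1.0):
    """Smoothed ratio of per-million frequencies (hypothetical helper).

    A small smoothing value favours rare words; a large one favours
    common words.
    """
    fpm_focus = freq_focus * 1_000_000 / size_focus
    fpm_ref = freq_ref * 1_000_000 / size_ref
    return (fpm_focus + smoothing) / (fpm_ref + smoothing)


if __name__ == "__main__":
    # A rare item (10 vs 0 per million) and a common item (1000 vs 500).
    for n in (1.0, 1000.0):
        rare = keyness(10, 1_000_000, 0, 1_000_000, n)
        common = keyness(1000, 1_000_000, 500, 1_000_000, n)
        print(n, round(rare, 3), round(common, 3))
```

With a smoothing value of 1 the rare item scores higher; with 1000 the common item does, which mirrors the two ends of the preference setting.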
Change output attribute(s)
---
word
You can select one or more output attributes. Please note that this option can be time-consuming.