Skip to contents

Takes at least two separate tables of n-grams and frequencies (either output of `fst_freq_table()` or `fst_ngrams_table()`) and finds n-grams unique to one table.

Usage

fst_get_unique_ngrams_separate(table1, table2, ...)

Arguments

table1

The first n-grams table.

table2

The second n-grams table.

...

Any other n-grams tables you want to include.

Value

Dataframe of words and whether word is unique or not.

Examples

top_child <- fst_freq_table(fst_child)
#> Note:
#>  Words with equal occurrence are presented in alphabetical order. 
#>  By default, words are presented in order to the `number` cutoff word. 
#>  This means that equally-occurring later-alphabetically words beyond the cutoff word will not be displayed.
#> 
top_dev <- fst_freq_table(fst_dev_coop)
#> Note:
#>  Words with equal occurrence are presented in alphabetical order. 
#>  By default, words are presented in order to the `number` cutoff word. 
#>  This means that equally-occurring later-alphabetically words beyond the cutoff word will not be displayed.
#> 
fst_get_unique_ngrams_separate(top_child, top_dev)
#> # A tibble: 20 × 2
#>    words          word_type
#>    <fct>          <chr>    
#>  1 jokin          Unique   
#>  2 tulla          Unique   
#>  3 tietää         Unique   
#>  4 sanoa          Unique   
#>  5 tehdä          Unique   
#>  6 paha           Unique   
#>  7 joku           Unique   
#>  8 lyöminen       Unique   
#>  9 lyödä          Unique   
#> 10 toinen         Unique   
#> 11 nälkä          Unique   
#> 12 ahneus         Unique   
#> 13 epätasa-arvo   Unique   
#> 14 vesi           Unique   
#> 15 ihminen        Unique   
#> 16 puute          Unique   
#> 17 ilmastonmuutos Unique   
#> 18 sota           Unique   
#> 19 nälänhätä      Unique   
#> 20 köyhyys        Unique