Report for openai-community/gpt2

Model info

  • Model Info:
    • Tied embeddings: True
    • LM head uses bias: False
    • Embeddings shape: [50257, 768]
  • Tokenizer Info:
    • Vocab Size: 50257
    • Tokenizer Class: GPT2Tokenizer
    • Bytes handling: Byte Input
    • Tokenizer Type: BPE
    • Token for verification prompt building: BuyableInstoreAndOnline
    • Token id for verification prompt building: 40242
  • Indicator summary:
    • Indicator for under-trained tokens: E_{out} Cosine Distance
    • Overall distribution: 0.536 +/- 0.070
  • Detected Token Counts:
    • Number of tested under-trained tokens: 999, 967 non-special, 33 below p = 0.01 threshold, 26 below soft indicator threshold
    • Number of single byte tokens: 256, of which 46 below indicator threshold
    • Number of special tokens: 0, of which 0 below indicator threshold
    • Number of non-single-byte UTF-fragment tokens: 216, of which 2 below soft indicator threshold
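
The indicator named above, E_{out} cosine distance, can be illustrated on toy data. This is a minimal pure-Python sketch. It assumes the indicator is the cosine distance between each output-embedding row and a reference vector, taken here as the mean of rows for tokens assumed to be unused. The helper names, the toy matrix, and the choice of reference are hypothetical and are not taken from the tooling that produced this report.

```python
import math

def cosine_distance(u, v):
    """Return 1 - cos(u, v) for two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return 1.0 - dot / (norm_u * norm_v)

def mean_vector(rows):
    """Column-wise mean of a list of equal-length vectors."""
    n = len(rows)
    return [sum(col) / n for col in zip(*rows)]

# Toy 4-token, 3-dimensional "output embedding" matrix (hypothetical values).
toy_embeddings = [
    [1.0, 0.0, 0.0],    # token 0: assumed unused
    [0.9, 0.1, 0.0],    # token 1: assumed unused
    [0.95, 0.05, 0.0],  # token 2: candidate, close to the unused direction
    [0.0, 1.0, 0.5],    # token 3: candidate, far from the unused direction
]

# Assumption: the reference is the mean of the rows assumed to be unused.
reference = mean_vector(toy_embeddings[:2])
indicators = [cosine_distance(row, reference) for row in toy_embeddings]

# 0.206 is the threshold quoted in this report; it is applied here to toy data only.
THRESHOLD = 0.206
flagged = [i for i, d in enumerate(indicators) if d < THRESHOLD]
print(flagged)
```

A small distance means the row points in nearly the same direction as the reference, which is the sense in which a low indicator marks a token as a candidate for being under-trained.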

Under-trained token indicators plot

[Figure: Indicators scatter plots]

Verification plot

[Figure: Verification plot]

Under-trained token verification results

26 entries below threshold of 0.206

token_id token indicator max_prob in_other_tokens
40241 InstoreAndOnline 0.00143921 8.6e-09 BuyableInstoreAndOnline
30905 rawdownload 0.00145763 1.4e-07 rawdownloadcloneembedreportprint
39752 quickShip 0.00147873 9.1e-09 quickShipAvailable
40240 oreAndOnline 0.00148576 9.1e-08 InstoreAndOnline, BuyableInstoreAndOnline
30898 embedreportprint 0.00153571 1e-07 cloneembedreportprint, rawdownloadcloneembedreportprint
45544 ▁サーティ 0.00154209 1.4e-07 ▁サーティワン
36173 ▁RandomRedditor 0.00156629 1.5e-07 ▁RandomRedditorWithNo
30212 ▁externalToEVA 0.00158066 1.7e-07 ▁externalToEVAOnly
42089 ▁TheNitrome 0.00158507 8.5e-09 ▁TheNitromeFan
30897 reportprint 0.00160545 3.3e-08 embedreportprint, cloneembedreportprint, rawdownloadcloneembedreportprint
30208 ▁externalTo 0.00763935 1e-06 ▁externalToEVA, ▁externalToEVAOnly
23090 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ 0.022287 3.5e-07 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
37574 StreamerBot 0.036516 2.8e-07 TPPStreamerBot
31573 ActionCode 0.0397143 5.4e-07 externalActionCode
14827 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ 0.0400726 6.9e-06 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
42066 Nitrome 0.135858 1.7e-05 ▁TheNitrome, ▁TheNitromeFan
9364 ÃÂÃÂÃÂÃÂ 0.139732 0.0014 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
39749 DeliveryDate 0.163166 0.0053 soDeliveryDate
39142 ThumbnailImage 0.163535 0.00014 ItemThumbnailImage
39714 isSpecial 0.169225 0.0019 isSpecialOrderable
6 additional entries below threshold
token_id token indicator max_prob in_other_tokens
40219 oreAnd 0.172416 1.7e-05 oreAndOnline, InstoreAndOnline, BuyableInstoreAndOnline
30899 cloneembedreportprint 0.176264 0.00011 rawdownloadcloneembedreportprint
13150 ▁subur 0.180166 0.00016 ▁suburban, ▁suburbs, ▁suburb
5815 ÃÂÃÂ 0.182786 0.043 ÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
17629 ▁practition 0.183995 0.0008 ▁practitioners, ▁practitioner
39655 Orderable 0.199313 0.0036 isSpecialOrderable
941 additional entries above threshold
token_id token indicator max_prob in_other_tokens
15272 ▁pione 0.206062 5.7e-05 ▁pioneer, ▁pioneering, ▁pioneers, ▁pioneered
27293 ▁antidepress 0.219108 0.012 ▁antidepressants, ▁antidepressant
27013 aditional 0.231961 0.00026 ▁Traditional, traditional, Traditional
30439 ▁unintention 0.247503 0.00035 ▁unintentionally, ▁unintentional
25618 ▁councill 0.259484 0.0011 ▁councillor, ▁councillors
7105 ▁volunte 0.272465 0.01 ▁volunteers, ▁volunteer, ▁volunteered, ▁volunteering
4690 ortunately 0.28492 0.0011 fortunately, ▁Unfortunately, ▁unfortunately, Unfortunately, ▁Fortunately, ...
24973 ▁exting 0.285715 0.057 ▁extingu, ▁extinguished
19476 ▁carbohyd 0.29039 0.11 ▁carbohydrate, ▁carbohydrates
18945 ▁teasp 0.304984 0.14 ▁teaspoon, ▁teaspoons
13198 ▁earthqu 0.305768 0.027 ▁earthquake, ▁earthquakes
42202 GoldMagikarp 0.31455 0.98 ▁SolidGoldMagikarp
11548 ▁entreprene 0.318222 0.0076 ▁entrepreneurs, ▁entrepreneur, ▁entrepreneurial, ▁entrepreneurship
14695 ▁eleph 0.319038 0.28 ▁elephant, ▁elephants
39693 Buyable 0.323047 0.77 BuyableInstoreAndOnline
42889 ikuman 0.33741 0.021 ▁Kinnikuman
48396 ÛÛ 0.337924 0.91
44392 ▁cumbers 0.338503 0.04 ▁cumbersome
14341 PDATE 0.341838 0.073 UPDATE, ▁UPDATE, PDATED
35496 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ 0.342343 0.98
31666 ?????-?????- 0.343427 0.8
5808 ÃÂ 0.345483 0.93 ÃÂÃÂ, ÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
22315 ▁newcom 0.354794 0.033 ▁newcomers, ▁newcomer
23711 ▁Moroc 0.357426 0.059 ▁Morocco, ▁Moroccan
8994 ailability 0.35836 0.093 ▁availability, Availability, channelAvailability, ▁Availability, availability
40817 Honestly 0.35894 0.95
24307 ▁looph 0.360073 0.053 ▁loophole, ▁loopholes
41215 conservancy 0.360696 0.92 natureconservancy
46399 Putting 0.36074 0.99
44850 Ironically 0.360898 0.66
25658 ?????- 0.362028 0.69 ?????-?????-
20554 ▁unbeliev 0.366103 0.0096 ▁unbelievable, ▁unbelievably
11689 ▁unnecess 0.366404 0.17 ▁unnecessary, ▁unnecessarily
27924 ▁srf 0.367115 0.55 ▁srfN, ▁srfAttach
22640 itially 0.367276 0.11 ▁Initially, Initially
48142 Assuming 0.369249 0.97
30513 Shortly 0.369461 0.65
43298 userc 0.370259 0.88 usercontent
33813 =~=~ 0.370398 0.97
32917 aution 0.370782 0.37 ▁precaution, ▁cautioned
43569 ÍÍ 0.371278 0.92
27927 Nearly 0.372061 0.92
16303 ▁undermin 0.372135 0.25 ▁undermine, ▁undermining, ▁undermined, ▁undermines
49321 Typically 0.37348 0.81
41156 Depending 0.3738 0.9
33092 Interestingly 0.374256 0.74
36174 ▁RandomRedditorWithNo 0.374315 0.59
48908 ▁4090 0.375049 1
43258 Nonetheless 0.375779 0.54
36725 Sadly 0.377924 0.94
43541 Amid 0.377997 0.96
40443 Initially 0.378373 0.76
12677 ▁tradem 0.378841 0.095 ▁trademark, ▁trademarks
40475 Considering 0.379018 0.99
28235 aeper 0.379227 0.24 aepernick, ▁Kaepernick
50030 ▁glared 0.37924 0.92
4183 ▁conflic 0.379625 0.058 ▁conflict, ▁conflicts, ▁conflicting, ▁conflicted
45228 uliffe 0.380092 0.24 ▁McAuliffe
20670 Obviously 0.380222 0.73
49543 ▁inhibits 0.380854 0.75
36651 ▁Somehow 0.380955 0.92
27894 Regardless 0.381237 0.85
30402 Apparently 0.381601 0.83
37731 ▁Ironically 0.383061 0.93
35992 WithNo 0.384026 0.62 ▁RandomRedditorWithNo
24982 ▁Consequently 0.384384 0.65
46418 ▁formulate 0.384643 0.95
50009 ▁strutConnector 0.384671 0.99
30638 Clearly 0.38488 0.9
38150 Hundreds 0.385048 0.89
26797 Throughout 0.38505 0.88
6598 ▁behavi 0.385081 0.37 ▁behaviour, ▁behaviors, ▁behavioral, ▁behaving, ▁behaviours, ...
43970 ▁Notwithstanding 0.385126 1
44217 Hmm 0.38603 0.96
45683 ▁Notably 0.38644 0.67
47578 ▁counteract 0.386773 0.82
44213 Naturally 0.386882 0.98
49856 627 0.387394 0.99
34718 rehensible 0.387654 0.58 ▁incomprehensible
29740 ▁Azerb 0.387717 0.085 ▁Azerbai, ▁Azerbaijan
34173 ▁Honestly 0.38779 0.77
47183 ▁Surprisingly 0.38784 0.75
46435 653 0.387993 0.99
13296 ▁Leban 0.38822 0.63 ▁Lebanon, ▁Lebanese
40067 ▁Ideally 0.388304 0.97
12869 ▁reluct 0.388419 0.22 ▁reluctant, ▁reluctance, ▁reluctantly
37058 Generally 0.388523 0.96
44669 Compared 0.388598 0.98
45385 674 0.388907 0.99
6336 ▁Palestin 0.389059 0.63 ▁Palestinian, ▁Palestinians, ▁Palestine
42548 676 0.389105 0.99
47106 439 0.389444 0.99 20439
44959 Quite 0.389908 0.96
48724 572 0.390064 0.99
48448 iosyn 0.390173 0.012 iosyncr, ▁idiosyncr
25044 ▁Interestingly 0.390359 0.67
43336 ▁291 0.390429 0.97
45872 Likewise 0.390493 0.37
46010 Huh 0.39083 0.95
39889 ▁chuckled 0.390899 0.63
32602 Aside 0.390969 0.79
33937 Vaults 0.390998 0.96
36720 372 0.391018 0.99 ▁372
24847 ModLoader 0.391086 0.93 ForgeModLoader
13171 VIDIA 0.391106 0.33 ▁NVIDIA, NVIDIA
33459 459 0.391122 0.99
3523 ▁citiz 0.391493 0.18 ▁citizens, ▁citizen, ▁citizenship
27824 367 0.391611 0.99 ▁367
36928 807 0.391641 1
45648 Knowing 0.391828 0.97
42000 ▁hemor 0.391834 0.14 ▁hemorrh
41974 accompan 0.391983 0.99 accompanied, ▁unaccompanied, ▁accompanies
27251 ▁Nonetheless 0.392029 0.51
44826 ▁385 0.392464 0.98
45214 694 0.392592 0.99
28877 Whenever 0.392652 0.99
24606 Moreover 0.392747 0.6
24795 Nobody 0.39277 0.99
24951 Furthermore 0.392842 0.57
39865 proclaimed 0.393023 0.97
34039 ▁Essentially 0.393099 0.91
48758 ▁396 0.393169 0.98
45082 ▁perpend 0.393224 0.23 ▁perpendicular
43864 672 0.393241 0.97
43193 652 0.393338 0.99
36336 ▁quantify 0.393502 0.94
30695 378 0.393547 1 ▁378
38569 758 0.39358 1
48379 Specifically 0.393715 0.69
7782 ▁occas 0.393793 0.21 ▁occasionally, ▁occasional, ▁occasions
37680 657 0.393804 0.99
34107 396 0.394078 0.99 ▁396
28172 Everybody 0.394122 0.99
44084 ▁348 0.394169 0.98
49051 593 0.394261 1
29886 ▁assail 0.39442 0.72 ▁assailant, ▁assailants
49473 ▁interrogated 0.394476 0.91
29011 Nevertheless 0.394491 0.37
36268 ▁grinned 0.394681 0.47
33372 397 0.394988 0.99
44326 ーテ 0.395033 0.14 ーティ, ▁サーティ, ▁サーティワン
18521 Unlike 0.395035 0.82
37710 391 0.39525 1
48065 ▁426 0.395292 0.98
34626 394 0.395293 0.99
29646 ▁gobl 0.395385 0.6 ▁goblin, ▁goblins
43240 798 0.395672 0.99
33238 ▁Assuming 0.395767 0.98
42983 ワン 0.395899 0.99 ▁サーティワン
8983 ▁satell 0.395953 0.31 ▁satellite, ▁satellites
48602 629 0.395969 0.99
49541 691 0.396014 0.98
49633 ▁389 0.396025 0.99
49020 ▁374 0.396068 0.97
39118 588 0.396095 0.99
39195 326 0.396142 0.99 ▁326
44125 ▁preclude 0.396318 0.92
40654 ▁284 0.396412 0.95
42382 Depths 0.396489 0.99
34301 Prosecut 0.39657 0.99 ▁Prosecutor, Prosecutors, ▁Prosecutors
48800 ▁Presumably 0.396773 0.54
9286 ▁exha 0.396782 0.065 ▁exhaust, ▁exhausted, ▁exhaustion, ▁exhaustive, ▁exhausting
31524 Basically 0.396791 0.64
47834 Yep 0.396833 0.97
49689 ▁387 0.396854 0.9
44169 589 0.396926 1
44578 464 0.396966 0.99
38056 371 0.397006 0.99 ▁371
27728 298 0.397036 0.99 ▁298
36001 Certainly 0.397224 0.84
46519 782 0.397379 0.99
26294 ioxid 0.397502 0.91 ▁antioxid, ▁antioxidant, ▁antioxidants
33023 hovah 0.39755 0.9 ▁Jehovah
15755 ▁millenn 0.397636 0.15 ▁millennials, ▁millennia, ▁millennium, ▁millennial
35667 362 0.397695 1
41810 ▁282 0.397708 0.97
41451 Isn 0.39773 1
47946 ▁373 0.397774 0.98
48564 681 0.397838 0.99
39351 authored 0.397929 0.85
10864 iverpool 0.397976 1 ▁Liverpool, Liverpool
43453 ▁SolidGoldMagikarp 0.398138 0.88
43452 599 0.398179 0.99
32437 ▁Smartstocks 0.398365 0.88
49934 548 0.398365 1
48102 ▁nutritious 0.398386 0.99
49225 ▁incite 0.398448 0.88
39768 ▁274 0.398469 0.95
28386 ▁Accordingly 0.398533 0.69
20213 ▁pestic 0.39855 0.24 ▁pesticides, ▁pesticide
48528 693 0.398559 0.99
49522 ▁exacerbate 0.398581 0.96
41103 ▁297 0.398632 0.98
45734 596 0.398957 1
42032 ▁283 0.399178 0.99
40523 689 0.399347 0.99
49814 ▁383 0.399463 1
46589 692 0.399476 0.99
43195 ▁impede 0.399542 0.77
39466 ▁279 0.399613 0.93
37381 695 0.399634 1
47343 ▁371 0.399726 0.98
42489 ▁339 0.399781 0.98
38040 ▁promul 0.399814 0.97 ▁promulg
43690 436 0.40001 1 ▁436
47674 ▁Particularly 0.400092 0.75
27212 Ultimately 0.400264 0.91
48156 792 0.400269 0.99
36119 querque 0.400274 0.45 buquerque, ▁Albuquerque
35890 489 0.400285 0.99
39937 ▁345 0.400389 1
50255 ▁gazed 0.400456 0.76
31496 337 0.400481 0.99 ▁337
46674 ▁counterproductive 0.400615 0.99
42819 ▁334 0.400711 0.98
23591 ▁Depending 0.400882 0.95
49669 ▁465 0.400962 0.99
39174 ▁278 0.400982 0.96
37001 ▁undermines 0.401224 0.98
30885 ▁alleviate 0.401226 0.9
32568 282 0.401249 0.99 ▁282
37991 @@@@@@@@ 0.4013 0.96
45151 725 0.401443 1
46250 671 0.401447 0.99
45473 ▁378 0.401466 0.99
47521 683 0.401482 0.99
44617 587 0.401504 0.99
40256 492 0.401537 0.99
33528 Investigators 0.401738 0.85 ▁Investigators
43155 ▁341 0.401822 0.97
39401 Prosecutors 0.401832 0.97 ▁Prosecutors
46618 ▁413 0.401893 0.99
40149 482 0.401897 0.99
39997 462 0.402016 1
46096 537 0.402108 1
39424 Regarding 0.40218 0.99
38219 756 0.402222 0.99
41125 ▁reinforces 0.402226 0.99
12943 ▁encount 0.402307 0.73 ▁encountered, ▁encounters, ▁encountering
43625 Normally 0.402448 0.88
48207 ▁392 0.402465 0.99
46767 ▁embroiled 0.402533 0.87
47325 699 0.402537 0.99
42141 ▁329 0.402653 0.96
38205 696 0.402919 1
48630 581 0.402954 1
41813 643 0.402992 0.99
43874 ▁perpetuate 0.403087 0.84
44341 ▁342 0.403098 0.98
39339 ▁eradicate 0.403098 0.8
46302 786 0.403104 0.99
43722 ▁331 0.403121 0.98
45730 ▁undertook 0.403133 0.91
42240 866 0.403163 1
47263 ▁ridiculed 0.403217 0.89
42294 ▁337 0.403222 0.97
37528 ▁258 0.403229 0.98
48655 ▁445 0.403242 1
49852 ▁reiterate 0.403292 0.9
37988 806 0.403307 1
28705 Authorities 0.403337 0.86 ▁Authorities
26443 ▁denounced 0.40336 0.81
5392 ▁conclud 0.403435 0.097 ▁concluded, ▁conclude, ▁concludes, ▁concluding
43239 597 0.40349 0.99
44218 554 0.403496 1
45937 ▁379 0.403511 0.98
21893 ▁sighed 0.403549 0.63
37859 paralleled 0.40358 0.99 ▁unparalleled
43467 Understanding 0.403615 1
33551 291 0.403718 0.99 ▁291
30956 ▁impover 0.403733 0.98 ▁impoverished
42470 TextColor 0.403741 1 ▁SetTextColor
16041 ▁referen 0.403752 0.53 ▁referenced, ▁referencing
8438 everal 0.403758 0.0093 ▁Several, Several
44282 ▁plummeted 0.403851 0.87
43134 493 0.403855 1
48246 748 0.403976 0.99
35402 706 0.403982 0.99
34801 705 0.404002 0.99
4060 vertisement 0.404033 0.026 Advertisement, vertisements, Advertisements, ▁advertisement, ▁advertisements, ...
48311 ilantro 0.404038 0.95
19306 ▁prosec 0.40407 0.28 ▁prosecuted, ▁prosecute, ▁prosecutions, ▁prosecuting
45165 organisms 0.404183 1
31256 ▁undermined 0.404192 0.97
40236 FINEST 0.404288 0.98
47915 561 0.404623 0.99 76561
30557 346 0.404754 0.99 ▁346
44351 ▁despise 0.404785 0.84 ▁despised
34287 648 0.404896 0.99
23937 Besides 0.404919 0.76
36347 ▁neuronal 0.404941 0.92
47498 approximately 0.404951 0.98
41235 ▁294 0.404965 0.97
45331 432 0.404971 1 ▁432
40223 disable 0.405021 0.99 disabled
37747 496 0.405029 0.99
30803 369 0.405039 0.98 ▁369
46239 583 0.405113 0.99
46438 594 0.405175 1
28039 Similarly 0.405188 0.64
31869 esteem 0.405251 1 ▁esteem, ▁esteemed
48252 ▁424 0.405264 0.98
43798 670 0.405486 0.99 ▁670
41580 684 0.405518 1
42877 70710 0.405554 1
42300 umenthal 0.405575 0.32 ▁Blumenthal
35809 668 0.405696 0.99
39925 687 0.405782 1
40281 Increased 0.405801 0.98
40501 Absolutely 0.405803 0.99
38431 658 0.40581 1
39111 654 0.405845 0.99
45071 ▁refute 0.405868 0.93 ▁refuted
49995 733 0.405891 0.99
26556 Taking 0.405916 0.98
30336 284 0.405968 0.99 ▁284
44856 ▁366 0.405969 0.9
49489 546 0.406012 1
39114 ▁Needless 0.406031 0.98
47744 ▁361 0.406046 0.99
38314 759 0.406088 1
41569 ▁292 0.406141 0.99
37967 329 0.406167 0.99 ▁329
41696 ▁insightful 0.406198 0.9
30057 261 0.406262 1 ▁261
46352 584 0.406297 1
48952 591 0.406341 1
13710 Perhaps 0.406443 0.91
39166 ▁261 0.406489 0.99
31442 Alright 0.406528 0.93 ▁Alright
27097 -+-+ 0.406553 0.89 -+-+-+-+
41734 579 0.406608 0.99
48132 ▁409 0.406629 0.99 ▁4090
39135 ▁263 0.406652 0.99
34137 484 0.406669 0.99
36481 ertodd 0.406692 0.93 ▁petertodd
45063 ▁428 0.4068 0.99
30995 347 0.40682 1 ▁347
40179 677 0.406825 1
36243 382 0.406861 1
35061 ositories 0.406874 0.3 ▁repositories
23216 Additionally 0.406921 0.81
32759 292 0.406978 1 ▁292
32365 Hopefully 0.406991 0.97
28872 241 0.407031 0.99 ▁241
43343 Damn 0.40713 0.96
37288 Often 0.407175 0.68
19373 ▁adolesc 0.407191 0.73 ▁adolescents, ▁adolescent, ▁adolescence
47101 434 0.407261 1
48891 029 0.407282 0.98
27693 289 0.407328 0.98 ▁289
31952 398 0.407396 0.99 ▁398
48578 ▁disingen 0.407404 0.97
45791 663 0.40747 1
30743 359 0.407579 0.99 ▁359
50148 956 0.407585 0.99
36387 intestinal 0.407631 0.98 ▁gastrointestinal
38652 474 0.407707 0.98
40097 ▁uphe 0.407711 0.97 ▁upheaval
45598 740 0.407714 1
42311 arnaev 0.407754 0.59 ▁Tsarnaev
44163 Alternatively 0.407806 0.71
47682 ,,,,,,,, 0.40782 1
47075 ▁restricts 0.407845 0.76
41455 entanyl 0.40791 0.99 ▁fentanyl
38865 FAULT 0.407953 0.98
42090 ▁TheNitromeFan 0.407987 0.92
16637 ▁undermine 0.408099 0.93 ▁undermined, ▁undermines
39506 442 0.408101 0.99
48105 ▁integrates 0.408114 0.94
40434 ▁saddened 0.408144 0.84
34511 ▁Conversely 0.408206 0.75
45236 ▁gasped 0.408207 0.89
47567 ▁353 0.408277 0.99
43375 unsigned 0.408292 0.99
48387 Thankfully 0.408326 0.32
35273 351 0.408344 0.99 ▁351
37730 452 0.408389 1
33660 341 0.408407 0.99 ▁341
40286 ▁309 0.408412 0.98
13947 itzerland 0.408416 0.82 ▁Switzerland
45675 Excellent 0.408441 1
49125 ▁423 0.408453 0.97
45839 592 0.408459 0.99
45261 ▁Numerous 0.408479 0.71
49150 736 0.408488 0.99
46633 ▁372 0.408562 0.97
42699 ▁facilitates 0.40864 0.87
46957 028 0.408645 0.99
32128 376 0.408665 0.99 ▁376
49327 ▁363 0.408726 0.98
10060 ithub 0.408746 0.6 github, ▁github, ▁Github
40652 461 0.408781 0.99
50165 783 0.408794 1
29613 otonin 0.408891 0.42 ▁serotonin
41544 795 0.408913 0.99
28360 ▁antioxid 0.409 0.92 ▁antioxidant, ▁antioxidants
44093 899 0.409022 0.99 ▁1899
45758 673 0.409045 1
31727 cffff 0.409047 0.63 cffffcc
43614 ▁induces 0.40907 0.98
43019 ▁368 0.409071 0.98
43494 ▁mobilize 0.409073 0.9 ▁mobilized
46900 574 0.409115 0.99
36794 ▁imposes 0.409175 0.93
45455 799 0.409197 1
49641 763 0.409209 1
33394 352 0.409263 1 ▁352
41586 ▁appalled 0.409379 0.83
42321 ▁395 0.409449 0.97
38703 ▁277 0.409489 0.98
46660 887 0.409523 0.99
33505 ▁succumb 0.409596 0.99 ▁succumbed
25895 idepress 0.409632 0.93 ▁antidepress, ▁antidepressants, ▁antidepressant
43855 ▁propel 0.409635 0.98 ▁propell
44426 ▁deems 0.409708 0.85
21109 pleasant 0.409729 1 ▁unpleasant, ▁pleasantly
21920 innamon 0.409742 0.97 ▁cinnamon, ▁Cinnamon
41723 ▁hesitated 0.409853 0.95
27213 ▁Sadly 0.409866 0.74
42530 estyles 0.409907 0.24 ▁lifestyles
46511 ▁advisable 0.40995 0.96
27846 ▁glanced 0.40995 0.83
29088 379 0.409966 0.99 ▁379
46343 untled 0.409977 0.75 ▁disgruntled
42338 Seriously 0.409983 0.95
38639 ▁denounce 0.41 0.97
48200 628 0.410007 0.99
44087 022 0.410033 0.99
26912 246 0.41008 0.99 ▁246
18356 ▁opio 0.41009 0.52 ▁opioid, ▁opioids
38776 ïve 0.410124 0.99 ▁naïve
41922 ▁296 0.410187 0.97
41289 491 0.410231 0.99
34804 ▁unavoid 0.41026 0.71 ▁unavoidable
27265 ▁Shortly 0.410287 0.7
48096 553 0.41035 1
7678 ▁overwhel 0.41037 0.7 ▁overwhelming, ▁overwhelmed, ▁overwhelmingly, ▁overwhelm
34951 ▁246 0.410484 0.98
38149 Configuration 0.410494 0.99
47905 ▁tirelessly 0.41057 0.96
43916 730 0.410575 0.99
43713 ▁leapt 0.410621 0.68
32583 708 0.410639 0.99
36445 659 0.410641 0.99
30340 ▁undermining 0.410697 0.99
43550 ▁388 0.410699 0.97
49148 ▁antioxidants 0.410729 0.96
45662 raltar 0.410748 0.75 ▁Gibraltar
28544 Increases 0.410776 1
45449 CLASSIFIED 0.410787 0.95 ▁UNCLASSIFIED
32321 392 0.410835 1 ▁392
28417 ▁Surely 0.41088 0.91
22428 acerb 0.410916 0.8 ▁exacerb, ▁exacerbated, ▁exacerbate
49881 ▁stimulates 0.41093 0.85
10298 senal 0.410941 0.01 ▁Arsenal, ▁arsenal, Arsenal
40064 435 0.410956 1 ▁435
27137 296 0.410998 1 ▁296
43147 710 0.411062 0.98
43352 ▁Hmm 0.411082 0.96
27019 277 0.411112 0.99 ▁277
34427 688 0.411141 0.99
49351 528 0.411236 0.99
14945 Several 0.411265 0.91
38073 497 0.41129 0.99
47589 ▁undeniably 0.411369 0.92
46600 ▁Adinida 0.411422 0.99
48638 573 0.411435 0.99
36657 669 0.411454 0.99
43643 Lots 0.411477 0.99
45210 ▁357 0.411517 0.99
47400 risome 0.411524 0.84 ▁worrisome
16782 ▁misunder 0.411567 0.35 ▁misunderstanding, ▁misunderstood, ▁misunderstand
17844 ▁insurg 0.41159 0.99 ▁insurgents, ▁insurgency, ▁insurgent
45345 ▁427 0.41159 0.99
46554 ▁apopt 0.411594 1
42524 ivariate 0.411713 0.99
46752 ▁354 0.411717 0.98
41483 ▁skyrocket 0.411736 0.96
27662 ▁Considering 0.411744 0.88
28042 Unless 0.411758 0.96
37831 ▁259 0.411862 0.96
36260 498 0.411863 0.99
30290 283 0.411961 1 ▁283
45959 ▁418 0.411966 0.99
40486 558 0.412044 1
48061 ▁embody 0.412055 0.96
32182 354 0.412089 0.99 ▁354
48531 526 0.412141 1
44367 ▁349 0.41215 0.98
28525 oliberal 0.412269 0.9 ▁neoliberal
27412 368 0.412379 0.99 ▁368
33524 anamo 0.412387 0.28 ▁Guantanamo
40385 ▁319 0.412435 0.96
38595 ▁324 0.412449 0.98
40856 ▁lessen 0.412466 0.89
40035 697 0.412536 1
16263 ▁Obviously 0.412551 0.81
38831 ▁322 0.412705 0.95
45601 ▁490 0.412728 0.99
34583 809 0.412738 1
39500 osponsors 0.412792 0.38 ▁Cosponsors
37364 ▁267 0.412808 0.99
7601 ▁proport 0.412823 0.66 ▁proportion, ▁proportions, ▁proportional
49287 883 0.412844 0.99
14698 Having 0.412911 0.96
29769 389 0.412916 0.99 ▁389
42322 Personally 0.412921 0.98
28268 racuse 0.412953 0.36 ▁Syracuse
40037 establish 0.413066 0.99 establishment
36678 ▁268 0.413132 0.97
48365 751 0.413154 0.99
30856 ▁Vaugh 0.413165 0.2 ▁Vaughn, ▁Vaughan
45881 ▁475 0.413166 0.98
35447 363 0.413169 1 ▁363
50080 431 0.413309 0.99
33207 WINDOWS 0.413317 1
32459 366 0.413376 1 ▁366
41948 557 0.413407 1
45041 ▁denouncing 0.413455 0.77
32220 387 0.41347 1 ▁387
44103 413 0.413547 1 ▁413
31697 331 0.413562 0.99 ▁331
43704 438 0.413579 1
44994 533 0.413579 1
25096 186 0.413592 0.99 ▁186, ▁1860, ▁1861, ▁1863, ▁1865, ...
44673 797 0.413689 0.99
42759 641 0.413713 0.99
41290 642 0.413722 1
34876 ortmund 0.413727 0.97 ▁Dortmund
48702 iosyncr 0.413737 0.04 ▁idiosyncr
29807 272 0.413739 0.99 ▁272
33153 ```` 0.413757 0.99
46839 ▁455 0.413821 0.98
47258 Callback 0.413827 1
31128 358 0.413842 0.99 ▁358
42622 ▁402 0.413879 0.99
48464 Especially 0.413927 0.92
48977 ▁joyful 0.413944 0.95
37736 ▁oxidative 0.41395 1
43489 445 0.414039 0.99 ▁445
48303 ▁cherish 0.414067 0.97
42199 466 0.414077 1
45620 ▁369 0.414106 0.92
47284 ▁outlandish 0.41417 0.6
28211 Someone 0.414178 0.95
43038 ▁Okawaru 0.414235 0.98
48581 949 0.414236 1
45975 Removed 0.414263 0.99
35914 omedical 0.414291 0.98 ▁biomedical
39890 jriwal 0.414316 0.16 ▁Kejriwal
29743 uania 0.414343 0.93 ▁Lithuania
36100 ▁257 0.414361 0.98
39088 525 0.414375 1 ▁525
48712 896 0.414407 1
44417 ▁351 0.414418 0.98
39357 698 0.414447 0.99 ▁698
36897 eredith 0.414515 0.79 ▁Meredith
27800 287 0.414544 0.99 ▁287
37737 ▁266 0.414588 0.97
49517 712 0.414596 1
44033 0.414601 0.99
46983 ▁contaminants 0.414617 0.99
38902 ▁289 0.414649 0.96
37084 ▁supremacists 0.414786 0.85
37466 656 0.414804 0.99 76561
11606 ategory 0.414829 0.97 ▁Category, category, Category
36623 Critics 0.414863 0.89
29966 ormons 0.41495 0.99 ▁Mormons
39177 ItemThumbnailImage 0.415003 0.99
47159 661 0.415004 1
38880 ▁altercation 0.415037 0.98
44230 885 0.415111 1
38905 585 0.415129 1
44870 ▁broaden 0.415136 0.94
24237 ▁mitigate 0.415139 0.91
46766 ▁spearheaded 0.415161 0.96
27988 276 0.415271 0.99 ▁276
38192 ▁unsettling 0.415286 0.74
47850 ▁evaluates 0.4153 0.95
43950 682 0.415353 1
43292 ▁347 0.415388 0.99
28018 urrencies 0.415397 0.95 ▁cryptocurrencies
35292 ▁soared 0.415438 0.96
46044 582 0.415443 0.99
48947 ▁THESE 0.415446 0.98
49542 522 0.415447 1
39861 ▁698 0.415449 1
34598 ▁231 0.415458 0.99
38618 Avoid 0.41546 0.96
43903 ptives 0.415479 0.77 ▁contraceptives, ▁captives
42780 426 0.415485 1 ▁426
46396 075 0.415492 0.99
47916 ▁innovate 0.415495 0.64
32901 Adding 0.415505 0.98
36626 381 0.415525 1
43011 mittedly 0.415539 0.18
48404 ruciating 0.415543 0.18 ▁excruciating
36314 Seeing 0.415577 0.95
37283 322 0.41564 0.99 ▁322
20627 becca 0.415662 0.94 ▁Rebecca
42332 Luckily 0.415733 0.38
42018 465 0.415739 1 ▁465
47785 519 0.415764 1
29889 ▁diminish 0.415834 0.8 ▁diminishing
33552 ▁exclaimed 0.415878 0.99
45979 ▁pious 0.41588 0.72
41172 785 0.415881 1
25674 267 0.415888 1 ▁267
32531 402 0.4159 1 ▁402
37411 Honest 0.415902 1 Honestly
24369 290 0.415943 1 ▁290
20621 akespe 0.415954 0.45 akespeare, ▁Shakespeare
42223 upuncture 0.416015 0.83 ▁acupuncture
47007 615 0.416019 1
37887 Usually 0.416043 0.79
50242 794 0.416162 0.99
18092 ernandez 0.416164 0.88 ▁Hernandez, ▁Fernandez
37750 790 0.416194 0.99
38380 463 0.416246 1
39449 494 0.416251 1
31380 334 0.416278 1 ▁334
27203 385 0.416289 1 ▁385
44550 753 0.416355 1
24581 ▁Throughout 0.416382 0.91
45900 ▁414 0.416455 0.99
20677 ▁comr 0.416465 0.22 ▁comrades, ▁comrade
32571 ▁Rohing 0.416492 0.88 ▁Rohingya
45223 fledged 0.416507 0.97
48058 ▁condone 0.416518 0.99
43665 752 0.416572 1
34206 #$#$ 0.416637 0.93
40401 789 0.416653 0.98
44086 oxicity 0.416705 0.91
44415 ▁μg 0.416729 0.98
37397 680 0.416733 0.99 ▁680
44815 Keeping 0.416782 0.98
49259 645 0.416803 0.99
28724 Eventually 0.416807 0.99
24199 ▁Ultimately 0.416827 0.95
33314 erala 0.416834 0.76 ▁Kerala
26417 Actually 0.416834 0.98
49569 ▁nonsensical 0.416835 0.97
28312 ▁unfocused 0.416847 0.97 ▁unfocusedRange
39322 580 0.41688 1 ▁580
48893 Disable 0.416911 0.99
32417 605 0.416971 1
41026 joice 0.417049 0.97 ▁rejoice
26279 285 0.417057 0.99 ▁285
25707 244 0.417105 1 ▁244
37133 umsy 0.417141 0.099 ▁clumsy
49406 perors 0.417142 0.88
50038 ▁436 0.417147 0.99
43564 803 0.417176 1
47045 documented 0.417181 0.98
38783 483 0.417188 1
47697 ▁intensify 0.417242 0.93
38932 ▁encompasses 0.417264 0.92
24096 176 0.417287 0.99 ▁176
36189 575 0.417364 0.99
2887 acebook 0.417372 0.27 ▁Facebook, Facebook, facebook, ▁facebook
20168 astrous 0.41738 0.92 ▁disastrous
45370 ▁embodies 0.417457 0.99
40887 ▁gastrointestinal 0.417466 0.94
35195 361 0.417485 1 ▁361
38818 ▁homers 0.417525 0.99
43356 423 0.417534 0.99 ▁423
47034 ▁despicable 0.417576 0.88
41529 ▁handcuffed 0.417631 0.94
22186 195 0.417635 1 ▁1959, ▁1953, ▁1958, ▁1954, ▁195, ...
39882 ▁281 0.417643 0.98
48629 ▁distraught 0.417675 0.97
24991 197 0.4177 0.99 ▁197, 1970, 1979, 1977, 1978, ...
40553 Across 0.417706 0.97
23195 275 0.41773 0.99 ▁275
11585 eatures 0.417749 0.16 ▁Features, Features, ▁Creatures, features
35797 ▁disparities 0.417759 0.98
27192 171 0.417812 0.98 ▁171
38366 ▁pathogens 0.417814 0.94
39697 ▁286 0.417867 0.96
40271 481 0.417883 1
45743 ocusing 0.417917 0.54
25272 196 0.417918 0.99 ▁196, 1969, 1960, 1968, 1967, ...
29626 339 0.418033 1 ▁339
45799 ▁sprinkle 0.418055 0.72 ▁sprinkled
47316 ▁appease 0.418069 0.94
49703 781 0.418098 0.99
44686 ーティ 0.418099 0.91 ▁サーティ, ▁サーティワン
35378 507 0.418127 0.99
31010 395 0.418132 0.98 ▁395
16080 ▁corrid 0.418173 0.93 ▁corridor, ▁corridors
32071 Creating 0.418236 0.98
48512 ▁deducted 0.418245 0.97
30120 407 0.418261 1 ▁407
33781 495 0.418288 0.99
45271 1965 0.418325 1
41776 okane 0.41833 0.88 ▁Spokane
36625 453 0.418349 1
48494 Whereas 0.418368 0.96
47058 ▁worsen 0.418387 0.98
48689 ▁indignation 0.418396 0.98
42347 Residents 0.418397 1
46347 ▁uptick 0.418399 0.96
30716 ▁cannabin 0.41842 1 ▁cannabinoids, ▁cannabinoid
45664 mbuds 0.418484 0.95 mbudsman
35638 506 0.418492 0.99
37511 Lastly 0.418528 0.75
48367 ▁worrisome 0.41853 0.92
43865 ▁scalable 0.418598 0.98
46572 563 0.418604 1
40660 ▁326 0.418639 0.98
31714 479 0.418639 0.99
24059 ▁prohibits 0.418672 0.97
45072 ▁inquired 0.418728 0.94
48943 ▁unbelievably 0.418742 0.77
49841 034 0.418775 1
41423 ▁332 0.418862 0.98
16950 ournaments 0.418932 0.92 ▁tournaments
29211 336 0.418962 1 ▁336
39710 441 0.418968 0.99
38867 ▁336 0.418994 0.96
43187 Jennifer 0.418999 0.99
24898 Recomm 0.419048 0.89 ▁Recommend, ▁Recommended, Recommended, Recommend
22250 ▁Regardless 0.419054 0.94
29372 ▁guiActiveUn 0.419055 0.9 ▁guiActiveUnfocused
33646 488 0.419061 0.99
47132 etchup 0.419097 0.94
34059 Officers 0.419103 0.99
49658 ▁1862 0.419114 0.96
25340 ▁undertake 0.419146 0.95
37688 784 0.419153 1
41807 leanor 0.419178 0.99 ▁Eleanor
34042 ▁muttered 0.419189 0.98
40828 1974 0.419214 0.99
40574 LinkedIn 0.419214 1
31303 umably 0.41922 0.46 ▁Presumably
34716 460 0.419286 1
13898 Unfortunately 0.419317 0.95
28460 338 0.419317 0.99 ▁338
48341 830 0.41938 1
29193 Scientists 0.419453 0.97
47580 ▁417 0.419454 0.98
38582 Suddenly 0.419474 0.97
19381 iovascular 0.419479 0.91 ▁cardiovascular
36061 ▁surged 0.419527 0.71
43929 ▁heinous 0.419587 0.92
28592 253 0.419655 1 ▁253
41504 farious 0.419661 0.12 ▁nefarious
37482 Thousands 0.419672 0.99
30900 ▁Initially 0.419711 0.92
42105 intuitive 0.419806 0.99
23730 ▁Clearly 0.419863 0.88
34999 inances 0.419873 0.99 ▁ordinances
27534 ░░ 0.419876 0.99
39063 ▁Afterwards 0.419884 0.8
47760 727 0.419887 0.98
29416 409 0.420032 0.99 ▁4096, ▁409, ▁4090
38907 578 0.420049 1
38549 ▁273 0.420064 0.95
41292 598 0.420111 0.99
45516 ▁poignant 0.420131 0.92
42744 -+-+-+-+ 0.420147 0.69
50166 ▁mobilized 0.420211 0.98
44063 ▁352 0.42028 0.98
40761 760 0.420324 0.99 7601, ▁760
23815 228 0.420369 1 ▁228
44729 ▁346 0.420372 0.99
39257 ▁championed 0.420399 0.99
37757 glomer 0.420411 0.46 ▁conglomer, ▁conglomerate
22316 ▁detrim 0.420508 0.89 ▁detrimental, ▁detriment
32570 Palest 0.420522 1 Palestinian
41023 1972 0.420536 0.99
31276 Fortunately 0.420546 0.65
49290 ▁malnutrition 0.420559 0.99
47338 825 0.420613 0.99
33583 Applic 0.420613 1 Applications, ▁Applicant
31980 607 0.420623 1
34770 373 0.420631 0.99 ▁373
28771 399 0.420705 1 ▁399
25883 Officials 0.420726 0.95 ▁Officials
35844 690 0.420756 0.99
42723 discrimination 0.420756 1
43525 Located 0.420758 1
42691 994 0.420801 1
45969 515 0.420802 1
35989 ▁243 0.420811 0.98
45118 ▁overtly 0.420813 0.92
44006 ▁encompass 0.420828 0.99
46431 ▁proclaiming 0.420873 0.94
42245 ertation 0.42088 0.97 ▁dissertation
47452 HAEL 0.420958 1
49626 ▁overdoses 0.420994 0.96
37444 ▁petertodd 0.420999 0.93
41322 060 0.421022 0.99
39827 enfranch 0.421043 0.99 ▁disenfranch
39320 ▁318 0.421072 0.98
15040 byss 0.421075 0.92 ▁Abyss, ▁abyss, ▁Abyssal, Abyss
34284 estinal 0.421105 0.78 intestinal, ▁gastrointestinal
41296 cipled 0.42112 0.99 ▁principled
24710 oldemort 0.421123 0.86 ▁Voldemort
28978 348 0.421139 1 ▁348
38107 ▁272 0.421194 0.96
43056 abiding 0.421215 1 ▁abiding
49010 ▁dismissive 0.421237 0.97
24035 Along 0.421248 0.99
48268 respected 0.421269 0.98
32275 ructose 0.421295 0.7 ▁fructose
44505 ▁630 0.42133 0.98
23631 ▁Certainly 0.421332 0.73
50119 951 0.421339 1
47623 ▁laughable 0.421419 0.99
41538 Magikarp 0.421429 0.99 GoldMagikarp, ▁SolidGoldMagikarp
35264 ▁244 0.421441 0.98
30762 ▁metic 0.421467 0.17 ▁meticulously, ▁meticulous
24881 ricanes 0.421518 0.99 ▁Hurricanes, ▁hurricanes
46528 ▁recite 0.421536 0.7
48170 517 0.421571 1
33796 Important 0.421591 0.97
50186 ▁conspiring 0.421593 0.97
28857 274 0.421594 0.98 ▁274
34372 ▁egregious 0.421612 0.95
45719 ▁525 0.421666 1
48868 ▁625 0.421693 0.96
45962 ▁curtail 0.421699 0.97
44248 atonin 0.421702 0.99
48676 ▁intrinsically 0.421712 0.98
42947 469 0.421714 1
34096 Twenty 0.421737 0.96
44622 ▁376 0.421745 0.99
48866 ▁disapprove 0.421861 0.86
21271 HAHA 0.421941 0.85 HAHAHAHA
50146 ▁lackluster 0.422066 0.92
33916 324 0.422092 1 ▁324
31696 noticed 0.422115 0.72 ▁unnoticed
48120 ▁touting 0.422125 0.99
42802 877 0.422152 1
38271 aucuses 0.422166 0.78 ▁caucuses
47938 ▁ingrained 0.422166 0.97
38339 471 0.422182 0.99
30460 388 0.422186 1 ▁388
34323 ▁234 0.422238 0.99
37068 ▁Yanuk 0.422262 1 ▁Yanukovych
49989 ▁448 0.42234 1
47305 ▁strikingly 0.422355 0.96
35133 487 0.422364 1
40220 1973 0.422367 1
47194 ▁adore 0.42237 1
32576 475 0.422386 1 ▁475
44119 ▁premiered 0.422425 0.98
48250 635 0.422459 1
40360 ▁blinked 0.422461 0.97
49312 ▁overwrite 0.422462 0.98
45310 023 0.422462 0.99
50147 ▁aback 0.422463 0.83
42440 ▁cytok 0.422584 0.96
36284 ▁vehement 0.422595 0.9 ▁vehemently
40173 428 0.422599 0.99 ▁428
49615 ▁perceptual 0.422611 0.99
11273 ▁enthusi 0.422623 0.083 ▁enthusiasm, ▁enthusiastic, ▁enthusiasts, ▁enthusiast, ▁enthusiastically
42184 Romney 0.422643 1
11399 ▁Furthermore 0.42265 0.62
41824 ▁pollutants 0.422692 0.97
30368 281 0.422705 1 ▁281
41683 ▁linemen 0.422774 0.87
48831 ▁spate 0.422849 0.87
28470 ▁Aside 0.422853 0.93
33799 ▁contends 0.422871 0.83
43526 057 0.422891 1
47685 ▁biochemical 0.422893 0.97
40239 ampoo 0.422906 0.98 ▁shampoo
43234 910 0.422914 0.99
44867 ▁nutrit 0.422947 0.8 ▁nutritious
25870 278 0.422974 0.99 ▁278
49334 brainer 0.422987 0.97
40516 0.422996 1
36879 665 0.423012 0.99
31751 609 0.42308 0.99
44064 doctoral 0.423094 0.99
48513 ▁partake 0.423111 0.98
45403 1959 0.423138 1
26514 377 0.423138 0.99 ▁377
43900 ilitarian 0.423241 0.61 ▁utilitarian
47236 ▁coercive 0.423246 0.96
29703 406 0.423259 1 ▁406
38369 ▁307 0.423267 0.97
23188 178 0.423285 1 ▁178
44698 027 0.42329 0.98
39200 ▁Suppose 0.42333 0.99
14223 ;;;; 0.423378 0.98 ;;;;;;;;, ;;;;;;;;;;;;
41747 ▁waging 0.423393 0.99
35871 ▁ideologies 0.423395 0.99
33981 647 0.423417 0.99
47582 774 0.423422 0.99
46477 1964 0.423454 1
49786 evaluate 0.423498 1
41060 796 0.423518 1
44328 ▁negate 0.423551 0.92
40659 ▁Occasionally 0.423612 0.99
31352 ▁supremacist 0.423615 0.98 ▁supremacists
21889 ottenham 0.423634 0.26 ▁Tottenham
35596 Simply 0.423673 0.96
30064 ▁tablespoons 0.423686 0.99
44491 ▁cumbersome 0.423691 0.93
42218 excluding 0.423694 0.93
32148 332 0.423754 1 ▁332
23496 aundering 0.423764 0.82 ▁laundering
36841 ▁markedly 0.423795 0.92
49254 ▁SERVICES 0.423795 0.99
46145 ▁summarizes 0.423798 0.9
49294 ▁421 0.423818 0.99
36941 ▁indulge 0.423832 0.86
43977 044 0.423844 0.99
36543 ▁sarcast 0.423857 0.45 ▁sarcastic
22660 ▁Likewise 0.423871 0.6
50046 ▁warheads 0.423918 0.99
41306 ▁skillet 0.423962 0.98
46861 ▁Arpaio 0.423978 0.99
44361 422 0.423981 1 ▁422
34044 ▁236 0.423984 0.99
31540 ▁assaulting 0.423991 0.97
46927 roversial 0.423997 0.78
28935 ▁Volunte 0.424021 1 ▁Volunteer, ▁Volunteers
43078 utonium 0.424088 0.77 ▁plutonium
48348 ▁additives 0.424095 1
37601 679 0.424147 0.98
25929 ▁Alternatively 0.42415 0.92
45200 ▁culminated 0.424166 0.94
32848 ▁hesitant 0.424178 0.92
45326 545 0.424203 0.99
23305 ▁notor 0.424211 0.72 ▁notoriously, ▁notoriety
43571 772 0.424233 0.99
33792 ormonal 0.424251 0.88 ▁hormonal
31575 342 0.424327 1 ▁342
49574 ▁metabolites 0.424335 0.98
37177 Attempt 0.424337 0.99 Attempts
49649 953 0.424344 1
23379 Almost 0.424364 0.88
42763 ▁drawbacks 0.424388 0.93
33032 457 0.42439 1 ▁457
43383 ▁levied 0.42439 0.97
38147 ▁276 0.424403 0.99

Tokens with partial UTF-8 sequences

2 entries below threshold of 0.206

token_id token indicator in_other_tokens
39820 龍<0xE5><0xA5> 0.00162309 龍契士
33434 <0x96><0x9A>士 0.201911 龍喚士
214 additional entries above threshold
token_id token indicator in_other_tokens
13945 <0xA5><0x9E> 0.316751 , ▁神
47490 <0xA9><0xB6><0xE6> 0.378875 <0xA9><0xB6>極
23596 <0x93><0x98> 0.38741 ▁ⓘ,
39374 <0x91>士 0.394016 龍契士
47703 <0xA9><0xB6>極 0.428451
43897 <0xE6><0xA9> 0.438768
4204 <0xBF><0xBD> 0.441968 , ��, ����, ▁�, ▁����, ...
22757 <0x9A>醒 0.442171 覚醒, ▁裏覚醒
34247 ا<0xD8> 0.443137
23329 <0x8E><0x8B> 0.446635
43769 <0x81><0xAB> 0.447803
47797 <0xE8><0x83> 0.449707
43889 <0xE5><0x8E> 0.45015
19021 <0xE7><0x9A> 0.450203
49035 <0xE5><0x87> 0.450535
32003 <0xE8><0x80> 0.450764
32518 <0xE8><0xA3> 0.45343 ▁裏<0xE8>,
48071 <0xE0><0xA6> 0.455177
39355 <0xE5><0x8D> 0.455582
46479 <0xE4><0xBF> 0.456193
36596 <0xBB><0x92> 0.456427
46763 <0xE6><0x95> 0.456463
11805 <0x98><0x85> 0.457008 , ▁★, ★★
45539 <0xAC><0xBC> 0.457333
29826 <0xE6><0xAD> 0.457675
35069 <0xA5><0xB5> 0.458313 <0xA9><0xB6>極
25001 <0xE5><0xA5> 0.459953 龍<0xE5><0xA5>, 龍契士,
18004 <0xE5><0xA3> 0.460822 , <0x96><0x9A>士, 龍喚士, <0x91>士, 龍契士
25081 <0x99><0x82> 0.461012 ▁🙂
37239 <0xE9><0x9B> 0.461139
49426 <0xE7><0x90> 0.461541
49694 <0xE9><0x9A> 0.46357
33699 <0xE6><0x89> 0.466032
22755 <0xE6><0x88> 0.466052
43380 <0xE5><0xAF> 0.466767
28839 <0xE5><0x9C> 0.466827
19049 龍<0xE5> 0.466854 龍喚士, 龍<0xE5><0xA5>, 龍契士
47728 <0xF0><0x9D> 0.467541
48953 <0xAD><0xB7> 0.467792
8955 <0x82><0xAC> 0.468319 ▁€, ,
37863 <0xE5><0x86> 0.468368
45617 <0xE9><0xA3> 0.468685
45865 <0xAB><0x98> 0.468965
6408 <0xA3><0x8F> 0.469302 ▁裏, ▁裏<0xE7>, ▁裏覚醒, ▁裏<0xE8>
18433 <0xAD><0x94> 0.470022 , の魔
36365 <0xE6><0xB0> 0.470478
41840 <0xF0><0x9F><0x91> 0.471918 ▁<0xF0><0x9F><0x91>
45784 <0xE7><0xB7> 0.472803
27764 <0xE5><0xAD> 0.472922
46349 <0xE6><0x83> 0.472942
32573 <0xE8><0xBF> 0.473065
33951 י<0xD7> 0.473572
48958 <0xE8><0x88> 0.473692
33232 <0xE5><0xBF> 0.474013
40367 <0xE7><0x9C> 0.474041
7134 <0x88><0x92> 0.47414 ▁−, , ▁(−
22880 <0xE2><0x95> 0.474212 , ══
46788 <0x8A><0xB1> 0.474641
43102 <0xE8><0xBB> 0.476129
45433 <0x81><0x96> 0.476318
35975 <0xEC><0x9D> 0.476523
47249 <0xF0><0x9F><0x98> 0.476631
21253 <0x9A><0xE9> 0.476954 <0x9A>醒, 覚醒, ▁裏覚醒
28938 <0xE5><0x90> 0.476994
31479 <0xE0><0xB9> 0.477151
47947 <0xE5><0x8B> 0.47763
18796 <0xE7><0x94> 0.477676 ,
44293 <0xE5><0x8C> 0.477925
22887 <0xE5><0xB0> 0.478969
10253 <0x86><0x92> 0.479474 ▁→, <0x9A>醒, 覚醒, ▁裏覚醒,
47991 <0xED><0x95> 0.479547
45911 <0xE7><0x95> 0.479553
45379 <0xE7><0x8B> 0.479554
35707 <0xE6><0x84> 0.479617
30298 <0xE5><0x89> 0.479731
29785 <0xE9><0x97> 0.479936
42314 ▁<0xE0><0xA8> 0.480335
19526 <0xE4><0xBD> 0.4808 , 使
31965 <0xE7><0x89> 0.481065
34650 <0xE5><0xA7> 0.481602
34932 <0xE9><0x87> 0.48175
44165 <0xE7><0xAB> 0.482186
27670 <0xE4><0xBC> 0.482448
50159 <0x99><0xBD> 0.482667
46237 <0xE8><0xAF> 0.482668
35050 <0xB6><0xE6> 0.482791 <0xA9><0xB6><0xE6>, <0xA9><0xB6>極
31619 ▁<0xEB> 0.482926
30585 <0xE5><0xB8> 0.483343
46695 <0xEB><0x8B> 0.483431
31204 <0x96><0x9A> 0.48369 <0x96><0x9A>士, 龍喚士
49149 の<0xE5><0xAE> 0.484379
42164 ▁<0xE6><0x9C> 0.484584
34402 <0xE9><0x81> 0.485002
45495 <0xE1><0xBD> 0.485084
13783 <0xE5><0xA4> 0.485114 , , ▁<0xE5><0xA4>
34504 ▁裏<0xE8> 0.485328
42062 <0x88><0xE8> 0.485419
30266 <0xE6><0x9D> 0.485944
36181 <0xE5><0xBE> 0.486193
19469 <0xE0><0xA8> 0.486271 ▁<0xE0><0xA8>
37772 <0xE5><0x91> 0.486389
20015 <0xE4><0xBB> 0.486396
43636 <0xE5><0x82> 0.486406
38184 <0xE6><0xB5> 0.486498
47078 <0xE7><0x84> 0.48656
33426 の<0xE9> 0.486781 の魔
11737 <0xE9><0xBE> 0.486799 , 龍<0xE5>, 龍喚士, 龍<0xE5><0xA5>, 龍契士
25443 о<0xD0> 0.487452
26344 <0xE5><0x88> 0.487557
32849 <0xE9><0x83> 0.488043
43718 <0xE6><0xA0> 0.488645
32432 <0xE5><0xB7> 0.489007
32368 <0xE5><0x9B> 0.489398
26534 <0x85><0x8B> 0.489413 , ㅋㅋ
34460 <0xE9><0x80> 0.489867
33176 <0xE5><0xB9> 0.4899
41340 <0xE0><0xBC> 0.490545
41753 <0xE5><0xBA> 0.490717
12859 <0xE4><0xBA> 0.490731 ,
36469 ▁<0xE5><0xA4> 0.4909
33566 <0xE7><0x9B> 0.491017
43518 <0x82><0x8E> 0.491796
36685 <0xE5><0xA6> 0.49236
37345 <0xE6><0xB3> 0.492377
23877 <0xE6><0x96> 0.493012
23626 <0xE6><0x98> 0.493242
41678 <0xB6><0x85> 0.493761
45250 <0xE6><0x80> 0.494832
43297 <0xE0><0xA9> 0.495167
15139 ▁<0xE2><0x89> 0.495304 ▁≥, ▁≡, ▁≤
20998 <0xE5><0x8F> 0.495347
33768 <0xE6><0x97> 0.495784
17358 <0xE8><0xA6> 0.4965 覚醒, ▁裏覚醒
45739 <0xE8><0xAA> 0.496809
10310 <0xE4><0xB8> 0.497147 , , , ,
23821 ▁<0xEC> 0.497489
37605 <0xE5><0xBD> 0.499306
17739 <0xE5><0x85> 0.499424
20046 <0xE4><0xB9> 0.501329
39611 <0xE1><0xB5> 0.501967
22522 <0xE5><0xAE> 0.502101 の<0xE5><0xAE>
50169 ▁<0xF0><0x9F><0x91> 0.502247
18923 ▁<0xD9> 0.503234 ▁و, ▁م
15926 <0xE2><0x97> 0.505319 ▁<0xE2><0x97>, , ▁●,
41585 <0xE1><0xB8> 0.50542
35266 <0xEF><0xB8> 0.507202
1792 <0xE3><0x82> 0.507488 , , , , , ...
38461 <0xE9><0x96> 0.50884
14519 ▁<0xE2><0x9C> 0.510499 ▁✓, ▁✔
41365 <0x82><0xAA> 0.510769
28156 <0xE5><0xBC> 0.510932
29705 <0xE2><0x86> 0.512347 ,
27950 <0xE5><0x8A> 0.512615
26193 <0xE8><0xA1> 0.513023
12045 ー<0xE3><0x83> 0.513714 ーン, ール, ーテ, ーティ, ▁サーティ, ...
27032 の<0xE6> 0.514515
35705 <0xE2><0x89> 0.515011 ▁≡, ▁≤
19567 <0xE0><0xB8> 0.515037
2515 <0xE3><0x81> 0.515715 , の<0xE5>, の<0xE7>, , , ...
8008 <0x84><0xA2> 0.516003 , ™:
30325 ▁<0xF0><0x9F><0x98> 0.517076
20174 ▁裏<0xE7> 0.517223
26292 <0xE1><0xB9> 0.517404
17312 <0xE6><0x9C> 0.517434 ▁<0xE6><0x9C>
11976 <0xE0><0xA4> 0.51759 ▁<0xE0><0xA4>,
26486 <0xE2><0x9C> 0.517651 ▁✔
14360 ▁<0xD7> 0.51925
6552 <0xE2><0x94> 0.520441 , ──, ▁<0xE2><0x94>, ────, ▁│, ...
17550 ▁<0xD8> 0.520777 ▁ال
23294 ▁<0xE3><0x81> 0.522037
39333 <0xB2><0xBE> 0.523633
17433 ▁<0xE3><0x82> 0.523751 ▁サ, ▁サーティ, ▁サーティワン
28225 ▁<0xE0><0xA4> 0.524213
24231 <0xE0><0xA5> 0.524801
42527 ▁<0xE2><0x87> 0.525539
28053 ▁<0xE1> 0.525607
13305 ▁<0xE2><0x94> 0.525661 ▁│, ▁├, ▁├──
43074 ▁<0xE2><0x9D> 0.526396
17683 の<0xE7> 0.526527
17804 ▁<0xE2><0x86> 0.526877 ▁↑
15474 の<0xE5> 0.529464 の<0xE5><0xAE>
17992 <0xE2><0x99> 0.529969 ▁<0xE2><0x99>, ,
24966 ▁<0xE2><0x97> 0.530476 ▁●
32391 <0xE2><0x9D> 0.531058 ▁<0xE2><0x9D>
5008 <0xE2><0x96> 0.532309 , ██, ▁<0xE2><0x96>, ████, , ...
8582 <0xF0><0x9F> 0.534677 ▁<0xF0><0x9F>, ▁<0xF0><0x9F><0x98>, ▁🙂, <0xF0><0x9F><0x91>, <0xF0><0x9F><0x98>, ...
29773 <0xEE><0x80> 0.534788
24583 <0xE2><0x98> 0.534919 ★★, ▁<0xE2><0x98>,
1209 <0xE3><0x83> 0.535312 , , , , , ...
14524 ▁<0xE3><0x83> 0.541521
18074 ▁<0xCF> 0.54181 ▁τ
18872 ▁<0xE2><0x88> 0.541945 ▁∼
5099 <0xE3><0x80> 0.542084 , , , , , ...
34719 ▁<0xE2><0x98> 0.544297
16268 ▁<0xE9> 0.545694
46256 <0xE2><0x81> 0.549108
13328 ▁<0xE7> 0.549931 ▁神
25370 ▁<0xC5> 0.556054
5525 ▁<0xE8> 0.558289 ▁裏, ▁裏<0xE7>, ▁裏覚醒, ▁裏<0xE8>
10545 ▁<0xE6> 0.562226 ▁<0xE6><0x9C>
24861 <0xE2><0x88> 0.562391 ▁(−, ▁∼
34754 ▁<0xC4> 0.562496
7377 ▁<0xCE> 0.563062 ▁μ, ▁α, ▁β, ▁Δ, ▁μg
12520 ▁<0xF0><0x9F> 0.564086 ▁<0xF0><0x9F><0x98>, ▁🙂, ▁<0xF0><0x9F><0x91>
27332 ▁<0xEF> 0.569655 ▁��������
10263 ▁<0xE5> 0.569912 ▁<0xE5><0xA4>
12466 ▁<0xD0> 0.580397
20724 ▁<0xE2><0x99> 0.581587
1587 ▁<0xC2> 0.589135 ▁£, ▁\xa0, ▁±, ▁§, ▁©, ...
11019 ▁<0xE2><0x96> 0.596913 ▁█, ▁■, ▁►
447 <0xE2><0x80> 0.6029 ▁<0xE2><0x80>, ▁–, ▁—, , , ...
564 ▁<0xE2><0x80> 0.612308 ▁–, ▁—, ▁…, ▁•, ▁\u200b, ...
2343 ▁<0xE2> 0.62053 ▁…, ▁•, ▁−, ▁€, ▁<0xE2><0x96>, ...
6184 ▁<0xC3> 0.637716 ▁×, ▁à, ▁é, ▁þ, ▁É, ...
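
The report does not say how a token qualifies for this table, so the following is only a minimal sketch of one plausible criterion: a token is a fragment when its raw bytes do not decode as UTF-8 on their own. The helper name `is_partial_utf8` is hypothetical and is not taken from the tool that produced this report.

```python
def is_partial_utf8(raw: bytes) -> bool:
    """Return True if `raw` is not, on its own, a complete UTF-8 string.

    Assumption: this mirrors the idea behind the table above; the actual
    report generator may use a different test.
    """
    try:
        raw.decode("utf-8")
    except UnicodeDecodeError:
        return True
    return False


# Token 39820 from the table: the character 龍 followed by bytes 0xE5 0xA5,
# which are the first two bytes of an unfinished three-byte character.
fragment = "龍".encode("utf-8") + bytes([0xE5, 0xA5])

# A complete character for comparison: 士 encodes to three bytes.
complete = "士".encode("utf-8")

print(is_partial_utf8(fragment))
print(is_partial_utf8(complete))
```

Single-byte tokens are listed separately under "Byte tokens", so a real filter would also need to exclude tokens whose raw length is one byte.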

Byte tokens

46 entries below threshold of 0.140

token_id token indicator ord hex byte_type
179 <0xF7> 0.00115538 247 0xF7 unused_utf8
177 <0xF5> 0.00115615 245 0xF5 unused_utf8
187 <0xFF> 0.00116438 255 0xFF unused_utf8
185 <0xFD> 0.00117475 253 0xFD unused_utf8
183 <0xFB> 0.00117517 251 0xFB unused_utf8
178 <0xF6> 0.00118297 246 0xF6 unused_utf8
181 <0xF9> 0.0011909 249 0xF9 unused_utf8
182 <0xFA> 0.00121307 250 0xFA unused_utf8
186 <0xFE> 0.00128454 254 0xFE unused_utf8
184 <0xFC> 0.0013572 252 0xFC unused_utf8
180 <0xF8> 0.00136507 248 0xF8 unused_utf8
202 \x0e 0.00137275 14 0x0E ascii
188 \x00 0.00137687 0 0x00 ascii
205 \x11 0.00137717 17 0x11 ascii
125 <0xC1> 0.00139213 193 0xC1 unused_utf8
213 \x19 0.00141013 25 0x19 ascii
197 \t 0.00141722 9 0x09 ascii
204 \x10 0.00142157 16 0x10 ascii
211 \x17 0.00145513 23 0x17 ascii
207 \x13 0.00145561 19 0x13 ascii
26 additional entries below threshold
token_id token indicator ord hex byte_type
200 \x0c 0.0014562 12 0x0C ascii
208 \x14 0.00146127 20 0x14 ascii
189 \x01 0.00146729 1 0x01 ascii
193 \x05 0.00146949 5 0x05 ascii
214 \x1a 0.0014773 26 0x1A ascii
190 \x02 0.00147963 2 0x02 ascii
199 \x0b 0.00148165 11 0x0B ascii
195 \x07 0.00149047 7 0x07 ascii
201 \r 0.00149822 13 0x0D ascii
194 \x06 0.00150222 6 0x06 ascii
216 \x1c 0.00151449 28 0x1C ascii
217 \x1d 0.00151765 29 0x1D ascii
219 \x1f 0.00152028 31 0x1F ascii
124 <0xC0> 0.00152189 192 0xC0 unused_utf8
203 \x0f 0.00152725 15 0x0F ascii
206 \x12 0.00153619 18 0x12 ascii
212 \x18 0.0015434 24 0x18 ascii
221 \x7f 0.0015437 127 0x7F ascii
192 \x04 0.00154603 4 0x04 ascii
210 \x16 0.00154918 22 0x16 ascii
196 \x08 0.00157344 8 0x08 ascii
218 \x1e 0.00160688 30 0x1E ascii
209 \x15 0.00163686 21 0x15 ascii
191 \x03 0.00166035 3 0x03 ascii
215 \x1b 0.00167865 27 0x1B ascii
153 <0xDD> 0.105542 221 0xDD utf8
210 additional entries above threshold
token_id token indicator ord hex byte_type
174 <0xF2> 0.167003 242 0xF2 utf8
173 <0xF1> 0.173873 241 0xF1 utf8
154 <0xDE> 0.230459 222 0xDE utf8
155 <0xDF> 0.27928 223 0xDF utf8
176 <0xF4> 0.300827 244 0xF4 utf8
152 <0xDC> 0.396505 220 0xDC utf8
175 <0xF3> 0.405284 243 0xF3 utf8
150 <0xDA> 0.421559 218 0xDA utf8
160 <0xE4> 0.464325 228 0xE4 utf8
145 <0xD5> 0.464496 213 0xD5 utf8
143 <0xD3> 0.466559 211 0xD3 utf8
172 <0xF0> 0.466762 240 0xF0 utf8
151 <0xDB> 0.471456 219 0xDB utf8
169 <0xED> 0.473067 237 0xED utf8
166 <0xEA> 0.482233 234 0xEA utf8
144 <0xD4> 0.488315 212 0xD4 utf8
147 <0xD7> 0.489711 215 0xD7 utf8
148 <0xD8> 0.491993 216 0xD8 utf8
131 <0xC7> 0.494264 199 0xC7 utf8
167 <0xEB> 0.495822 235 0xEB utf8
142 <0xD2> 0.497258 210 0xD2 utf8
139 <0xCF> 0.498268 207 0xCF utf8
149 <0xD9> 0.499115 217 0xD9 utf8
159 <0xE3> 0.499514 227 0xE3 utf8
168 <0xEC> 0.49957 236 0xEC utf8
133 <0xC9> 0.500372 201 0xC9 utf8
137 <0xCD> 0.509996 205 0xCD utf8
130 <0xC6> 0.518623 198 0xC6 utf8
156 <0xE0> 0.523145 224 0xE0 utf8
161 <0xE5> 0.524869 229 0xE5 utf8
141 <0xD1> 0.526708 209 0xD1 utf8
170 <0xEE> 0.526741 238 0xEE utf8
95 <0xA2> 0.537576 162 0xA2 utf8
132 <0xC8> 0.538115 200 0xC8 utf8
162 <0xE6> 0.53825 230 0xE6 utf8
164 <0xE8> 0.538859 232 0xE8 utf8
146 <0xD6> 0.539047 214 0xD6 utf8
104 <0xAB> 0.539134 171 0xAB utf8
121 <0xBD> 0.540342 189 0xBD utf8
165 <0xE9> 0.540569 233 0xE9 utf8
171 <0xEF> 0.54207 239 0xEF utf8
252 <0x9E> 0.542418 158 0x9E utf8
235 <0x8D> 0.543801 141 0x8D utf8
106 <0xAE> 0.546793 174 0xAE utf8
99 <0xA6> 0.54755 166 0xA6 utf8
163 <0xE7> 0.547909 231 0xE7 utf8
136 <0xCC> 0.548162 204 0xCC utf8
103 <0xAA> 0.549399 170 0xAA utf8
250 <0x9C> 0.549502 156 0x9C utf8
255 <0xAD> 0.550183 173 0xAD utf8
111 <0xB3> 0.550384 179 0xB3 utf8
115 <0xB7> 0.550723 183 0xB7 utf8
157 <0xE1> 0.550763 225 0xE1 utf8
228 <0x86> 0.550818 134 0x86 utf8
100 <0xA7> 0.551208 167 0xA7 utf8
96 <0xA3> 0.551218 163 0xA3 utf8
237 <0x8F> 0.551939 143 0x8F utf8
119 <0xBB> 0.552567 187 0xBB utf8
244 <0x96> 0.553186 150 0x96 utf8
232 <0x8A> 0.553933 138 0x8A utf8
238 <0x90> 0.554733 144 0x90 utf8
110 <0xB2> 0.555323 178 0xB2 utf8
248 <0x9A> 0.555586 154 0x9A utf8
240 <0x92> 0.555737 146 0x92 utf8
245 <0x97> 0.556407 151 0x97 utf8
242 <0x94> 0.55675 148 0x94 utf8
243 <0x95> 0.557039 149 0x95 utf8
225 <0x83> 0.557613 131 0x83 utf8
97 <0xA4> 0.557832 164 0xA4 utf8
117 <0xB9> 0.558541 185 0xB9 utf8
109 <0xB1> 0.558543 177 0xB1 utf8
135 <0xCB> 0.558843 203 0xCB utf8
107 <0xAF> 0.558994 175 0xAF utf8
112 <0xB4> 0.559058 180 0xB4 utf8
105 <0xAC> 0.559753 172 0xAC utf8
223 <0x81> 0.559985 129 0x81 utf8
254 <0xA0> 0.560324 160 0xA0 utf8
98 <0xA5> 0.560359 165 0xA5 utf8
101 <0xA8> 0.560811 168 0xA8 utf8
114 <0xB6> 0.560951 182 0xB6 utf8
140 <0xD0> 0.561022 208 0xD0 utf8
230 <0x88> 0.561344 136 0x88 utf8
247 <0x99> 0.561852 153 0x99 utf8
134 <0xCA> 0.562296 202 0xCA utf8
222 <0x80> 0.56239 128 0x80 utf8
94 <0xA1> 0.562492 161 0xA1 utf8
233 <0x8B> 0.563777 139 0x8B utf8
138 <0xCE> 0.563809 206 0xCE utf8
239 <0x91> 0.564012 145 0x91 utf8
102 <0xA9> 0.564341 169 0xA9 utf8
253 <0x9F> 0.56463 159 0x9F utf8
241 <0x93> 0.566288 147 0x93 utf8
231 <0x89> 0.566508 137 0x89 utf8
108 <0xB0> 0.566781 176 0xB0 utf8
116 <0xB8> 0.567318 184 0xB8 utf8
246 <0x98> 0.567941 152 0x98 utf8
122 <0xBE> 0.568097 190 0xBE utf8
120 <0xBC> 0.56886 188 0xBC utf8
118 <0xBA> 0.570098 186 0xBA utf8
113 <0xB5> 0.571313 181 0xB5 utf8
236 <0x8E> 0.5715 142 0x8E utf8
227 <0x85> 0.571958 133 0x85 utf8
234 <0x8C> 0.572285 140 0x8C utf8
251 <0x9D> 0.572305 157 0x9D utf8
224 <0x82> 0.572491 130 0x82 utf8
249 <0x9B> 0.573863 155 0x9B utf8
226 <0x84> 0.573914 132 0x84 utf8
229 <0x87> 0.576642 135 0x87 utf8
128 <0xC4> 0.578588 196 0xC4 utf8
123 <0xBF> 0.57898 191 0xBF utf8
129 <0xC5> 0.597445 197 0xC5 utf8
158 <0xE2> 0.602405 226 0xE2 utf8
126 <0xC2> 0.61877 194 0xC2 utf8
92 } 0.627752 125 0x7D ascii
90 { 0.639155 123 0x7B ascii
127 <0xC3> 0.643043 195 0xC3 utf8
63 ` 0.65181 96 0x60 ascii
61 ^ 0.662929 94 0x5E ascii
3 $ 0.663779 36 0x24 ascii
93 ~ 0.666712 126 0x7E ascii
27 < 0.686481 60 0x3C ascii
2 # 0.686593 35 0x23 ascii
59 \ 0.687838 92 0x5C ascii
91 | 0.694004 124 0x7C ascii
24 9 0.696497 57 0x39 ascii
21 6 0.696751 54 0x36 ascii
23 8 0.698009 56 0x38 ascii
22 7 0.699247 55 0x37 ascii
80 q 0.699591 113 0x71 ascii
31 @ 0.70065 64 0x40 ascii
29 > 0.708774 62 0x3E ascii
57 Z 0.709682 90 0x5A ascii
48 Q 0.710838 81 0x51 ascii
56 Y 0.718 89 0x59 ascii
28 = 0.718791 61 0x3D ascii
15 0 0.7196 48 0x30 ascii
4 % 0.721073 37 0x25 ascii
20 5 0.724367 53 0x35 ascii
5 & 0.731675 38 0x26 ascii
54 W 0.732553 87 0x57 ascii
41 J 0.734824 74 0x4A ascii
52 U 0.739134 85 0x55 ascii
19 4 0.740863 52 0x34 ascii
55 X 0.741186 88 0x58 ascii
36 E 0.741788 69 0x45 ascii
46 O 0.745141 79 0x4F ascii
60 ] 0.746135 93 0x5D ascii
53 V 0.747995 86 0x56 ascii
45 N 0.750942 78 0x4E ascii
40 I 0.752849 73 0x49 ascii
10 + 0.754061 43 0x2B ascii
9 * 0.756454 42 0x2A ascii
43 L 0.761259 76 0x4C ascii
39 H 0.761783 72 0x48 ascii
42 K 0.76298 75 0x4B ascii
49 R 0.765187 82 0x52 ascii
47 P 0.765212 80 0x50 ascii
18 3 0.768257 51 0x33 ascii
38 G 0.77102 71 0x47 ascii
58 [ 0.772278 91 0x5B ascii
37 F 0.772933 70 0x46 ascii
84 u 0.77488 117 0x75 ascii
51 T 0.776583 84 0x54 ascii
34 C 0.779543 67 0x43 ascii
44 M 0.782255 77 0x4D ascii
35 D 0.782783 68 0x44 ascii
89 z 0.78908 122 0x7A ascii
32 A 0.789253 65 0x41 ascii
73 j 0.78963 106 0x6A ascii
87 x 0.789777 120 0x78 ascii
33 B 0.790633 66 0x42 ascii
50 S 0.792968 83 0x53 ascii
81 r 0.79434 114 0x72 ascii
86 w 0.795779 119 0x77 ascii
17 2 0.796077 50 0x32 ascii
220 ▁ 0.796784 32 0x20 ascii
74 k 0.797121 107 0x6B ascii
85 v 0.797511 118 0x76 ascii
0 ! 0.797713 33 0x21 ascii
16 1 0.80789 49 0x31 ascii
79 p 0.80806 112 0x70 ascii
70 g 0.81095 103 0x67 ascii
30 ? 0.813412 63 0x3F ascii
75 l 0.81376 108 0x6C ascii
71 h 0.816604 104 0x68 ascii
67 d 0.817897 100 0x64 ascii
7 ( 0.818547 40 0x28 ascii
66 c 0.823076 99 0x63 ascii
68 e 0.823388 101 0x65 ascii
69 f 0.825164 102 0x66 ascii
88 y 0.82561 121 0x79 ascii
8 ) 0.826005 41 0x29 ascii
65 b 0.826941 98 0x62 ascii
26 ; 0.827545 59 0x3B ascii
62 _ 0.829546 95 0x5F ascii
83 t 0.83119 116 0x74 ascii
78 o 0.831293 111 0x6F ascii
77 n 0.832061 110 0x6E ascii
76 m 0.834383 109 0x6D ascii
72 i 0.835831 105 0x69 ascii
1 " 0.839757 34 0x22 ascii
6 ' 0.859091 39 0x27 ascii
64 a 0.864021 97 0x61 ascii
82 s 0.894729 115 0x73 ascii
14 / 0.934385 47 0x2F ascii
25 : 0.951251 58 0x3A ascii
198 \n 1.0055 10 0x0A ascii
13 . 1.04142 46 0x2E ascii
12 - 1.06188 45 0x2D ascii
11 , 1.10433 44 0x2C ascii
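
The `byte_type` and `token_id` columns above follow a regular pattern, sketched below. The classification rule is inferred from the rows of this table (0xC0, 0xC1 and 0xF5 to 0xFF never occur in well-formed UTF-8), and the id mapping assumes the first 256 vocabulary entries follow the order of GPT-2's byte-to-unicode table. Both function names are hypothetical, not part of the report generator.

```python
def byte_type(b: int) -> str:
    """Classify a byte value the way the byte_type column appears to."""
    if b < 0x80:
        return "ascii"
    # 0xC0, 0xC1 and 0xF5..0xFF cannot appear in well-formed UTF-8.
    if b in (0xC0, 0xC1) or b >= 0xF5:
        return "unused_utf8"
    return "utf8"


def gpt2_byte_token_ids() -> dict:
    """Map each byte value to its token id.

    Assumption: ids are assigned in the order of GPT-2's byte-to-unicode
    table, i.e. the "printable" bytes first, then all remaining bytes.
    """
    printable = (
        list(range(0x21, 0x7F))     # '!' .. '~'
        + list(range(0xA1, 0xAD))   # 0xA1 .. 0xAC
        + list(range(0xAE, 0x100))  # 0xAE .. 0xFF
    )
    rest = [b for b in range(256) if b not in printable]
    return {b: i for i, b in enumerate(printable + rest)}


ids = gpt2_byte_token_ids()
for b in (0x0A, 0x2C, 0xDD, 0xFF):
    print(hex(b), ids[b], byte_type(b))
```

Under these assumptions the sketch reproduces the rows above, for example id 198 for `\n` and id 153 for `<0xDD>`.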

Special tokens

0 entries below threshold of 0.140

1 additional entry above threshold
token_id token indicator
50256 <|endoftext|> 0.778391