Skip to content

Latest commit

 

History

History
1517 lines (1499 loc) · 739 KB

openai_community_gpt2_medium.md

File metadata and controls

1517 lines (1499 loc) · 739 KB

Report for openai-community/gpt2-medium

Model info

  • Model Info:
    • Tied embeddings: True
    • LM head uses bias: False
    • Embeddings shape: [50257, 1024]
  • Tokenizer Info:
    • Vocab Size: 50257
    • Tokenizer Class: GPT2Tokenizer
    • Bytes handling: Byte Input
    • Tokenizer Type: BPE
    • Token for verification prompt building: BuyableInstoreAndOnline
    • Token id for verification prompt building: 40242
  • Indicator summary:
    • Indicator for under-trained tokens: E_{out} Cosine Distance
    • Overall distribution: 0.489 +/- 0.053
  • Detected Token Counts:
    • Number of tested under-trained tokens: 999, 967 non-special, 17 below p = 0.01 threshold, 11 below soft indicator threshold
    • Number of single byte tokens: 256, of which 45 below indicator threshold
    • Number of special tokens: 0, of which 0 below indicator threshold
    • Number of non-single-byte UTF-fragment tokens: 216, of which 1 below soft indicator threshold

Under-trained token indicators plot

Indicators scatter plots

Verification plot

Verification plot

Under-trained token verification results

11 entries below threshold of 0.041

token_id token indicator max_prob in_other_tokens
30897 reportprint 0.00444567 3.9e-07 embedreportprint, cloneembedreportprint, rawdownloadcloneembedreportprint
45544 ▁サーティ 0.00456727 2.7e-07 ▁サーティワン
30212 ▁externalToEVA 0.00459385 3.3e-07 ▁externalToEVAOnly
30905 rawdownload 0.00463021 3.3e-07 rawdownloadcloneembedreportprint
39752 quickShip 0.00471389 2.4e-07 quickShipAvailable
36173 ▁RandomRedditor 0.00473255 2.7e-07 ▁RandomRedditorWithNo
42089 ▁TheNitrome 0.00477672 3.1e-07 ▁TheNitromeFan
40241 InstoreAndOnline 0.00498092 4.4e-07 BuyableInstoreAndOnline
30898 embedreportprint 0.00511497 3.1e-07 cloneembedreportprint, rawdownloadcloneembedreportprint
40240 oreAndOnline 0.00512677 3e-07 InstoreAndOnline, BuyableInstoreAndOnline
30208 ▁externalTo 0.0234748 6.7e-06 ▁externalToEVA, ▁externalToEVAOnly
956 additional entries above threshold
token_id token indicator max_prob in_other_tokens
23090 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ 0.0407522 2.7e-05 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
37574 StreamerBot 0.0409848 1.4e-05 TPPStreamerBot
31573 ActionCode 0.066516 0.0028 externalActionCode
14827 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ 0.095297 0.026 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
42066 Nitrome 0.111814 0.00021 ▁TheNitrome, ▁TheNitromeFan
9364 ÃÂÃÂÃÂÃÂ 0.145773 0.018 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
17629 ▁practition 0.151631 0.022 ▁practitioners, ▁practitioner
39749 DeliveryDate 0.156375 0.98 soDeliveryDate
39142 ThumbnailImage 0.174062 0.25 ItemThumbnailImage
39714 isSpecial 0.180131 0.82 isSpecialOrderable
39655 Orderable 0.185153 0.84 isSpecialOrderable
40219 oreAnd 0.194004 0.044 oreAndOnline, InstoreAndOnline, BuyableInstoreAndOnline
30899 cloneembedreportprint 0.195899 0.00083 rawdownloadcloneembedreportprint
27013 aditional 0.198679 0.65 ▁Traditional, traditional, Traditional
27293 ▁antidepress 0.201805 0.15 ▁antidepressants, ▁antidepressant
5815 ÃÂÃÂ 0.206792 0.74 ÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
13150 ▁subur 0.224842 0.097 ▁suburban, ▁suburbs, ▁suburb
15272 ▁pione 0.230987 0.69 ▁pioneer, ▁pioneering, ▁pioneers, ▁pioneered
30439 ▁unintention 0.234851 0.04 ▁unintentionally, ▁unintentional
4690 ortunately 0.253595 0.00019 fortunately, ▁Unfortunately, ▁unfortunately, Unfortunately, ▁Fortunately, ...
24973 ▁exting 0.25457 0.97 ▁extingu, ▁extinguished
25618 ▁councill 0.263922 0.25 ▁councillor, ▁councillors
13198 ▁earthqu 0.265404 0.51 ▁earthquake, ▁earthquakes
19476 ▁carbohyd 0.277147 1 ▁carbohydrate, ▁carbohydrates
7105 ▁volunte 0.27921 0.91 ▁volunteers, ▁volunteer, ▁volunteered, ▁volunteering
18945 ▁teasp 0.281192 0.89 ▁teaspoon, ▁teaspoons
14695 ▁eleph 0.283802 0.96 ▁elephant, ▁elephants
39693 Buyable 0.296606 0.98 BuyableInstoreAndOnline
31666 ?????-?????- 0.303311 0.93
11548 ▁entreprene 0.304762 0.84 ▁entrepreneurs, ▁entrepreneur, ▁entrepreneurial, ▁entrepreneurship
44392 ▁cumbers 0.309018 0.76 ▁cumbersome
42889 ikuman 0.310367 0.035 ▁Kinnikuman
35496 ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ 0.310747 1
46399 Putting 0.311479 0.99
30513 Shortly 0.313889 0.69
27927 Nearly 0.31483 0.93
41156 Depending 0.317696 0.97
48396 ÛÛ 0.318282 0.97
40475 Considering 0.318328 0.95
42202 GoldMagikarp 0.319694 0.86 ▁SolidGoldMagikarp
26797 Throughout 0.320166 0.87
25658 ?????- 0.321682 0.97 ?????-?????-
44850 Ironically 0.322936 0.94
33092 Interestingly 0.323081 0.79
40817 Honestly 0.323977 0.95
20670 Obviously 0.32457 0.85
5808 ÃÂ 0.327337 0.98 ÃÂÃÂ, ÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ, ÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂÃÂ
44959 Quite 0.327447 0.98
37058 Generally 0.327854 0.98
27924 ▁srf 0.328317 0.99 ▁srfN, ▁srfAttach
30638 Clearly 0.328935 0.97
48142 Assuming 0.328946 0.98
30402 Apparently 0.329255 0.92
14945 Several 0.329478 0.87
18521 Unlike 0.329488 0.82
7003 Although 0.32961 0.93
4183 ▁conflic 0.329711 0.89 ▁conflict, ▁conflicts, ▁conflicting, ▁conflicted
23711 ▁Moroc 0.330016 0.39 ▁Morocco, ▁Moroccan
5392 ▁conclud 0.330525 0.89 ▁concluded, ▁conclude, ▁concludes, ▁concluding
38150 Hundreds 0.330968 0.86
20554 ▁unbeliev 0.330975 0.095 ▁unbelievable, ▁unbelievably
11689 ▁unnecess 0.331183 0.96 ▁unnecessary, ▁unnecessarily
49321 Typically 0.3323 0.87
13710 Perhaps 0.333135 0.93
28877 Whenever 0.333442 0.98
14341 PDATE 0.334026 0.74 UPDATE, ▁UPDATE, PDATED
19373 ▁adolesc 0.334032 1 ▁adolescents, ▁adolescent, ▁adolescence
8332 Despite 0.334149 0.63
24795 Nobody 0.3343 0.97
40443 Initially 0.33434 0.76
26556 Taking 0.334549 0.98
16303 ▁undermin 0.334646 0.67 ▁undermine, ▁undermining, ▁undermined, ▁undermines
28235 aeper 0.33699 0.96 aepernick, ▁Kaepernick
8983 ▁satell 0.337036 0.95 ▁satellite, ▁satellites
45648 Knowing 0.337517 0.99
22315 ▁newcom 0.337805 0.59 ▁newcomers, ▁newcomer
27894 Regardless 0.338152 0.99
46010 Huh 0.338256 0.99
22640 itially 0.338314 0.012 ▁Initially, Initially
43541 Amid 0.338509 0.99
32901 Adding 0.339066 0.98
14698 Having 0.339086 0.98
24307 ▁looph 0.339379 0.98 ▁loophole, ▁loopholes
37288 Often 0.33941 0.89
20570 Getting 0.340336 0.98
24951 Furthermore 0.340889 0.73
39865 proclaimed 0.34141 0.98
32602 Aside 0.341693 0.88
28172 Everybody 0.341715 0.99
6336 ▁Palestin 0.341811 0.81 ▁Palestinian, ▁Palestinians, ▁Palestine
27693 289 0.341954 0.99 ▁289
29740 ▁Azerb 0.341983 0.97 ▁Azerbai, ▁Azerbaijan
8438 everal 0.342446 0.04 ▁Several, Several
9286 ▁exha 0.342809 0.9 ▁exhaust, ▁exhausted, ▁exhaustion, ▁exhaustive, ▁exhausting
13689 Earlier 0.342824 0.86 ▁Earlier
43298 userc 0.343431 0.96 usercontent
24035 Along 0.343538 0.95
44669 Compared 0.343778 0.97
23379 Almost 0.343894 0.93
43569 ÍÍ 0.34395 0.93
15755 ▁millenn 0.344816 0.99 ▁millennials, ▁millennia, ▁millennium, ▁millennial
23305 ▁notor 0.344988 0.41 ▁notoriously, ▁notoriety
36725 Sadly 0.345193 0.96
31524 Basically 0.345276 0.99
36001 Certainly 0.345286 0.93
30695 378 0.346228 1 ▁378
14291 Following 0.346315 0.97
42877 70710 0.34639 1
43258 Nonetheless 0.346456 0.47
27824 367 0.346545 0.99 ▁367
30803 369 0.346712 0.99 ▁369
41451 Isn 0.346842 0.98
48448 iosyn 0.346874 0.95 iosyncr, ▁idiosyncr
6598 ▁behavi 0.346881 0.94 ▁behaviour, ▁behaviors, ▁behavioral, ▁behaving, ▁behaviours, ...
38314 759 0.34737 0.99
45385 674 0.347391 0.99
16080 ▁corrid 0.3474 1 ▁corridor, ▁corridors
33937 Vaults 0.347425 0.8
44217 Hmm 0.34744 0.97
45872 Likewise 0.347511 0.48
25044 ▁Interestingly 0.347608 0.71
42382 Depths 0.347628 0.98
41734 579 0.347998 0.99
44213 Naturally 0.348053 1
27097 -+-+ 0.348105 0.99 -+-+-+-+
34626 394 0.34843 1
38569 758 0.348696 1
34096 Twenty 0.348989 0.98
13171 VIDIA 0.349528 0.95 ▁NVIDIA, NVIDIA
47834 Yep 0.349615 0.97
37710 391 0.349868 0.94
16782 ▁misunder 0.349879 0.59 ▁misunderstanding, ▁misunderstood, ▁misunderstand
30290 283 0.349927 0.96 ▁283
13921 Does 0.350057 0.97 ▁Doesn
27212 Ultimately 0.350267 0.97
42000 ▁hemor 0.350554 0.99 ▁hemorrh
44169 589 0.350664 0.99
12943 ▁encount 0.350677 0.99 ▁encountered, ▁encounters, ▁encountering
43240 798 0.350899 0.99
31442 Alright 0.350973 0.95 ▁Alright
36928 807 0.351068 0.99
28705 Authorities 0.351237 0.98 ▁Authorities
38652 474 0.351332 0.99
46435 653 0.351355 1
15468 Sometimes 0.351413 0.98
32321 392 0.351531 0.99 ▁392
28857 274 0.351566 0.99 ▁274
24096 176 0.351625 0.97 ▁176
30557 346 0.35165 0.97 ▁346
25707 244 0.351746 0.98 ▁244
36314 Seeing 0.351973 0.94
28039 Similarly 0.352244 0.78
40281 Increased 0.352356 1
41538 Magikarp 0.352451 0.97 GoldMagikarp, ▁SolidGoldMagikarp
39111 654 0.352738 0.99
10915 Though 0.352777 0.98 ▁Thought, ▁Thoughts
42322 Personally 0.352783 0.82
27728 298 0.352952 0.99 ▁298
6104 Even 0.353153 0.97 ▁Event, Event, ▁Eventually, ▁Events, Eventually, ...
3523 ▁citiz 0.353387 0.91 ▁citizens, ▁citizen, ▁citizenship
33813 =~=~ 0.353408 1
34784 Probably 0.35351 0.97
39449 494 0.353898 1
37731 ▁Ironically 0.353901 0.78
44163 Alternatively 0.354121 0.87
39925 687 0.354231 0.97
43625 Normally 0.354669 0.99
45734 596 0.355034 1
29416 409 0.355121 1 ▁4096, ▁409, ▁4090
33528 Investigators 0.355124 0.92 ▁Investigators
32583 708 0.355127 0.99
20045 Much 0.355218 0.98
12869 ▁reluct 0.355297 0.98 ▁reluctant, ▁reluctance, ▁reluctantly
38431 658 0.3553 1
45662 raltar 0.355326 0.39 ▁Gibraltar
49051 593 0.355452 0.96
49856 627 0.355498 0.97
40501 Absolutely 0.3555 0.98
38073 497 0.355732 0.99
32071 Creating 0.355754 1
12677 ▁tradem 0.355935 0.99 ▁trademark, ▁trademarks
37887 Usually 0.355937 0.85
37680 657 0.355977 0.99
25883 Officials 0.356021 0.95 ▁Officials
15056 Given 0.35605 0.97
29011 Nevertheless 0.35608 0.82
7601 ▁proport 0.356089 0.95 ▁proportion, ▁proportions, ▁proportional
37482 Thousands 0.356182 0.97
31496 337 0.356192 0.96 ▁337
24606 Moreover 0.356242 0.66
39424 Regarding 0.356254 0.99
31751 609 0.356309 0.99
39401 Prosecutors 0.356332 0.83 ▁Prosecutors
23937 Besides 0.356383 0.93
44815 Keeping 0.356457 0.96
33660 341 0.356496 0.99 ▁341
34427 688 0.356607 1
45791 663 0.356621 1
27404 Going 0.356754 0.99
34287 648 0.356785 0.99
35667 362 0.356829 0.98
25870 278 0.356831 0.98 ▁278
11273 ▁enthusi 0.356906 0.25 ▁enthusiasm, ▁enthusiastic, ▁enthusiasts, ▁enthusiast, ▁enthusiastically
32917 aution 0.356945 0.98 ▁precaution, ▁cautioned
45758 673 0.356959 0.97
29807 272 0.356963 0.99 ▁272
33551 291 0.357017 0.98 ▁291
38056 371 0.357064 0.98 ▁371
48724 572 0.357179 1
27412 368 0.357337 1 ▁368
29211 336 0.35744 0.97 ▁336
45675 Excellent 0.357456 0.99
20677 ▁comr 0.357491 0.98 ▁comrades, ▁comrade
36260 498 0.357561 1
10995 Yeah 0.357676 0.96
34801 705 0.357688 0.99
30743 359 0.357714 0.99 ▁359
43864 672 0.357746 0.97
36720 372 0.357911 0.99 ▁372
7782 ▁occas 0.357953 0.95 ▁occasionally, ▁occasional, ▁occasions
39997 462 0.357987 0.99
26912 246 0.358026 0.97 ▁246
47325 699 0.358124 0.99
37991 @@@@@@@@ 0.358147 0.99
4060 vertisement 0.358287 0.28 Advertisement, vertisements, Advertisements, ▁advertisement, ▁advertisements, ...
46352 584 0.358499 0.99
45786 0.358519 0.98
34770 373 0.358551 1 ▁373
31911 449 0.358573 1
28211 Someone 0.358583 0.97
28592 253 0.358675 0.99 ▁253
16190 Everyone 0.358679 0.99
40523 689 0.358769 0.99
41974 accompan 0.358769 1 accompanied, ▁unaccompanied, ▁accompanies
23216 Additionally 0.358809 0.95
32417 605 0.358961 0.98
37511 Lastly 0.358977 0.5
23722 Could 0.359008 0.95
15562 Incre 0.359055 0.99 ▁Increases, ▁Increase, ▁Increased, Increases, ▁Incredible, ...
43239 597 0.359064 0.97
47106 439 0.359082 0.99 20439
28042 Unless 0.359107 0.95
35890 489 0.359466 0.99
26279 285 0.35956 0.98 ▁285
31697 331 0.359662 0.98 ▁331
43690 436 0.359667 0.98 ▁436
37967 329 0.359684 0.96 ▁329
7191 During 0.359688 0.98
48630 581 0.359692 1
18356 ▁opio 0.359852 0.97 ▁opioid, ▁opioids
26417 Actually 0.359968 1
47521 683 0.359968 0.95
30057 261 0.359994 0.99 ▁261
47896 Whoever 0.360038 1
31980 607 0.360061 0.99
12814 Using 0.360099 1
34653 Uh 0.360109 0.99
45223 fledged 0.360273 0.97
46900 574 0.360381 0.99
48464 Especially 0.360439 0.98
14311 Among 0.360448 0.97
33319 353 0.360461 0.99 ▁353
31128 358 0.360556 1 ▁358
37859 paralleled 0.360577 1 ▁unparalleled
36243 382 0.360662 0.99
9627 Those 0.360794 0.97
7085 Many 0.360962 0.97
15354 Whether 0.361178 0.91
19093 Thus 0.361214 0.94
43467 Understanding 0.36126 0.99
42338 Seriously 0.361299 1
32365 Hopefully 0.361405 0.93
37730 452 0.361475 0.99
35596 Simply 0.361548 0.98
26660 243 0.361574 0.97 ▁243
25948 161 0.361612 0.97 ▁161
31010 395 0.361655 0.99 ▁395
35402 706 0.361791 0.99
44550 753 0.361905 0.98
49211 568 0.361976 0.99
18357 Being 0.362125 1
43452 599 0.362133 0.97
41580 684 0.362142 0.99
25895 idepress 0.362166 0.9 ▁antidepress, ▁antidepressants, ▁antidepressant
33459 459 0.362192 0.98
40035 697 0.362201 0.99
42332 Luckily 0.362242 0.97
35809 668 0.362247 0.98
6943 Most 0.362284 1 ▁Mostly
36276 Finding 0.362471 1
23188 178 0.362505 0.98 ▁178
22745 207 0.362518 0.98 ▁207
36657 669 0.362526 0.98
45683 ▁Notably 0.362559 0.8
26276 269 0.362571 0.99 ▁269
41813 643 0.362618 0.99
26895 309 0.362757 0.99 ▁309
38907 578 0.362783 0.98
39251 757 0.362981 0.98
40179 677 0.363134 0.96
34583 809 0.363163 0.99
32066 356 0.363181 0.95 ▁356
31675 293 0.363242 0.99 ▁293
46239 583 0.36326 0.98
37988 806 0.363336 0.99
23237 164 0.363358 0.98 ▁164
49934 548 0.363412 0.98
32570 Palest 0.363413 0.93 Palestinian
46589 692 0.363426 0.99
23874 Making 0.36344 0.99
39710 441 0.363515 0.98
43134 493 0.363537 1
28978 348 0.363554 0.99 ▁348
34107 396 0.363575 0.97 ▁396
45228 uliffe 0.363576 0.43 ▁McAuliffe
44033 0.363587 1
39357 698 0.363667 0.99 ▁698
44232 Liverpool 0.363737 1
29088 379 0.363809 1 ▁379
11039 ▁tremend 0.363813 0.96 ▁tremendous, ▁tremendously
28753 Semitic 0.363848 0.49
23195 275 0.363927 0.99 ▁275
6109 Every 0.36395 0.98 ▁Everyone, ▁Everything, Everyone, Everything, ▁Everybody, ...
30336 284 0.363965 0.98 ▁284
49995 733 0.363986 0.98
45214 694 0.364032 0.99
27192 171 0.36404 0.95 ▁171
27270 Neither 0.364095 0.96
50148 956 0.364215 1
49541 691 0.364254 1
48564 681 0.364383 0.98
13300 Maybe 0.364413 0.99
28872 241 0.364525 0.98 ▁241
36445 659 0.364594 0.99
27260 446 0.364631 0.99
26514 377 0.364732 1 ▁377
32883 477 0.364779 0.97
8128 Because 0.364874 1
26469 Certain 0.364901 0.99 Certainly
45839 592 0.364915 0.98
46250 671 0.364933 0.96
34718 rehensible 0.364943 0.99 ▁incomprehensible
27175 imbabwe 0.365 0.81 ▁Zimbabwe
34137 484 0.365006 0.99
22883 184 0.365012 0.97 ▁184, ▁1840
43798 670 0.365021 0.98 ▁670
27202 Rather 0.36503 0.99
43564 803 0.36506 0.99
43665 752 0.365083 0.99
43643 Lots 0.36509 0.99
48096 553 0.365115 0.98
31276 Fortunately 0.365139 0.72
31952 398 0.365162 1 ▁398
43704 438 0.365172 1
3633 While 0.365273 0.94
32182 354 0.365405 0.99 ▁354
46302 786 0.36541 1
42117 433 0.365426 0.97 ▁433
33372 397 0.365435 0.97
35638 506 0.36547 0.99
32220 387 0.365532 1 ▁387
40149 482 0.365672 0.99
27137 296 0.365689 0.99 ▁296
24309 151 0.365788 0.99 ▁151
37444 ▁petertodd 0.365815 0.95
26050 279 0.365838 0.99 ▁279
13296 ▁Leban 0.36585 1 ▁Lebanon, ▁Lebanese
38783 483 0.365871 0.96
40486 558 0.365896 1
27988 276 0.365909 0.97 ▁276
25096 186 0.365974 0.97 ▁186, ▁1860, ▁1861, ▁1863, ▁1865, ...
33032 457 0.366156 1 ▁457
35992 WithNo 0.366169 0.97 ▁RandomRedditorWithNo
28676 257 0.366221 0.99 ▁257
33300 649 0.366292 0.99
27371 349 0.366303 0.98 ▁349
34059 Officers 0.366393 0.98
47101 434 0.366393 0.99
38380 463 0.366492 1
48333 Changing 0.366515 0.99
23815 228 0.366618 0.98 ▁228
29334 458 0.366626 1 ▁458
50165 783 0.36669 0.98
21599 156 0.36672 0.99 ▁156
45310 023 0.366725 0.98
33394 352 0.366738 1 ▁352
46096 537 0.366754 0.98
40341 Different 0.366761 0.99
36626 381 0.366853 0.99
48528 693 0.366865 0.98
27800 287 0.366878 0.99 ▁287
20219 139 0.366885 0.99 ▁139
26392 Really 0.366905 0.99
14574 Their 0.366951 0.97
40828 1974 0.366976 0.99
18925 Similar 0.366987 0.94 Similarly
44673 797 0.36699 0.99
41874 754 0.367044 1
42548 676 0.367044 0.98
41544 795 0.367107 1
39506 442 0.367174 0.98
48379 Specifically 0.367195 0.73
44578 464 0.367278 0.97
46928 fascist 0.367362 1 ▁fascists
36623 Critics 0.36738 0.92
35844 690 0.367418 0.98
43187 Jennifer 0.367483 0.99
13898 Unfortunately 0.36762 0.97
28688 608 0.367678 1 ▁608
21738 179 0.367697 0.98 ▁179
44230 885 0.367769 0.98
44617 587 0.367792 0.99
43343 Damn 0.367903 0.99
25710 295 0.367925 0.99 ▁295
38219 756 0.367945 1
40415 GGGGGGGG 0.36797 1
24839 183 0.367996 0.99 ▁183, ▁1830
28460 338 0.368033 0.99 ▁338
35378 507 0.368078 0.98
27203 385 0.368103 0.99 ▁385
35273 351 0.368228 0.99 ▁351
36862 EMOTE 0.368328 0.96
41290 642 0.368331 0.99
45937 ▁379 0.368487 0.99
22675 @@@@ 0.368492 0.96 @@@@@@@@
49955 righteous 0.368529 1
23539 229 0.368529 0.98 ▁229
43950 682 0.368596 0.97
38205 696 0.368609 0.99
40037 establish 0.368632 0.99 establishment
22005 Within 0.368656 1
36387 intestinal 0.368659 1 ▁gastrointestinal
27877 242 0.368695 0.99 ▁242
8491 Are 0.368877 0.99 ▁Area, ▁Aren, ▁Arena, Area, ▁Areas, ...
43038 ▁Okawaru 0.368886 0.88
42752 7601 0.368903 1
22567 209 0.36891 0.97 ▁209
42947 469 0.368921 0.99
41948 557 0.368951 1
24991 197 0.36896 0.95 ▁197, 1970, 1979, 1977, 1978, ...
40553 Across 0.369003 0.96
40271 481 0.369008 0.98
38905 585 0.369044 0.99
48387 Thankfully 0.369051 0.68
45165 organisms 0.369058 1
29626 339 0.369095 0.97 ▁339
15924 Rober 0.369126 1 Robert, ▁Robertson, ▁Roberto, Roberts
47582 774 0.369127 0.98
46968 ▁convol 0.369151 0.74 ▁convoluted
12510 Three 0.36918 0.94
49641 763 0.369255 0.98
49543 ▁inhibits 0.369278 0.85
48638 573 0.369323 0.97
22995 245 0.369324 0.97 ▁245
35175 319 0.369393 0.97 ▁319
47498 approximately 0.369407 0.99
40652 461 0.36946 0.97
26492 191 0.369462 0.96 ▁1914, ▁1919, ▁191, ▁1910, ▁1915, ...
44103 413 0.369486 0.98 ▁413
45449 CLASSIFIED 0.369508 1 ▁UNCLASSIFIED
20107 138 0.369512 0.99 ▁138
49287 883 0.369528 0.99
47915 561 0.369588 0.97 76561
43687 Laura 0.369655 1
25600 258 0.369697 0.99 ▁258
45664 mbuds 0.369732 0.98 mbudsman
4821 According 0.369908 0.98 ▁Accordingly
26561 297 0.369921 0.99 ▁297
32128 376 0.369947 0.97 ▁376
40064 435 0.36995 0.99 ▁435
41289 491 0.370025 0.97
48246 748 0.370029 0.99
34301 Prosecut 0.370052 0.99 ▁Prosecutor, Prosecutors, ▁Prosecutors
37750 790 0.370071 1
40516 0.370078 0.99
48531 526 0.370082 0.99
31380 334 0.370153 0.98 ▁334
34716 460 0.370177 0.99
36809 703 0.370219 0.99
39174 ▁278 0.37024 0.99
49649 953 0.370397 1
48494 Whereas 0.37048 0.97
41060 796 0.370481 0.97
21261 205 0.370532 0.98 ▁205, ▁2050
29193 Scientists 0.370636 0.95
29769 389 0.370759 0.99 ▁389
42199 466 0.370778 0.99
21626 249 0.370965 0.97 ▁249
32459 366 0.370968 0.97 ▁366
24693 237 0.370975 0.99 ▁237
35978 685 0.370984 0.99
29571 Eight 0.371012 0.99 ▁Eighth
35447 363 0.371055 1 ▁363
49150 736 0.371087 0.98
37381 695 0.37111 0.97
46519 782 0.371113 1
30120 407 0.37118 0.99 ▁407
48200 628 0.371211 0.99
8795 iscons 0.371278 0.86 isconsin, ▁Wisconsin, Wisconsin
20964 146 0.371387 0.98 ▁146
24982 ▁Consequently 0.371412 0.85
32971 WHAT 0.371468 0.96
48602 629 0.371486 0.95
48311 ilantro 0.371486 1
16263 ▁Obviously 0.371507 0.81
30995 347 0.37155 0.99 ▁347
18638 204 0.371607 0.98 ▁204, ▁2048, 20439
44994 533 0.371623 0.98
21652 185 0.371717 0.99 ▁185, ▁1850
39195 326 0.371721 0.96 ▁326
32568 282 0.371742 0.98 ▁282
37747 496 0.371808 0.99
45326 545 0.371823 0.98
48523 Herm 0.371903 1
28724 Eventually 0.371927 0.95
44218 554 0.371934 0.98
17854 Indeed 0.371957 0.96
43916 730 0.371975 0.98
16041 ▁referen 0.371991 0.61 ▁referenced, ▁referencing
8994 ailability 0.371991 0.29 ▁availability, Availability, channelAvailability, ▁Availability, availability
16454 Okay 0.372056 0.98 ▁Okay
42759 641 0.372059 0.97
32869 704 0.372072 0.99
30368 281 0.372129 0.98 ▁281
27057 181 0.372186 0.94 ▁181
41807 leanor 0.3723 0.95 ▁Eleanor
27326 335 0.372303 0.98 ▁335
36625 453 0.372313 0.99
42018 465 0.372327 0.97 ▁465
44085 518 0.37234 0.98
24898 Recomm 0.372369 0.96 ▁Recommend, ▁Recommended, Recommended, Recommend
33781 495 0.372375 0.99
47183 ▁Surprisingly 0.37238 0.87
37601 679 0.372385 0.96
23362 189 0.372406 0.97 ▁189, ▁1890, ▁1898, ▁1895, ▁1896, ...
41292 598 0.372419 0.99
48170 517 0.37243 0.99
40654 ▁284 0.372465 0.98
28054 1986 0.3725 0.98
19442 149 0.372582 0.99 ▁149
37466 656 0.372589 1 76561
31115 448 0.372593 1 ▁448
25191 259 0.372606 1 ▁259
25399 173 0.372654 0.98 ▁173
43198 Decre 0.372707 1
33916 324 0.372709 1 ▁324
43970 ▁Notwithstanding 0.372712 0.96
39768 ▁274 0.372774 0.97
40761 760 0.37278 0.99 7601, ▁760
39761 778 0.372841 1
35133 487 0.372855 1
20548 306 0.372929 0.99 ▁306
11486 Yet 0.373133 0.95
45455 799 0.373187 0.99
29743 uania 0.373187 0.76 ▁Lithuania
29639 inflamm 0.37321 1 inflammatory
27019 277 0.373227 0.98 ▁277
29558 263 0.373265 0.99 ▁263
38339 471 0.373312 0.99
39118 588 0.37332 0.99
29703 406 0.37332 1 ▁406
40220 1973 0.37334 0.97
7454 Once 0.373477 0.99
47785 519 0.373503 0.96
25022 268 0.373503 1 ▁268
21526 154 0.373566 0.99 ▁154
29110 1985 0.373615 0.99
34039 ▁Essentially 0.373649 0.89
48952 591 0.373676 0.97
42240 866 0.373686 0.98
40744 Manchester 0.373702 0.99
34173 ▁Honestly 0.373735 0.88
46785 Evidence 0.373767 0.99
31727 cffff 0.373776 0.63 cffffcc
35700 Pretty 0.37379 1
41199 surprisingly 0.373803 0.89 ▁unsurprisingly
29646 ▁gobl 0.37388 1 ▁goblin, ▁goblins
35195 361 0.373902 0.98 ▁361
29059 478 0.373927 0.99
27265 ▁Shortly 0.373984 0.5
40256 492 0.374041 0.98
22416 203 0.374045 0.98 ▁2030, ▁203
21273 158 0.37411 0.99 ▁158
50080 431 0.374125 0.95
50009 ▁strutConnector 0.374196 0.58
25270 288 0.374256 0.98 ▁288
48712 896 0.374324 0.98
28896 219 0.37435 0.97 ▁219
48365 751 0.374352 0.98
46890 Increase 0.374364 0.97
22515 305 0.37437 0.99 ▁305
49351 528 0.374378 0.99
17353 Would 0.374392 0.98 ▁Wouldn
20356 188 0.374395 0.97 ▁188, ▁1880, ▁1886, ▁1889, ▁1888
31916 604 0.374427 0.99
44729 ▁346 0.374443 0.98
30460 388 0.374529 0.99 ▁388
21315 208 0.374555 0.99 ▁208
43453 ▁SolidGoldMagikarp 0.374595 1
32148 332 0.374616 0.98 ▁332
33796 Important 0.374637 0.99
49327 ▁363 0.374653 0.99
45478 Danny 0.374668 0.99
11980 Have 0.374701 0.99 ▁Haven
13947 itzerland 0.374709 0.46 ▁Switzerland
46572 563 0.374736 0.99
37528 ▁258 0.374778 0.97
24038 707 0.374845 0.99 70710
37283 322 0.374872 0.98 ▁322
50242 794 0.374886 0.99
40401 789 0.374901 1
38582 Suddenly 0.374906 0.99
41392 Tue 0.374909 0.99 ▁Tues
35653 Were 0.374945 0.98 ▁Werewolf
32759 292 0.374945 0.99 ▁292
25272 196 0.374966 0.97 ▁196, 1969, 1960, 1968, 1967, ...
28977 271 0.374971 0.95 ▁271
23591 ▁Depending 0.375018 0.92
32113 addafi 0.375039 0.29 ▁Gaddafi
47159 661 0.375096 0.98
27301 1987 0.375128 0.99
32531 402 0.375136 0.97 ▁402
45473 ▁378 0.375152 0.97
37781 1977 0.375156 0.99
41742 awaited 0.375162 0.92
21920 innamon 0.375188 0.51 ▁cinnamon, ▁Cinnamon
24409 234 0.375246 0.98 ▁234
42218 excluding 0.375257 0.99
20986 165 0.375285 0.97 ▁165
24943 193 0.375296 0.98 ▁1933, ▁1936, ▁1938, ▁1937, ▁1934, ...
43336 ▁291 0.375305 0.95
27277 357 0.375348 1 ▁357
37688 784 0.375379 0.99
25674 267 0.375395 1 ▁267
42530 estyles 0.375424 1 ▁lifestyles
33153 ```` 0.375428 1
42444 675 0.375429 0.98
22186 195 0.375449 0.98 ▁1959, ▁1953, ▁1958, ▁1954, ▁195, ...
26583 Therefore 0.375474 0.96
43234 910 0.375567 0.96
36088 804 0.375609 1
16423 oubtedly 0.375616 0.48 ▁undoubtedly
28268 racuse 0.37563 0.97 ▁Syracuse
27534 ░░ 0.37569 1
19693 Everything 0.375697 0.99
17787 ▁cryst 0.375749 0.73 ▁crystals, ▁crystall
21495 308 0.375764 0.99 ▁308
4366 Some 0.375765 0.99 ▁Sometimes, ▁Something, Sometimes, ▁Someone, Something, ...
37856 472 0.375966 0.98
20213 ▁pestic 0.375993 0.98 ▁pesticides, ▁pesticide
5122 Our 0.376011 0.94
43193 652 0.376013 0.99
22996 307 0.376022 0.99 ▁307
24710 oldemort 0.37603 0.66 ▁Voldemort
40393 779 0.376053 0.99
32288 regor 0.376059 0.99 ▁McGregor
42311 arnaev 0.376095 0.97 ▁Tsarnaev
32351 Few 0.376218 0.93
21159 November 0.376248 0.99
22172 169 0.376268 0.97 ▁169
16676 Remember 0.376353 0.99
31869 esteem 0.376354 1 ▁esteem, ▁esteemed
43525 Located 0.37636 0.99
33023 hovah 0.376399 0.95 ▁Jehovah
28333 aepernick 0.376428 0.66 ▁Kaepernick
48194 762 0.376432 0.97
24369 290 0.376446 1 ▁290
34741 383 0.376447 0.97 ▁383
30505 455 0.37646 0.99 ▁455
26294 ioxid 0.376472 0.98 ▁antioxid, ▁antioxidant, ▁antioxidants
33622 Moving 0.376519 0.99
22288 1996 0.376588 0.99
42802 877 0.37664 0.99
8815 ▁tiss 0.376661 0.99 ▁tissue, ▁tissues
45975 Removed 0.376712 0.99
42184 Romney 0.376741 0.98
44826 ▁385 0.376742 0.98
4711 These 0.376754 0.98
49841 034 0.376813 0.98
33792 ormonal 0.376817 0.88 ▁hormonal
34229 454 0.376818 0.99
21875 Whatever 0.376819 0.99
26598 405 0.376825 0.99 ▁405
43356 423 0.376849 0.98 ▁423
1858 There 0.376871 0.97 ▁Therefore, ▁Theresa, Therefore
39322 580 0.376896 1 ▁580
38449 1975 0.376898 0.97
45572 Jessica 0.376902 0.99
41081 despite 0.376944 0.94
31575 342 0.376947 0.98 ▁342
6385 Since 0.37695 0.98
41172 785 0.376951 0.99
45151 725 0.376964 0.97
12486 ▁suspic 0.377045 1 ▁suspicious, ▁suspicion, ▁suspicions
26709 1988 0.377054 0.97
39135 ▁263 0.377056 0.99
42347 Residents 0.377071 0.98
18742 155 0.377099 0.99 ▁155
28567 355 0.377099 1 ▁355
43147 710 0.377196 0.97
34808 998 0.377205 0.98
49489 546 0.377214 0.97
23028 Impro 0.377225 0.99 ▁Improved, ▁Improvement, Improved, ▁Improvements, Improve
49374 Elsewhere 0.377227 0.97
38612 530 0.377234 0.99 ▁530
48800 ▁Presumably 0.377235 0.82
41103 ▁297 0.377267 0.96
39890 jriwal 0.377275 0.65 ▁Kejriwal
24970 254 0.377324 0.98 ▁254
49422 Supporters 0.377327 0.93
32576 475 0.377383 0.99 ▁475
46541 Integer 0.3774 1
40009 Various 0.377452 1
47946 ▁373 0.377486 0.99
38472 468 0.377523 0.98
39088 525 0.377559 0.97 ▁525
33524 anamo 0.377563 0.97 ▁Guantanamo
48156 792 0.377575 0.98
45403 1959 0.37761 0.91
47567 ▁353 0.377653 1
24403 227 0.377661 0.99 ▁227
4864 However 0.377671 0.97
17430 175 0.377721 0.99 ▁175
36886 STDOUT 0.37774 1
36879 665 0.377744 0.96
47576 544 0.37781 0.96
40427 552 0.377825 0.98
49542 522 0.377843 0.98
33580 504 0.377845 0.99 ▁504
25262 Between 0.377962 0.97
42363 427 0.378004 0.98 ▁427
41102 Looks 0.378015 0.97
41215 conservancy 0.37805 1 natureconservancy
45598 740 0.378119 0.98
39466 ▁279 0.378136 0.97
27033 286 0.378156 0.98 ▁286
15784 Looking 0.37824 0.99
16670 although 0.378246 0.97
42780 426 0.378322 0.99 ▁426
41810 ▁282 0.378328 0.98
29969 Democrats 0.378393 0.98
48524 728 0.378429 0.96
34934 Allows 0.378439 0.98 ▁Allows
47132 etchup 0.378486 1
44084 ▁348 0.378523 0.97
12630 ▁defic 0.378529 0.98 ▁deficits, ▁deficiency, ▁deficiencies, ▁deficient
44548 powerful 0.378557 0.97 ▁powerfully
42524 ivariate 0.378561 0.99
28544 Increases 0.37858 0.92
39380 662 0.378607 0.99
23721 238 0.378636 0.99 ▁238
32396 abortion 0.378638 1
6610 Another 0.378653 0.94
37950 1978 0.378705 0.96
39351 authored 0.37871 0.98
34511 ▁Conversely 0.378724 0.66
31579 Jere 0.378763 0.98 Jeremy, ▁Jeremiah
22263 ▁mosqu 0.378883 0.98 ▁mosques, ▁mosquit, ▁mosquito, ▁mosquitoes
43155 ▁341 0.378904 0.99
21113 1998 0.378906 0.98
19058 Qaeda 0.378916 0.98
33981 647 0.378941 1
41053 Chelsea 0.378958 1
45082 ▁perpend 0.379 0.79 ▁perpendicular
20621 akespe 0.37913 0.89 akespeare, ▁Shakespeare
38703 ▁277 0.379172 0.98
47173 Exactly 0.37918 1
21327 Currently 0.37919 0.98
48416 ▁shenan 0.379197 0.13 ▁shenanigans
15711 108 0.379201 0.99 ▁1080, 1080
37831 ▁259 0.379212 0.98
21719 260 0.379213 1 ▁2600
26007 393 0.37924 0.97
48882 747 0.379249 0.96
39114 ▁Needless 0.379253 0.99
36651 ▁Somehow 0.379323 0.95
36680 702 0.379376 0.99 ▁702
19782 190 0.379377 0.99 ▁190, ▁1900, ▁1905, ▁1901, ▁1908, ...
44087 022 0.379392 0.99
33238 ▁Assuming 0.379408 0.92
34825 447 0.37941 1
34475 dominated 0.379428 0.99
22229 Due 0.379438 0.98 ▁Duel
13518 Further 0.379497 0.96 Furthermore
45331 432 0.379618 0.98 ▁432
40236 FINEST 0.37962 0.99
44093 899 0.379623 0.99 ▁1899
34206 #$#$ 0.379624 0.96
38618 Avoid 0.379677 0.99
27251 ▁Nonetheless 0.379678 0.84
49517 712 0.379694 0.98
39906 EStream 0.379708 0.86 EStreamFrame
6987 ▁therap 0.379738 0.99 ▁therapy, ▁therape, ▁therapeutic, ▁therapist, ▁therapies, ...
42141 ▁329 0.379746 0.98
22245 Richard 0.379776 1
43019 ▁368 0.379797 0.99
22985 174 0.379844 0.99 ▁174
50030 ▁glared 0.379846 0.93
39937 ▁345 0.379872 0.99
41073 extremely 0.379876 0.97
25549 umbledore 0.379903 0.54 ▁Dumbledore
38776 ïve 0.379907 1 ▁naïve
28455 Republicans 0.379987 0.91
24102 Beyond 0.380025 0.99
44341 ▁342 0.380036 0.99
25429 233 0.380066 0.99 ▁233
3472 ▁streng 0.380107 1 ▁strength, ▁strengthen, ▁strengths, ▁strengthening, ▁strengthened, ...
47118 JOHN 0.380107 0.98
48758 ▁396 0.38011 0.99
30924 678 0.380116 1
40028 Anything 0.380118 1
20645 ▁dilig 0.380145 0.83 ▁diligence, ▁diligently, ▁diligent
11480 iversal 0.380171 0.99 ▁Universal, ▁universally, Universal, universal
46660 887 0.380171 0.99
22660 ▁Likewise 0.380182 0.66
39827 enfranch 0.380184 1 ▁disenfranch
21908 1995 0.380222 0.99
48250 635 0.380239 0.98
46951 870 0.380243 0.98
10294 Meanwhile 0.38028 0.96 ▁Meanwhile
42223 upuncture 0.38028 0.95 ▁acupuncture
24529 1991 0.380292 0.98
28018 urrencies 0.380353 0.45 ▁cryptocurrencies
45969 515 0.380378 0.97
28771 399 0.380379 0.98 ▁399
38172 755 0.380415 0.98
36174 ▁RandomRedditorWithNo 0.380443 0.84
31714 479 0.38045 0.99
19541 terrorism 0.380477 0.97 ▁counterterrorism
30885 ▁alleviate 0.380479 0.74
21940 167 0.380523 0.99 ▁167
24465 1993 0.380543 0.98
15046 Through 0.380571 1 ▁Throughout, Throughout
41945 efficients 0.380577 0.98 ▁coefficients
46871 773 0.380598 1
29458 Joseph 0.380622 1
35453 Improved 0.380628 0.98
44045 Rachel 0.380639 1
10060 ithub 0.380674 0.61 github, ▁github, ▁Github
24661 Recently 0.380686 0.99
47493 516 0.380713 0.96
48341 830 0.380738 0.98
34876 ortmund 0.380749 0.95 ▁Dortmund
19924 132 0.38076 1 ▁132
46636 421 0.380766 0.98 ▁421
35235 println 0.380792 1 ▁println
49786 evaluate 0.380798 1
5195 Why 0.38086 0.99
38503 1960 0.380901 0.96
8241 Who 0.380914 1 ▁Whole, ▁Whoever, Whoever
24909 226 0.380944 0.98 ▁226
23055 166 0.380984 0.98 ▁166
44367 ▁349 0.380987 0.98
46818 affiliated 0.381038 0.98
32437 ▁Smartstocks 0.38105 0.59
19626 Consider 0.381075 0.99 ▁Considering, Considering
33380 Educ 0.381093 0.97 Education
49814 ▁383 0.381109 0.99
39835 immigrant 0.38111 0.98
19420 126 0.381116 0.99 ▁126
46044 582 0.381219 0.95
43011 mittedly 0.381254 0.86
49633 ▁389 0.381271 0.98
40574 LinkedIn 0.38132 1
22501 strous 0.381355 0.99 ▁monstrous, ▁Monstrous
7571 Two 0.381357 0.98
37757 glomer 0.381363 0.99 ▁conglomer, ▁conglomerate
20809 136 0.381391 0.98 ▁136
43964 Growing 0.381403 0.98
40286 ▁309 0.381403 0.99
28239 Students 0.381443 0.99
44361 422 0.381443 0.97 ▁422
44064 doctoral 0.381445 0.99
28072 251 0.381449 0.99 ▁251
18092 ernandez 0.381466 0.98 ▁Hernandez, ▁Fernandez
41225 Qaida 0.381493 1
16371 Very 0.381524 0.98
11585 eatures 0.381538 0.56 ▁Features, Features, ▁Creatures, features
25475 1989 0.381587 0.99
23847 1992 0.381593 0.98
30365 Jonathan 0.381607 0.98
39254 570 0.381658 0.99 ▁570
21129 Anyone 0.381671 0.94
49234 952 0.381683 0.99
36042 318 0.381702 0.98 ▁318
36530 Playing 0.381705 1
41455 entanyl 0.381714 0.99 ▁fentanyl
42032 ▁283 0.381726 0.96
29613 otonin 0.381733 0.94 ▁serotonin
28998 perhaps 0.381787 0.95
27696 294 0.381827 0.95 ▁294
33646 488 0.381827 1
33207 WINDOWS 0.381829 1
27970 312 0.381853 0.99 ▁312
44750 793 0.381861 0.99
22666 1994 0.381865 0.96
15137 Four 0.381873 0.97 ▁Fourth, Fourth
27933 ampires 0.381938 0.99 ▁vampires
41521 traumatic 0.381982 1
35969 Palestinian 0.382011 0.99
46633 ▁372 0.382025 0.97
18298 116 0.38203 0.98 ▁116
42539 orthodox 0.382043 1 ▁orthodoxy, ▁unorthodox
48736 Michelle 0.382074 0.99
49111 Companies 0.382075 1
46438 594 0.382081 0.99
21106 Below 0.382082 0.99
42105 intuitive 0.382087 0.99
2504 That 0.382138 1 ▁Thatcher
27712 345 0.382152 0.99 ▁345
45271 1965 0.382189 0.97
18115 izoph 0.38219 1 izophren, ▁schizophren, ▁schizophrenia
7961 ▁obser 0.382218 0.97 ▁observe, ▁observations, ▁observation, ▁observers, ▁observing, ...
35443 Political 0.382244 0.99
32114 425 0.382299 0.98 ▁425
21395 153 0.382306 0.98 ▁153
24591 217 0.382327 0.98 ▁217
47213 Environmental 0.382333 0.99
47007 615 0.382339 0.97
33581 1979 0.382341 0.99
31654 505 0.38236 0.99 ▁505
2827 ▁surpr 0.382415 0.99 ▁surprise, ▁surprising, ▁surprised, ▁surprisingly, ▁surprises
33042 508 0.382415 0.98
37397 680 0.382476 0.99 ▁680
47578 ▁counteract 0.382497 0.83
22210 Something 0.382498 0.99
41208 1971 0.382572 0.96
42246 1968 0.382595 0.99
45261 ▁Numerous 0.382602 0.95
40097 ▁uphe 0.382628 0.99 ▁upheaval
20627 becca 0.38264 0.97 ▁Rebecca
21816 February 0.382647 1
42723 discrimination 0.382666 1
38485 Pakistan 0.382688 0.98
45833 Studies 0.382725 0.99
27367 273 0.38273 0.99 ▁273
43722 ▁331 0.382761 0.98
45987 026 0.382786 0.97
42489 ▁339 0.382804 0.97
28933 667 0.382806 0.99
41235 ▁294 0.38284 0.96
24940 236 0.382859 0.97 ▁236
36453 321 0.382877 0.95 ▁321
2215 When 0.38289 0.99 ▁Whenever, Whenever
19708 137 0.382932 0.99 ▁137
26582 325 0.382945 0.99 ▁325
47701 greSQL 0.382954 0.54
46238 Refer 0.382966 0.98 ▁Referred
40353 884 0.382974 1
39294 Nusra 0.382986 1
46841 614 0.383002 0.99
26200 408 0.383022 0.98 ▁408
42980 882 0.383023 0.99
5997 sembly 0.383042 0.068 ▁Assembly, ▁assembly, assembly, Assembly
48337 Jamie 0.383065 1
8421 Before 0.383101 0.99

Tokens with partial UTF-8 sequences

1 entries below threshold of 0.041

token_id token indicator in_other_tokens
39820 龍<0xE5><0xA5> 0.00473803 龍契士
215 additional entries above threshold
token_id token indicator in_other_tokens
33434 <0x96><0x9A>士 0.228036 龍喚士
13945 <0xA5><0x9E> 0.303934 , ▁神
23596 <0x93><0x98> 0.345571 ▁ⓘ,
39374 <0x91>士 0.346465 龍契士
47490 <0xA9><0xB6><0xE6> 0.35787 <0xA9><0xB6>極
32003 <0xE8><0x80> 0.387207
48071 <0xE0><0xA6> 0.387591
34247 ا<0xD8> 0.393233
33951 י<0xD7> 0.396959
6408 <0xA3><0x8F> 0.404938 ▁裏, ▁裏<0xE7>, ▁裏覚醒, ▁裏<0xE8>
28938 <0xE5><0x90> 0.410571
22757 <0x9A>醒 0.412554 覚醒, ▁裏覚醒
4204 <0xBF><0xBD> 0.415587 , ��, ����, ▁�, ▁����, ...
32573 <0xE8><0xBF> 0.415693
19469 <0xE0><0xA8> 0.415795 ▁<0xE0><0xA8>
11737 <0xE9><0xBE> 0.417682 , 龍<0xE5>, 龍喚士, 龍<0xE5><0xA5>, 龍契士
49035 <0xE5><0x87> 0.417972
45495 <0xE1><0xBD> 0.419487
46763 <0xE6><0x95> 0.420892
43297 <0xE0><0xA9> 0.420944
22887 <0xE5><0xB0> 0.421548
31204 <0x96><0x9A> 0.421635 <0x96><0x9A>士, 龍喚士
43897 <0xE6><0xA9> 0.421635
25001 <0xE5><0xA5> 0.421892 龍<0xE5><0xA5>, 龍契士,
46479 <0xE4><0xBF> 0.422215
11805 <0x98><0x85> 0.422489 , ▁★, ★★
47797 <0xE8><0x83> 0.423382
43380 <0xE5><0xAF> 0.424724
29826 <0xE6><0xAD> 0.424764
47728 <0xF0><0x9D> 0.425955
8955 <0x82><0xAC> 0.426059 ▁€, ,
18004 <0xE5><0xA3> 0.426143 , <0x96><0x9A>士, 龍喚士, <0x91>士, 龍契士
42314 ▁<0xE0><0xA8> 0.426841
39355 <0xE5><0x8D> 0.426848
36596 <0xBB><0x92> 0.427598
28839 <0xE5><0x9C> 0.427676
34402 <0xE9><0x81> 0.427756
33768 <0xE6><0x97> 0.428502
45911 <0xE7><0x95> 0.429211
33699 <0xE6><0x89> 0.429291
2515 <0xE3><0x81> 0.429297 , の<0xE5>, の<0xE7>, , , ...
22880 <0xE2><0x95> 0.429324 , ══
33566 <0xE7><0x9B> 0.429759
48958 <0xE8><0x88> 0.429958
35975 <0xEC><0x9D> 0.430034
25443 о<0xD0> 0.430098
45617 <0xE9><0xA3> 0.431005
32518 <0xE8><0xA3> 0.431464 ▁裏<0xE8>,
45739 <0xE8><0xAA> 0.431576
37863 <0xE5><0x86> 0.431818
20998 <0xE5><0x8F> 0.432199
34460 <0xE9><0x80> 0.432718
35707 <0xE6><0x84> 0.433167
12859 <0xE4><0xBA> 0.433441 ,
31479 <0xE0><0xB9> 0.433516
45379 <0xE7><0x8B> 0.433571
36181 <0xE5><0xBE> 0.433895
17358 <0xE8><0xA6> 0.434581 覚醒, ▁裏覚醒
46237 <0xE8><0xAF> 0.434703
29785 <0xE9><0x97> 0.434855
47991 <0xED><0x95> 0.434976
22755 <0xE6><0x88> 0.435315
47947 <0xE5><0x8B> 0.435446
27764 <0xE5><0xAD> 0.435643
34932 <0xE9><0x87> 0.435759
33232 <0xE5><0xBF> 0.435851
18923 ▁<0xD9> 0.435993 ▁و, ▁م
26344 <0xE5><0x88> 0.435996
38184 <0xE6><0xB5> 0.436985
28156 <0xE5><0xBC> 0.437017
46349 <0xE6><0x83> 0.437233
38461 <0xE9><0x96> 0.437295
45250 <0xE6><0x80> 0.437334
41840 <0xF0><0x9F><0x91> 0.43779 ▁<0xF0><0x9F><0x91>
40367 <0xE7><0x9C> 0.437845
17739 <0xE5><0x85> 0.437871
30298 <0xE5><0x89> 0.43827
11976 <0xE0><0xA4> 0.438317 ▁<0xE0><0xA4>,
18796 <0xE7><0x94> 0.438625 ,
20015 <0xE4><0xBB> 0.438872
37239 <0xE9><0x9B> 0.439004
31965 <0xE7><0x89> 0.439303
43718 <0xE6><0xA0> 0.43944
10310 <0xE4><0xB8> 0.439785 , , , ,
36685 <0xE5><0xA6> 0.439828
46695 <0xEB><0x8B> 0.441001
20046 <0xE4><0xB9> 0.44118
35069 <0xA5><0xB5> 0.441437 <0xA9><0xB6>極
23877 <0xE6><0x96> 0.441438
45865 <0xAB><0x98> 0.442465
47703 <0xA9><0xB6>極 0.442638
39611 <0xE1><0xB5> 0.442711
44293 <0xE5><0x8C> 0.443388
13783 <0xE5><0xA4> 0.444074 , , ▁<0xE5><0xA4>
36365 <0xE6><0xB0> 0.444148
21253 <0x9A><0xE9> 0.444315 <0x9A>醒, 覚醒, ▁裏覚醒
37605 <0xE5><0xBD> 0.445451
10253 <0x86><0x92> 0.445478 ▁→, <0x9A>醒, 覚醒, ▁裏覚醒,
7134 <0x88><0x92> 0.445621 ▁−, , ▁(−
41585 <0xE1><0xB8> 0.446065
22522 <0xE5><0xAE> 0.446072 の<0xE5><0xAE>
43769 <0x81><0xAB> 0.446723
17550 ▁<0xD8> 0.446802 ▁ال
27670 <0xE4><0xBC> 0.446905
34650 <0xE5><0xA7> 0.447247
43102 <0xE8><0xBB> 0.448647
45784 <0xE7><0xB7> 0.448778
37345 <0xE6><0xB3> 0.448809
17312 <0xE6><0x9C> 0.448831 ▁<0xE6><0x9C>
31619 ▁<0xEB> 0.448883
19526 <0xE4><0xBD> 0.449387 , 使
41753 <0xE5><0xBA> 0.449466
43636 <0xE5><0x82> 0.449937
49694 <0xE9><0x9A> 0.450265
32432 <0xE5><0xB7> 0.450722
49426 <0xE7><0x90> 0.451393
26292 <0xE1><0xB9> 0.451815
41340 <0xE0><0xBC> 0.452574
26534 <0x85><0x8B> 0.452691 , ㅋㅋ
45539 <0xAC><0xBC> 0.452921
46788 <0x8A><0xB1> 0.45309
44165 <0xE7><0xAB> 0.453118
47249 <0xF0><0x9F><0x98> 0.45324
43889 <0xE5><0x8E> 0.454025
24231 <0xE0><0xA5> 0.4547
19021 <0xE7><0x9A> 0.454705
30266 <0xE6><0x9D> 0.454756
25081 <0x99><0x82> 0.454799 ▁🙂
37772 <0xE5><0x91> 0.455125
29773 <0xEE><0x80> 0.455542
33176 <0xE5><0xB9> 0.455642
28225 ▁<0xE0><0xA4> 0.456704
23329 <0x8E><0x8B> 0.457161
35705 <0xE2><0x89> 0.45729 ▁≡, ▁≤
6552 <0xE2><0x94> 0.457555 , ──, ▁<0xE2><0x94>, ────, ▁│, ...
27950 <0xE5><0x8A> 0.45823
19049 龍<0xE5> 0.458679 龍喚士, 龍<0xE5><0xA5>, 龍契士
32849 <0xE9><0x83> 0.459038
50159 <0x99><0xBD> 0.459332
23294 ▁<0xE3><0x81> 0.459735
30585 <0xE5><0xB8> 0.46045
47078 <0xE7><0x84> 0.461019
15926 <0xE2><0x97> 0.461583 ▁<0xE2><0x97>, , ▁●,
12045 ー<0xE3><0x83> 0.461599 ーン, ール, ーテ, ーティ, ▁サーティ, ...
19567 <0xE0><0xB8> 0.462583
45433 <0x81><0x96> 0.46347
41365 <0x82><0xAA> 0.463866
32368 <0xE5><0x9B> 0.464174
15139 ▁<0xE2><0x89> 0.464177 ▁≥, ▁≡, ▁≤
39333 <0xB2><0xBE> 0.464247
20174 ▁裏<0xE7> 0.465363
48953 <0xAD><0xB7> 0.466475
42062 <0x88><0xE8> 0.466591
26193 <0xE8><0xA1> 0.466739
1792 <0xE3><0x82> 0.466803 , , , , , ...
36469 ▁<0xE5><0xA4> 0.46719
49149 の<0xE5><0xAE> 0.467537
42164 ▁<0xE6><0x9C> 0.469114
27032 の<0xE6> 0.469251
35266 <0xEF><0xB8> 0.470587
35050 <0xB6><0xE6> 0.470866 <0xA9><0xB6><0xE6>, <0xA9><0xB6>極
18433 <0xAD><0x94> 0.471003 , の魔
23821 ▁<0xEC> 0.473271
41678 <0xB6><0x85> 0.473586
50169 ▁<0xF0><0x9F><0x91> 0.474191
8582 <0xF0><0x9F> 0.475276 ▁<0xF0><0x9F>, ▁<0xF0><0x9F><0x98>, ▁🙂, <0xF0><0x9F><0x91>, <0xF0><0x9F><0x98>, ...
29705 <0xE2><0x86> 0.475609 ,
33426 の<0xE9> 0.475976 の魔
1209 <0xE3><0x83> 0.476922 , , , , , ...
43518 <0x82><0x8E> 0.477081
18074 ▁<0xCF> 0.477488 ▁τ
5008 <0xE2><0x96> 0.478699 , ██, ▁<0xE2><0x96>, ████, , ...
17992 <0xE2><0x99> 0.479316 ▁<0xE2><0x99>, ,
23626 <0xE6><0x98> 0.479671
30325 ▁<0xF0><0x9F><0x98> 0.480356
34504 ▁裏<0xE8> 0.480646
17433 ▁<0xE3><0x82> 0.48094 ▁サ, ▁サーティ, ▁サーティワン
24966 ▁<0xE2><0x97> 0.481815 ▁●
26486 <0xE2><0x9C> 0.482683 ▁✔
24583 <0xE2><0x98> 0.482836 ★★, ▁<0xE2><0x98>,
14360 ▁<0xD7> 0.483417
28053 ▁<0xE1> 0.486821
8008 <0x84><0xA2> 0.487476 , ™:
32391 <0xE2><0x9D> 0.489125 ▁<0xE2><0x9D>
14524 ▁<0xE3><0x83> 0.489284
17683 の<0xE7> 0.489534
13305 ▁<0xE2><0x94> 0.490468 ▁│, ▁├, ▁├──
15474 の<0xE5> 0.490928 の<0xE5><0xAE>
5525 ▁<0xE8> 0.490951 ▁裏, ▁裏<0xE7>, ▁裏覚醒, ▁裏<0xE8>
16268 ▁<0xE9> 0.491125
18872 ▁<0xE2><0x88> 0.491269 ▁∼
42527 ▁<0xE2><0x87> 0.492102
10545 ▁<0xE6> 0.495099 ▁<0xE6><0x9C>
17804 ▁<0xE2><0x86> 0.495721 ▁↑
24861 <0xE2><0x88> 0.497081 ▁(−, ▁∼
13328 ▁<0xE7> 0.497473 ▁神
34719 ▁<0xE2><0x98> 0.497487
14519 ▁<0xE2><0x9C> 0.497928 ▁✓, ▁✔
5099 <0xE3><0x80> 0.498404 , , , , , ...
34754 ▁<0xC4> 0.499504
7377 ▁<0xCE> 0.500787 ▁μ, ▁α, ▁β, ▁Δ, ▁μg
43074 ▁<0xE2><0x9D> 0.501241
46256 <0xE2><0x81> 0.503255
12466 ▁<0xD0> 0.504206
10263 ▁<0xE5> 0.505015 ▁<0xE5><0xA4>
12520 ▁<0xF0><0x9F> 0.507708 ▁<0xF0><0x9F><0x98>, ▁🙂, ▁<0xF0><0x9F><0x91>
25370 ▁<0xC5> 0.507781
1587 ▁<0xC2> 0.517733 ▁£, ▁\xa0, ▁±, ▁§, ▁©, ...
27332 ▁<0xEF> 0.520638 ▁��������
20724 ▁<0xE2><0x99> 0.521312
11019 ▁<0xE2><0x96> 0.529615 ▁█, ▁■, ▁►
6184 ▁<0xC3> 0.541164 ▁×, ▁à, ▁é, ▁þ, ▁É, ...
2343 ▁<0xE2> 0.543048 ▁…, ▁•, ▁−, ▁€, ▁<0xE2><0x96>, ...
447 <0xE2><0x80> 0.546348 ▁<0xE2><0x80>, ▁–, ▁—, , , ...
564 ▁<0xE2><0x80> 0.558202 ▁–, ▁—, ▁…, ▁•, ▁\u200b, ...

Byte tokens

45 entries below threshold of 0.005

token_id token indicator ord hex byte_type
178 <0xF6> 0.00357658 246 0xF6 unused_utf8
183 <0xFB> 0.00366396 251 0xFB unused_utf8
185 <0xFD> 0.00367862 253 0xFD unused_utf8
180 <0xF8> 0.0036788 248 0xF8 unused_utf8
184 <0xFC> 0.00368083 252 0xFC unused_utf8
187 <0xFF> 0.00386095 255 0xFF unused_utf8
179 <0xF7> 0.00395703 247 0xF7 unused_utf8
186 <0xFE> 0.00401849 254 0xFE unused_utf8
177 <0xF5> 0.00404406 245 0xF5 unused_utf8
182 <0xFA> 0.00411302 250 0xFA unused_utf8
210 \x16 0.00416565 22 0x16 ascii
197 \t 0.00420243 9 0x09 ascii
181 <0xF9> 0.00422907 249 0xF9 unused_utf8
207 \x13 0.00434786 19 0x13 ascii
124 <0xC0> 0.00435007 192 0xC0 unused_utf8
189 \x01 0.00436115 1 0x01 ascii
192 \x04 0.00437033 4 0x04 ascii
215 \x1b 0.00446457 27 0x1B ascii
217 \x1d 0.00447732 29 0x1D ascii
188 \x00 0.00448298 0x00 ascii
25 additional entries below threshold
token_id token indicator ord hex byte_type
205 \x11 0.00454181 17 0x11 ascii
221 \x7f 0.0045619 127 0x7F ascii
196 \x08 0.00457859 8 0x08 ascii
191 \x03 0.00458455 3 0x03 ascii
211 \x17 0.00458777 23 0x17 ascii
209 \x15 0.00459915 21 0x15 ascii
218 \x1e 0.00462502 30 0x1E ascii
219 \x1f 0.00462788 31 0x1F ascii
201 \r 0.00464106 13 0x0D ascii
199 \x0b 0.00464898 11 0x0B ascii
125 <0xC1> 0.00469691 193 0xC1 unused_utf8
213 \x19 0.00470841 25 0x19 ascii
214 \x1a 0.00472432 26 0x1A ascii
216 \x1c 0.00474203 28 0x1C ascii
204 \x10 0.00481361 16 0x10 ascii
195 \x07 0.00482035 7 0x07 ascii
208 \x14 0.00483632 20 0x14 ascii
202 \x0e 0.00483972 14 0x0E ascii
200 \x0c 0.0048672 12 0x0C ascii
193 \x05 0.00487089 5 0x05 ascii
206 \x12 0.00490856 18 0x12 ascii
190 \x02 0.00498682 2 0x02 ascii
194 \x06 0.00503385 6 0x06 ascii
212 \x18 0.00508702 24 0x18 ascii
203 \x0f 0.005108 15 0x0F ascii
211 additional entries above threshold
token_id token indicator ord hex byte_type
153 <0xDD> 0.101624 221 0xDD utf8
174 <0xF2> 0.133032 242 0xF2 utf8
173 <0xF1> 0.158842 241 0xF1 utf8
154 <0xDE> 0.236744 222 0xDE utf8
176 <0xF4> 0.266074 244 0xF4 utf8
155 <0xDF> 0.272137 223 0xDF utf8
175 <0xF3> 0.352298 243 0xF3 utf8
152 <0xDC> 0.361692 220 0xDC utf8
150 <0xDA> 0.375412 218 0xDA utf8
145 <0xD5> 0.394886 213 0xD5 utf8
147 <0xD7> 0.402306 215 0xD7 utf8
148 <0xD8> 0.402662 216 0xD8 utf8
143 <0xD3> 0.412755 211 0xD3 utf8
169 <0xED> 0.424768 237 0xED utf8
160 <0xE4> 0.425957 228 0xE4 utf8
151 <0xDB> 0.426262 219 0xDB utf8
149 <0xD9> 0.426925 217 0xD9 utf8
139 <0xCF> 0.427022 207 0xCF utf8
144 <0xD4> 0.427941 212 0xD4 utf8
172 <0xF0> 0.428334 240 0xF0 utf8
161 <0xE5> 0.434057 229 0xE5 utf8
164 <0xE8> 0.434989 232 0xE8 utf8
167 <0xEB> 0.436208 235 0xEB utf8
166 <0xEA> 0.436994 234 0xEA utf8
131 <0xC7> 0.437204 199 0xC7 utf8
162 <0xE6> 0.438629 230 0xE6 utf8
133 <0xC9> 0.441823 201 0xC9 utf8
168 <0xEC> 0.444221 236 0xEC utf8
142 <0xD2> 0.448408 210 0xD2 utf8
165 <0xE9> 0.448891 233 0xE9 utf8
159 <0xE3> 0.457052 227 0xE3 utf8
163 <0xE7> 0.457378 231 0xE7 utf8
156 <0xE0> 0.457913 224 0xE0 utf8
138 <0xCE> 0.462875 206 0xCE utf8
130 <0xC6> 0.46431 198 0xC6 utf8
132 <0xC8> 0.466773 200 0xC8 utf8
137 <0xCD> 0.471711 205 0xCD utf8
141 <0xD1> 0.472568 209 0xD1 utf8
146 <0xD6> 0.47605 214 0xD6 utf8
170 <0xEE> 0.477504 238 0xEE utf8
140 <0xD0> 0.482614 208 0xD0 utf8
157 <0xE1> 0.482663 225 0xE1 utf8
95 <0xA2> 0.487739 162 0xA2 utf8
107 <0xAF> 0.488241 175 0xAF utf8
111 <0xB3> 0.488399 179 0xB3 utf8
223 <0x81> 0.488869 129 0x81 utf8
241 <0x93> 0.489428 147 0x93 utf8
224 <0x82> 0.491641 130 0x82 utf8
247 <0x99> 0.491763 153 0x99 utf8
110 <0xB2> 0.492095 178 0xB2 utf8
244 <0x96> 0.492177 150 0x96 utf8
225 <0x83> 0.492266 131 0x83 utf8
112 <0xB4> 0.492432 180 0xB4 utf8
253 <0x9F> 0.492697 159 0x9F utf8
251 <0x9D> 0.493287 157 0x9D utf8
254 <0xA0> 0.494277 160 0xA0 utf8
255 <0xAD> 0.494315 173 0xAD utf8
252 <0x9E> 0.49484 158 0x9E utf8
246 <0x98> 0.494851 152 0x98 utf8
114 <0xB6> 0.494869 182 0xB6 utf8
171 <0xEF> 0.494928 239 0xEF utf8
109 <0xB1> 0.495309 177 0xB1 utf8
248 <0x9A> 0.495805 154 0x9A utf8
119 <0xBB> 0.496022 187 0xBB utf8
128 <0xC4> 0.496442 196 0xC4 utf8
243 <0x95> 0.497313 149 0x95 utf8
242 <0x94> 0.497334 148 0x94 utf8
249 <0x9B> 0.499328 155 0x9B utf8
106 <0xAE> 0.499618 174 0xAE utf8
104 <0xAB> 0.499897 171 0xAB utf8
250 <0x9C> 0.500014 156 0x9C utf8
101 <0xA8> 0.500088 168 0xA8 utf8
96 <0xA3> 0.500267 163 0xA3 utf8
235 <0x8D> 0.500727 141 0x8D utf8
134 <0xCA> 0.500826 202 0xCA utf8
99 <0xA6> 0.500846 166 0xA6 utf8
21 6 0.500962 54 0x36 ascii
230 <0x88> 0.501244 136 0x88 utf8
135 <0xCB> 0.501358 203 0xCB utf8
227 <0x85> 0.501449 133 0x85 utf8
238 <0x90> 0.501621 144 0x90 utf8
222 <0x80> 0.501622 128 0x80 utf8
98 <0xA5> 0.501672 165 0xA5 utf8
228 <0x86> 0.501718 134 0x86 utf8
94 <0xA1> 0.501764 161 0xA1 utf8
229 <0x87> 0.50192 135 0x87 utf8
233 <0x8B> 0.502061 139 0x8B utf8
232 <0x8A> 0.503524 138 0x8A utf8
239 <0x91> 0.503668 145 0x91 utf8
97 <0xA4> 0.504801 164 0xA4 utf8
245 <0x97> 0.504841 151 0x97 utf8
118 <0xBA> 0.504898 186 0xBA utf8
226 <0x84> 0.505213 132 0x84 utf8
24 9 0.505289 57 0x39 ascii
108 <0xB0> 0.505336 176 0xB0 utf8
40 I 0.505377 73 0x49 ascii
22 7 0.505386 55 0x37 ascii
100 <0xA7> 0.506099 167 0xA7 utf8
103 <0xAA> 0.50691 170 0xAA utf8
105 <0xAC> 0.507201 172 0xAC utf8
237 <0x8F> 0.507313 143 0x8F utf8
113 <0xB5> 0.507505 181 0xB5 utf8
121 <0xBD> 0.508921 189 0xBD utf8
236 <0x8E> 0.509359 142 0x8E utf8
136 <0xCC> 0.509445 204 0xCC utf8
240 <0x92> 0.511487 146 0x92 utf8
23 8 0.51159 56 0x38 ascii
234 <0x8C> 0.511855 140 0x8C utf8
102 <0xA9> 0.513131 169 0xA9 utf8
117 <0xB9> 0.513452 185 0xB9 utf8
116 <0xB8> 0.513628 184 0xB8 utf8
92 } 0.516246 125 0x7D ascii
115 <0xB7> 0.517139 183 0xB7 utf8
20 5 0.517951 53 0x35 ascii
122 <0xBE> 0.519544 190 0xBE utf8
231 <0x89> 0.520111 137 0x89 utf8
123 <0xBF> 0.520551 191 0xBF utf8
129 <0xC5> 0.522187 197 0xC5 utf8
32 A 0.523314 65 0x41 ascii
38 G 0.523397 71 0x47 ascii
39 H 0.524269 72 0x48 ascii
45 N 0.524296 78 0x4E ascii
158 <0xE2> 0.524765 226 0xE2 utf8
47 P 0.525705 80 0x50 ascii
51 T 0.526013 84 0x54 ascii
36 E 0.52649 69 0x45 ascii
120 <0xBC> 0.526868 188 0xBC utf8
57 Z 0.528437 90 0x5A ascii
53 V 0.528514 86 0x56 ascii
37 F 0.529285 70 0x46 ascii
46 O 0.530764 79 0x4F ascii
127 <0xC3> 0.530766 195 0xC3 utf8
126 <0xC2> 0.53111 194 0xC2 utf8
54 W 0.531545 87 0x57 ascii
56 Y 0.531745 89 0x59 ascii
49 R 0.532044 82 0x52 ascii
35 D 0.532831 68 0x44 ascii
43 L 0.533514 76 0x4C ascii
33 B 0.534334 66 0x42 ascii
90 { 0.536511 123 0x7B ascii
44 M 0.53667 77 0x4D ascii
42 K 0.537574 75 0x4B ascii
15 0 0.53768 48 0x30 ascii
19 4 0.538883 52 0x34 ascii
41 J 0.539086 74 0x4A ascii
34 C 0.539478 67 0x43 ascii
50 S 0.541234 83 0x53 ascii
48 Q 0.546992 81 0x51 ascii
52 U 0.547228 85 0x55 ascii
61 ^ 0.547996 94 0x5E ascii
18 3 0.549325 51 0x33 ascii
3 $ 0.551978 36 0x24 ascii
63 ` 0.562052 96 0x60 ascii
80 q 0.562073 113 0x71 ascii
27 < 0.562264 60 0x3C ascii
55 X 0.563346 88 0x58 ascii
93 ~ 0.565295 126 0x7E ascii
17 2 0.56656 50 0x32 ascii
2 # 0.56879 35 0x23 ascii
59 \ 0.568917 92 0x5C ascii
91 | 0.568966 124 0x7C ascii
4 % 0.570719 37 0x25 ascii
29 > 0.571246 62 0x3E ascii
28 = 0.573413 61 0x3D ascii
16 1 0.576343 49 0x31 ascii
60 ] 0.580425 93 0x5D ascii
74 k 0.581422 107 0x6B ascii
5 & 0.582079 38 0x26 ascii
31 @ 0.582178 64 0x40 ascii
67 d 0.584961 100 0x64 ascii
85 v 0.587432 118 0x76 ascii
79 p 0.587719 112 0x70 ascii
70 g 0.587884 103 0x67 ascii
69 f 0.590236 102 0x66 ascii
73 j 0.591116 106 0x6A ascii
65 b 0.591514 98 0x62 ascii
86 w 0.591966 119 0x77 ascii
71 h 0.59446 104 0x68 ascii
1 " 0.595244 34 0x22 ascii
84 u 0.595933 117 0x75 ascii
83 t 0.601825 116 0x74 ascii
81 r 0.601858 114 0x72 ascii
58 [ 0.602964 91 0x5B ascii
76 m 0.603477 109 0x6D ascii
89 z 0.605349 122 0x7A ascii
66 c 0.605613 99 0x63 ascii
7 ( 0.606195 40 0x28 ascii
10 + 0.607464 43 0x2B ascii
8 ) 0.607725 41 0x29 ascii
77 n 0.60812 110 0x6E ascii
75 l 0.608751 108 0x6C ascii
9 * 0.608906 42 0x2A ascii
87 x 0.611797 120 0x78 ascii
26 ; 0.616286 59 0x3B ascii
30 ? 0.616614 63 0x3F ascii
62 _ 0.618593 95 0x5F ascii
64 a 0.619658 97 0x61 ascii
0 ! 0.621929 33 0x21 ascii
68 e 0.62241 101 0x65 ascii
78 o 0.623259 111 0x6F ascii
72 i 0.624589 105 0x69 ascii
88 y 0.6323 121 0x79 ascii
220 0.649912 32 0x20 ascii
6 ' 0.652061 39 0x27 ascii
82 s 0.664972 115 0x73 ascii
25 : 0.700677 58 0x3A ascii
14 / 0.706989 47 0x2F ascii
198 \n 0.720999 10 0x0A ascii
13 . 0.773379 46 0x2E ascii
12 - 0.783385 45 0x2D ascii
11 , 0.816911 44 0x2C ascii

Special tokens

0 entries below threshold of 0.005

1 additional entries above threshold
token_id token indicator
50256 <|endoftext|> 0.587272