Wikipedia:Typo Team/moss
This page has a backlog that requires the attention of willing editors. Please remove this notice if and when the backlog is cleared. |
The moss project seeks to find and remove the furry green typos that have been growing on Wikipedia articles. It uses software written by User:Beland to automatically find misspellings, mistakes in English grammar, violations of the Wikipedia:Manual of Style, and confusing or broken wiki markup.
About misspellingsEdit
How the lists are madeEdit
The moss spell checker is run against a recent set of database dumps, which are generated on the 1st and 20th of every month (but take a few days to process). All the articles in the English Wikipedia are examined. The following are ignored:
- Text inside references, templates, tables, quotation marks, sections like "External links" and "Works", and some other weird places.
- Capitalized words (which are presumed to be correctly-spelled proper nouns)
- Words that appear in titles in the English Wiktionary (which has definitions of all words in all languages, excluding proper nouns and systematic words like chemical names and large numbers)
- Words that appear in titles in the English Wikipedia (which explains some things that don't appear in the dictionary)
- Words that appear in titles in the Wikispecies (which has many technical words that don't appear in the dictionary or encyclopedia)
Many mistakes are not (yet) caught:
- Improper addition of 's (possessives are not added to Wiktionary, so these are excluded systematically)
- Incorrect capitalization
- Incorrect multi-word phrases
- Wrong word used in context
- Non-English language words not tagged with {{lang}} or where an English misspelling happens to be the same as a word in another language. (These are counted as correct spellings if they are in the English Wiktionary, which lists words in all languages – only the definitions are restricted to English.)
- Other situations listed in #False negatives below
New statisticsEdit
- See also: Older statistics
In the year from March 2019 to March 2020, moss volunteers fixed over 94,000 typos! The most impressive progress is in the T1 category (single-letter misspellings), where we eliminated about half from the English Wikipedia. During this period we also started fixing missing spaces (focusing on those around punctuation) and those have dropped by about one-fifth. As we make progress, clear misspellings are increasingly mixed in with unclear cases; I'll be doing some more work on separation algorithms to keep the typo reports useful, so you'll probably see some more changes to typo classifications. Thanks to everyone who has been helping out! -- Beland (talk) 16:54, 28 April 2020 (UTC)
Reporting symbol | Explanation | Change from 2019-03-01 to 2020-02-20 | Instances, 2020-04-01 dump (9f6d726) | Instances, 2020-04-20 dump (5ff589d) | Instances, 2020-05-01 dump (1a96ded) | Instances, 2020-05-20 dump (e511f74) | Instances, 2020-06-01 dump (509f79a) | Instances, 2020-06-20 dump (825ceb4) | Instances, 2020-07-01 dump (db9db23) | Instances, 2020-07-20 dump (caa619f) | Instances, 2020-08-01 dump (cf76e8c) | Instances, 2020-08-20 dump (f104e58) | Instances, 2020-09-01 dump (4654d88) | Instances, 2020-09-20 dump (a26ccca) | Instances, 2020-10-01 dump (686f5db) | Instances, 2020-10-20 dump (4f90810) | Instances, 2020-11-01 dump (ac54580) |
---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
TS | Missing or extra whitespace or dash (or new compound) | -39368 (-21%) | 145297 | 144673 | 331658** | 330624 | 328249 | 325399 | 324179 | 322282 | 321801 | 318621 | 317183 | 315825 | 314747 | 312110 | 310537 |
T1 | Edit distance 1 from common English word | -36192 (-48%) | 41090 | 41081 | 39967 | 39452 | 38783 | 38379 | 38436 | 38271 | 37803 | 36783 | 35976 | 34036 | 33539 | 33764 | 32347 |
T2 | Edit distance 2 from common English word | -7560 (-10%) | 64526 | 63263 | 60690 | 60321 | 59589 | 58603 | 58649 | 58521 | 58200 | 58085 | 57845 | 57329 | 57152 | 57487 | 57387 |
T3 | Edit distance 3 from common English word | -5276 (-7%) | 74396 | 73255 | 70516 | 70039 | 68887 | 68192 | 68149 | 68020 | 67769 | 67788 | 67482 | 67226 | 67025 | 67101 | 67002 |
R | Regular word (A-Z only) not near a common English word | -3525 (-3%) | 97726 | 96916 | 94793 | 93855 | 93252 | 91537 | 91489 | 91746 | 91521 | 91729 | 91513 | 91613 | 91339 | 91813 | 92329 |
I | Definitely not English (International) due to accents or mixed with punctuation (other than hyphen) | -22196 (-24%) | 72151 | 69118 | 65842 | 64827 | 63630 | 61844 | 61888 | 61782 | 61899 | 62113 | 61916 | 62003 | 62049 | 62274 | 62287 |
W | Not in English Wiktionary, in non-English Wiktionary | -6764 (-8%) | 75913 | 74351 | 86935 | 85604 | 83173 | 81894 | 81946 | 82173 | 81943 | 82170 | 81912 | 81968 | 81792 | 81256 | 81052 |
L | Probable Romanization (transLiteration) | +81 (+2%) | 4435 | 4486 | 4266 | 4199 | 4120 | 4122 | 4104 | 4113 | 4137 | 4140 | 4151 | 4164 | 4165 | 4207 | 4203 |
ME | Probable coMpound, English (with and without dash) | +976 (+2%) | 52269 | 48761 | 47187 | 47153 | 46830 | 46856 | 46967 | 47163 | 47052 | 47170 | 47009 | 47070 | 47066 | 47045 | 47023 |
MI | Probable coMpound, non-English (International) in English Wiktionary (both A-Z and non-ASCII characters, with and without dash) | -18475 (-9%) | 177646 | 176929 | 171484 | 169592 | 166216 | 164828 | 165140 | 165351 | 165605 | 166016 | 166208 | 166499 | 166572 | 167349 | 167961 |
MW | Probable coMpound, found in non-English Wiktionary | -5544 (-11%) | 46113 | 45103 | 43501 | 42931 | 40436 | 41383 | 41325 | 41440 | 41173 | 41234 | 40990 | 40956 | 40795 | 40353 | 40272 |
ML | Probable coMpound, transLiteration | -124 (-3%) | 3909 | 3874 | 3707 | 3663 | 3672 | 3575 | 3589 | 3593 | 3628 | 3639 | 3658 | 3717 | 3724 | 3779 | 3769 |
C | Chemistry words | -176 (-9%) | 1782 | 7564 | 7530 | 7644 | 7640 | 7655 | 7658 | 7659 | 7660 | 7662 | 7654 | 7644 | 7659 | 7661 | 7665 |
N | A-Z plus numbers and hyphens | -1391 (-5%) | 25209 | 23813 | 22650 | 22511 | 22290 | 22020 | 22052 | 22053 | 21971 | 22009 | 21960 | 21923 | 21879 | 21856 | 21885 |
Z | Decimal fraction missing leading Zero | - | 47* | 0* | 11405** | 11418 | 11414 | 11398 | 11402 | 11421 | 11455 | 11530 | 11546 | 11578 | 11598 | 11669 | 11683 |
P | Patterns (e.g. rhyme schemes) | -20 (-43%) | 27 | 28 | 7 | 9 | 7 | 7 | 3 | 2 | 2 | 4 | 5 | 4 | 5 | 5 | 4 |
H | HTML/XML/SGML tag | -539 (-15%) | 3010 | 2886 | 2938 | 2903 | 2904 | 2848 | 2693 | 2697 | 2680 | 2747 | 2757 | 2729 | 2565 | 2569 | 2542 |
HB | Known bad HTML tag, like <font> | -1080 (-7%) | 14465 | 14121 | 12903 | 13928 | 12919 | 14733 | 14022 | 11428 | 11670 | 11198 | 10191 | 8860 | 8756 | 8842 | 9725 |
HL | Bad HTML-like linking, like <http://...> | -98 (-19%) | 414 | 418 | 377 | 394 | 394 | 421 | 408 | 425 | 420 | 413 | 373 | 359 | 356 | 329 | 324 |
U | URL | -94 (-7%, from 2019-03-20) | 1179 | 1152 | 1118 | 1134 | 1117 | 1122 | 1129 | 1124 | 1120 | 1124 | 1124 | 1103 | 1101 | 1099 | 1091 |
BC | Bad characters | -12678 (-6%, from 2019-09-01) | 192230 | 190482 | 186651 | 186517 | 185572 | 178698 | 175325 | 166116 | 159095 | 124158 | 112959 | 112755 | 112695 | 112633 | 112479 |
BW | Bad words | -6542 (-5%, from 2019-09-20) | 113682 | 106327 | 381288** | 380259 | 378710 | 374982 | 375107 | 375206 | 375431 | 375306 | 374622 | 374740 | 374560 | 375010 | 375008 |
Total | -39115 (-3%, from 2019-09-20) | 1207516 instances | 1188601 instances | 1647413** instances | 1638977 instances | 1619804 instances | 1600496 instances | 1595660 instances | 1582586 instances | 1574035 instances | 1535639 instances | 1519034 instances | 1514101 instances | 1511139 instances | 1510211 instances | 1508575 instances | |
Parse failure | Mismatched punctuation | -5145 (-3%) | 154084 articles + 40705 MOS:STRAIGHT violations | 153033 articles + 40838 MOS:STRAIGHT violations | 214365 articles + 37697 MOS:STRAIGHT violations | 214463 articles + 37667 MOS:STRAIGHT violations | 214101 articles + 37607 MOS:STRAIGHT violations | 214465 articles + 37767 MOS:STRAIGHT violations | 214732 articles + 37849 MOS:STRAIGHT violations | 215081 articles + 37993 MOS:STRAIGHT violations | 215447 articles + 38067 MOS:STRAIGHT violations | 215915 articles + 38169 MOS:STRAIGHT violations | 216227 articles + 38210 MOS:STRAIGHT violations | 216472 articles + 38205 MOS:STRAIGHT violations | 216738 articles + 38213 MOS:STRAIGHT violations | 216991 articles + 38246 MOS:STRAIGHT violations | 217192 articles + 38338 MOS:STRAIGHT violations |
- red = Probably need to fix
- yellow = Unsorted
- blue = Probably OK (but may need to verify)
- bold = actively working on fixing
* Identification of Z was broken
** Affected by major bug fix for counting inter-word typos (e.g. involving punctuation)
Instructions for editorsEdit
Just like a regular spell checker, sometimes a word that's highlighted is really a misspelling and should be changed, but sometimes it is a correct spelling that needs to be added to the spell checker's dictionary (which in this case is the English Wiktionary and Wikispecies). For the below lists, here's how you can help:
- For spelling mistakes: Click on the links to the individual Wikipedia articles, and edit them to correct the misspelling. Make sure this is actually a misspelling, and not a technical term that needs to be better explained, or an alternate spelling (possibly from a different regional variety of English).
- For non-English words (including words from Old English and Middle English, since they are pronounced differently): Edit the article and use the {{lang}} or {{transl}} templates to mark all non-English passages. Template contents are ignored, so they will not show up in the next report. If you can define the word, it would still be helpful to add the non-English word to the English Wiktionary or the same-language Wiktionary if you speak that language. As of the March 20, 2019 dump, only words not found in any Wiktionary are reported by moss as misspellings. (The "home" Wiktionary for Old and Middle English words is the modern English one.)
- If you don't know which language is being used, you can tag it with {{which lang}}. If you add a "reason=" parameter, that will change the pop-up tooltip text readers will see when they hover over "what language is this?". If you have a guess as to which language it might be, or any other question or comment, you can leave that here to help future editors. If you use this tag, you can delete the article from the moss listing; the article will be added to Category:Articles with unidentified words instead, and ignored by future runs of moss until the mystery is solved.
- For languages that don't have a code (often happens with historical languages), use "mis" and add an HTML comment indicating the language. For example: {{lang|mis|sharbe do kin ratz}}<!-- Old Runish -->
- For incorrect spellings in direct quotes:
- These shouldn't be picked up by the spell checker, as text in double quotes "" is ignored. The article probably has incorrect punctuation.
- Regardless of punctuation problems, you can add {{sic}} around the word or phrase. See Wikipedia:Manual of Style#Quotations for guidance.
- For correct spellings that belong in the dictionary: Click on the word to add it to the English Wiktionary. Remember the word might not be English (though the definition must be) and be sure to check capitalization!
- For correct spellings already in the dictionary: Delete from the list or strike through; these have been added in the meantime since the database dump by other editors. They do not automatically turn red as internal Wikipedia links do.
- For correct spellings not appropriate for Wiktionary:
- For DNA sequences, add {{DNA sequence}} around it.
- For species, add the whole name to Wikispecies:Wikispecies:Requested articles#From_Wikipedia and it will be suppressed from future runs.
- For proper nouns and (including non-English titles) that aren't capitalized, put inside a {{proper name}} tag.
- Use <code></code> or similar tags for computer programs; see Wikipedia:WikiProject_Computer_science/Manual_of_style#Code_samples.
- For terms that are only relevant to one Wikipedia article (and for which the article makes clear the definition) consider creating a redirect to the article. As long as the "typo" word is in the title (as a whole word), it won't show up as a mistake in future spell checks.
- {{IPA}} or {{respell}} can be used for word pronunciations. See Wikipedia:Manual of Style/Pronunciation for details.
- For bird calls: Treat these as foreign-language words or words-as-words and put them in italics, following MOS:ITALICS. Put the call inside {{not a typo}} so it won't show up on moss spell check reports. (It doesn't matter if the double apostrophes that make the italics go inside or outside the template.)
- Anything else, add {{not a typo}} around it (for example, nonsense series of letters used as examples in puzzles).
- Correct or incorrect, when finished delete or
strike outthe entry for the word from the lists on this page (or subpages), so work won't be duplicated. It is preferred to delete the entry for sections that rotate through specific letters, and strikethrough for sections where the whole thing gets updated (to prevent duplicating work done while the dumps were being processed, which can take more than a week). - If an article or section has generally bad grammar, and you don't have time to fix the whole thing, just add {{copyedit}} at the top of the article or {{copyedit|section}} at the top of the affected section. If it's just a sentence or two, {{copy edit inline}} or {{incomprehensible inline}} can go at the end of the problem passage.
- If you see errors being reported from footnotes or bibliographies, check to make sure the section is titled with a standard name following MOS:APPENDIX conventions. Standard end-matter sections like "References" and "Further reading" and "Works" are ignored.
- If it helps to leave a message on the article's talk page asking if the word is correct or incorrect, you can use Template:Typo help like this when editing the bottom of the talk page (leave the section header blank; it will automatically be added):
- {{subst:typo help|PUT WORD HERE}} -- ~~~~
- NEW: If you are uncertain whether a word is spelt correctly or not, you can add {{typo help inline}} immediately after it. If you add a "reason=" parameter, that will change the pop-up tooltip text readers will see when they hover over "check spelling". You can add a specific question or comment that may help identification. If you use this tag, you can delete the article from the moss listing; the article will be added to Category:Articles with unidentified words instead, and ignored by future runs of moss until the mystery is solved.
Don't worry if you miss something; it will reappear in a future report if there are still mistakes.
Suggested edit summariesEdit
If you want to help publicize this project, you can copy-and-paste these into your edit summary, if appropriate.
For Wikipedia edits:
- Fix misspelling found by [[Wikipedia:Typo Team/moss]] – you can help!
- Tag non-English text found by [[Wikipedia:Typo Team/moss]] – you can help!
- Tag correct text as {{not a typo}} for automated spell checkers (including [[Wikipedia:Typo Team/moss]])
- Fix mismatched quote marks found by [[Wikipedia:Typo Team/moss]] – you can help!
For Wiktionary edits:
- Add word identified by [[w:Wikipedia:Typo Team/moss]] – you can help!
Wiktionary cheat sheetEdit
Need to add a word to Wiktionary? The Wiktionary cheat sheet has copy-and-paste templates that make it easy for the types of words commonly encountered here, even if you've never done it before.
Misspellings - lists of things to fixEdit
Likely misspellings by article (main listing)Edit
The most efficient list to work on if all you want to do is fix misspellings. These listings try to list all the typos from a given article, so they can be fixed all at once. It also tries to only show typos that legitimately need fixing. It's not perfect, so a few words found need to be added to Wiktionary or tagged as not English, not a typo, etc. Only a few letters are updated on each run, to avoid stale listings as the whole list takes far longer than two weeks to work through. (This also avoids duplicating recent work when listings are refreshed.)
See subpages due to length:
- Wikipedia:Typo Team/moss/before A - Completed 2020-04-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/A - Completed 2020-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/B - Completed 2020-06-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/C - Completed 2020-07-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/D - Completed 2020-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/E - Completed 2020-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/F - Completed 2020-09-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/G - Completed 2020-09-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/H - Has typos to fix from 2020-10-01 dump
- Wikipedia:Typo Team/moss/I - Has typos to fix from 2020-10-20 dump
- Wikipedia:Typo Team/moss/J - Completed 2019-02-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/K - Completed 2019-02-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/L - Completed 2019-02-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/M - Completed 2019-03-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/N - Completed 2019-05-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/O - Completed 2019-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/P - Completed 2019-06-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/Q - Completed 2020-05-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/R - Completed 2019-07-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/S - Completed 2019-07-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/T - Completed 2019-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/U - Completed 2019-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/V - Completed 2019-08-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/W - Completed 2019-08-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/X - Completed 2019-03-20 dump, currently empty
- Wikipedia:Typo Team/moss/Y - Completed 2019-05-01 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/Z - Completed 2019-03-20 dump (except for cases that need tagging or investigation)
- Wikipedia:Typo Team/moss/after Z - Completed 2019-03-20 dump, currently empty
Notes:
- For more cases that require investigation, see Category:Articles with unidentified words.
- Due to length and an increased number of false positives, typo reports for dumps 2020-05-20 and later don't include T2, T3, and TS+BRACKET.
Likely misspellings by frequency (a-m)Edit
(updated from 2020-05-20 dump)
The best list to work on if you want to eliminate all instances of a specific typo. Only typos that are very close to known words are shown. The algorithm is not perfect, so some of these may still be words that need to be added to Wiktionary. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
Legitimate misspellings are candidates for Wikipedia:Lists of common misspellings. If there is an obvious correction, adding that to Wikipedia:Lists of common misspellings/For machines will help editors who use automated tools to fix cases faster.
- 106 -
wikt:entroph - Catajapyx aquilonaris, Catajapyx confusus, Catajapyx ewingi, Catajapyx singularis, Ctenjapyx boneti... find all → created by a robotic editor, now all changed. Graeme Bartlett (talk) 07:57, 6 October 2020 (UTC) - 88 -
wikt:houseold- Allendale County, South Carolina, Androscoggin County, Maine, Ashland, Nebraska, Ayr, Nebraska, Bartlett, Nebraska ... find all- Fixed. Doghouse09 (talk) 19:01, 8 September 2020 (UTC)
- 57 -
wikt:copbo - Byzantine text-type, Codex Alexandrinus, Codex Athous Lavrensis, Codex Ephraemi Rescriptus, Codex Koridethi... find all → redirected and defined - 34 -
wikt:immed - Intel 8086, Military Ordinariate of Colombia, Military Ordinariate of Peru, Roman Catholic Archdiocese of Antequera, Oaxaca, Roman Catholic Archdiocese of Campinas... find all → abbreviations expanded - 24 - wikt:deuls - Anara, Purulia, Bahulara Ancient Temple, Baidyapur Jora Deul, Banda Deul, Bardhaman ... find all
17 - wikt:initally- Exocet, F1 (video game series), Glenn Saxson, HM Prison Feltham, Kingdom (South Korean TV series) ... find all- Fixed. Doghouse09 (talk) 19:01, 8 September 2020 (UTC)
17 - wikt:arrangerments- A Fool to Care, December (Chris Botti album), Friends Can Be Lovers, Fuzzy Logic (David Benoit album), Just as I Am (Bill Withers album) ... find all- 16 -
wikt:eunited - 2018 CWL Pro League, Arcitys, Call of Duty Championship 2019, Clayster, Fighting game community... find all → marked - 13 -
wikt:belived- Béatrix de Choiseul-Stainville, COVID-19 pandemic in Rajasthan, Govindasvāmi, List of Italian soccer clubs in Victoria, Australia, Manius Aemilius Lepidus (consul 11) ... find all - 11 -
wikt:dbts - Casein kinase 1, Doubletime (gene)... find all → redirect created 11 - wikt:critisized- Ivan Bahrianyi, Linfen, Michiel Hillen van Hoochstraten, Nkeiruka Onyejeocha, Olympiastadion (Munich) ... find all- 11 -
wikt:availabe - ESCP Business School, Flightradar24, Gamma-L-Glutamyl-L-cysteine, Hoiamides, Intel 8089... find all- The only remaining instances have been [sic]ed (by someone, not me). Doghouse09 (talk) 18:52, 8 September 2020 (UTC)
10 - wikt:influental- Alan Flusser, Aleksander Bajt, Arckanum, Christianity in Serbia, Hrvatska prosvjeta ... find all10 - wikt:incuding- Asian hornet, BBC Archives, Erhard Grieder, Joseph Michelli, Lathkill Dale ... find all- Fixed. Doghouse09 (talk) 18:48, 8 September 2020 (UTC)
- 10 - wikt:daees - List of Dai of Dawoodi Bohra, Sulaymani ... find all
10 - wikt:asssistant- Giving Up the Gun, Higher (Treponem Pal album), Holiday (Vampire Weekend song), Horchata (song), Ioan Reinhardt ... find all- 10 -
wikt:abandonned - Aït Yahia, EZ Platform, FUFA Women Super League, Francis Johnson (Brownist), HMS Hannibal (1779)... find all- The only remaining instances have been [sic]ed (by someone, not me). -sche (talk) 00:07, 29 August 2020 (UTC) → 2 more corrected Graeme Bartlett (talk) 03:53, 7 October 2020 (UTC)
- 9 - wikt:kurals - Aram (Kural book), Impact of Tirukkural, Inbam (Kural book), M. Gopala Krishna Iyer, Parimelalhagar ... find all
- 9 - wikt:gunome - Glossary of Japanese swords, Kenzō Kotani, List of National Treasures of Japan (crafts: swords), Muramasa ... find all
9 - wikt:etablished- Climate Change Research Centre, Hvidovre Stadium, Latvian-Estonian Basketball League, Lokalavisen Favrskov, Mauritel ... find all9 - wikt:dicatorship- Claude Lefort, Experiencias '68, Gaston Z. Ortigas, Inang Laya, Joan Oliver i Sallarès ... find all- 8 -
wikt:modeun - Baek Mu-san, Choi Seungho, Ha Jaeyoun, Lee Eung-jun, Lee Gi-seong... find all→Romanised Korean for "all"; now marked with templates. - 8 - wikt:madake - Baren (printing tool), Japanese bamboo weaving, Phyllostachys bambusoides, Shakuhachi, Ōita Prefecture ... find all
- 8 - wikt:limbei - Alexandru Philippide, Ion Creangă, Ion Heliade Rădulescu, Palatschinke, Tache Papahagi ... find all
- 8 -
wikt:laikes - Laiki agora, Marketplace... find all → redirected/marked transliterated plural of laiki - 8 -
wikt:joing - Bitumirim River, Brigate Garibaldi, Disappearance of Najeeb Ahmed, Gerald Le Mesurier, Joan Josep Nuet... find all - 8 - wikt:iqtas - Iltutmish, Jalal-ud-din Khalji, Qutbuddin Mubarak Shah, Razia Sultana ... find all
- 8 -
wikt:ibol - Consortium for the Barcode of Life, DNA barcoding, Daniel H. Janzen, Dunama indereci, Kuba Kingdom... find all → redirected - 8 -
wikt:hectars - Futa Pass Cemetery, Lerik, Azerbaijan, Marie-Thérèse Chappaz, Osterseen, Peat... find all → corrected 8 - wikt:greenary- Al-Namas, East Godavari, Ghatal, Jharkhand, Markayankottai ... find all- Fixed. Doghouse09 (talk) 19:12, 8 September 2020 (UTC)
8 - wikt:goups- 1914–15 FC Basel season, Eldon Public Library, Eucalyptus alba, Eucalyptus diminuta, Eucalyptus goniantha ... find all- 8 - wikt:garhs - Garhwali people, Jangipara (community development block), Lunahar, Panchagarh District, Sangram Shah ... find all
- 8 - wikt:epublishing - Aimée du Buc de Rivéry, Lektz, Mediterranean Marine Science, Murugappa Group, PROSE Awards ... find all
8 - wikt:colloborated- Ayiroor Sadasivan, Bhuvan Gowda, Eugene C. Barker, G. K. Vishnu, H. W. Janson ... find all- 8 -
wikt:calld - Guugu Yimithirr language, High Plains Drifter, Luna Yin, Maerten Boelema de Stomme, Makruk ...find all - 8 - wikt:cakwe - Bubur ayam, Congee, Indonesian cuisine, Youtiao ... find all
- 8 -
wikt:behaviorial - Camera shyness, Full Thrust, Glossary of education terms (S), Günter Schmölders, Limbic system... find all - 8 -
wikt:annouced - Alessandro Matri, Antoine Fuqua, Ioana Stănciulescu, Manzoor Azwira, Rolls-Royce Holdings... find all - 8 -
wikt:anniversay - Al Zawiya University, Bibi Torriani, Butterfingers (Australian band), Darkening, Kadril... find all - 8 - wikt:ampulate - Anopodium ampullaceum, Spidroin ... find all
- 7 -
wikt:maried - Egla Harxhi, Marian Turski, Mary Stuart Smith, Philip Emmanuel, Prince of Piedmont, Ricardo Rivera Schreiber... find all - 7 - wikt:knige - Collection of Poems. 1889–1903, Ellendea Proffer, Jovan Skerlić, List of Prekmurje Slovene literature, Mihály Bakos ... find all
- 7 - wikt:kalaries - Kalari, Kalari Panicker ... find all
- 7 -
wikt:illnes - Han Aiping, Penicillium variabile, Queen of Air and Darkness (Clare novel), Theodore II Laskaris, Valters Frīdenbergs... find all - 7 -
wikt:frmly - 1918 Birthday Honours, 1919 New Year Honours... find all - 7 -
wikt:finnally - Behörighetslagen, Black Death in the Holy Roman Empire, Henriette of France (1727–1752), Louise-Jeanne Tiercelin de La Colleterie, Nüzi canzheng tongmenghui... find all - 7 - wikt:eview - ANU Press, International HL7 Implementations, Java Caps, Transactive energy ... find all
- 7 -
wikt:establised - Killarney, Queensland, National Institute of Informatics, Region of Murcia, Repartition of Ireland, St. Joseph's Convent School, Varanasi ...find all - 7 - wikt:econtact - Antti Sakari Saario, Arne Eigenfeldt, Biosignal, Eldad Tsabary, Gordon Fitzell ... find all
- 7 -
wikt:dissassembled - Breakdance (ride), Copley station, Dot blot, Leikanger Church (Herøy), Metamorpho... find all - 7 -
wikt:currentley - Chidi Omeje, Daniel Braaten, David Meihuizen, Marcus Pedersen, Mohamed Didé Fofana... find all - 7 - wikt:cpmm - 9 track tape, List of International Organization for Standardization standards, 1-4999, List of International Organization for Standardization standards, 5000-7999, List of International Organization for Standardization standards, 8000-8999 ... find all
- 7 -
wikt:constituences - Crookston, Glasgow, East Flanders (Flemish Parliament constituency), Elections in Japan, Flemish Brabant (Flemish Parliament constituency), Freddie Blay... find all 7 - wikt:confimed- COVID-19 pandemic in the Donetsk People's Republic, Dante Mossi, Kőröshegy, List of birds of Olympic National Park, Melodifestivalen 2020 ... find all- Fixed. Doghouse09 (talk) 20:46, 8 September 2020 (UTC)
- 7 -
wikt:coeloids - Belemnotheutis, Evolution of cephalopods, Posidonia Shale... find all → defined Graeme Bartlett (talk) 10:00, 6 October 2020 (UTC) - 7 -
wikt:becme- Habib Rahman (architect), Hemtabad, Kaliaganj (community development block), Lamington, Queensland, Mladen Krstajić ... find all- Fixed. Doghouse09 (talk) 20:46, 8 September 2020 (UTC)
7 - wikt:becaming- Correspondence of Lorraine toponyms in French and German, Destino (magazine), Heinrich Sturm, Iamgold, Leonte Răutu ... find all- 7 -
wikt:autonomnous - Yugoslavia in the Eurovision Song Contest 1961, Yugoslavia in the Eurovision Song Contest 1962, Yugoslavia in the Eurovision Song Contest 1963, Yugoslavia in the Eurovision Song Contest 1964, Yugoslavia in the Eurovision Song Contest 1965... find all 7 - wikt:assisant- Emerich K. Francis, James Patton (American football coach), Luke Erede Ejohwomu, My Sister in Law, Rush Propst ... find all- The only remaining instances have been [sic]ed (by someone, not me). Doghouse09 (talk) 20:46, 8 September 2020 (UTC)
7 - wikt:arround- 2020 Zagreb earthquake, Arnulf, Count of Holland, Chicagoland Speedway, Guleria, Kirrule-type ferry ... find all- Fixed. Doghouse09 (talk) 20:46, 8 September 2020 (UTC)
7 - wikt:againt- 1905 Western University of Pennsylvania football team, Beatrix Ramosaj, Spain national football team results (1920–29), Staples Center, Sting (wrestler) ... find all- Fixed. Doghouse09 (talk) 20:46, 8 September 2020 (UTC)
7 - wikt:adressed- Licypriya Kangujam, Margarita Robles, María-Esther Vidal, Tamon Yamaguchi, Victorian dress reform ... find all7 - wikt:abbrevriated- Daniel Angelici, Melbourne High School Old Boys Association, Mornington Peninsula Nepean Football League, OMAS, Olimpia Basketball Club ... find all6 - wikt:mentionned- Address, Akurgal, Gudea, Il, king of Umma, Leçons de ténèbres ... find all- 6 - wikt:manthras - Kadri Manjunath Temple, Mithrananthapuram Trimurti Temple, Sengalipuram Muthanna, Sree Panayannurkavu Devi Temple, Zoroastrianism ... find all
- 6 - wikt:maatras - Chandas (poetry), Tala (music) ... find all
- 6 - wikt:lyrates - Albert's lyrebird, Lyrebird, Superb lyrebird ... find all
- 6 -
wikt:localites - Bijjur, Kudankulam Nuclear Power Plant, Pir Ghaib Hunting Lodge and Observatory, Sonorella neglecta, Sri Sunama Jakini Matha... find all 6 - wikt:jurisdicition- Baneshwarpur, Dighirpar, Kheadaha, Khodar Bazar, Masat, Diamond Harbour ... find all6 - wikt:interrupped- Fabio Eguelfi, Gabriel Lunetta, Lorenzo Gavioli, Stefano Mazzini ... find all- Fixed Doghouse09 (talk) 20:46, 8 September 2020 (UTC)
6 - wikt:inluding- Brandon Mouton, Charles Osborne (politician), EUTM, Hunter Monroe, Kazan Ansat ... find all6 - wikt:inlcuding- Barry Seal, Henry Jayasena, LSL Property Services, Peninsular Malaysian montane rain forests, Presidential Band of the State Security Service of the Republic of Kazakhstan ... find all- Fixed Doghouse09 (talk) 20:46, 8 September 2020 (UTC)
- 6 -
wikt:inkeeper - Adina Mandlová, John Carson (physician), Ray Middleton (actor), Svatby pana Voka, White Paradise... find all
Likely new English compounds by frequency (a-m)Edit
(Updated from 2020-05-20 dump.) The best list to work on if you want to add variations of known words to Wiktionary, mostly compound words. The algorithm is not perfect, so some of these might be common mistakes that need to corrected. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
- 92 -
wikt:dancethon - 1996 Chilean telethon, El Gran Show (season 1), El Gran Show (season 10), El Gran Show (season 11), El Gran Show (season 12)... find all → fixed - 89 - wikt:bellcast - Acadia National Park carriage paths, bridges and gatehouses, Administration Building, Missouri State Fruit Experiment Station, Asbury United Methodist Church (Knoxville, Tennessee), Avondale, Parramatta, Benjamin Franklin Prescott House ... find all
- 84 - wikt:buyrate - Backlash (2003), Badd Blood: In Your House, Campbell McLaren, Chuck Liddell vs. Tito Ortiz, Conor McGregor ... find all
- 68 - wikt:ballybetagh - Aghakinnigh, Aghnacally, Borim (Kinawley), Carn, Tullyhunco, Cloghoge ... find all
- 65 - wikt:kafanas - Balkan Cinema building, Belgrade, Belgrade, Belgrade Youth Center, Bora Spužić Kvaka, City Park, Zemun ... find all
- 50 - wikt:cornerboards - Abraham Hall, Babson-Alling House, Bangor Elevator, Benjamin Franklin Prescott House, Benoit Apartments ... find all
- 50 - wikt:amphoes - Ang Thong Province, Ayothaya Floating Market, Bang Sue District, Buriram Province, Chachoengsao Province ... find all
- 48 - wikt:cellspot - Agrochola lota, Apamea anceps, Apamea furva, Apamea oblonga, Apamea ophiogramma ... find all
- 44 - wikt:lagums - BIP Brewery, Building of the Patriarchate, Belgrade, Gardoš, Gardoš Tower, House at 10 Cara Dušana Street ... find all
- 42 - wikt:hammerfists - Agenor Moreira Sampaio, Akiyo Nishiura, Alexander Otsuka, Andrei Arlovski, Antônio Silva (fighter) ... find all
- 41 - wikt:beltcourses - Adams Memorial Building, Albion station (Michigan), Alma Downtown Historic District (Alma, Michigan), Braastad–Gossard Building, Burbach Block ... find all
- 40 - wikt:backrow - 2016 Northern Pride RLFC season, 2018 Northern Pride RLFC season, Alan Tongue, Andrew Gibbs, Ashley Johnson (rugby union) ... find all
- 39 - wikt:flushboarding - Binks Hess House and Barn, Burt Henry Covered Bridge, Call-Bartlett House, Capt. William McGilvery House, Casey House (Mountain Home, Arkansas) ... find all
- 38 - wikt:akarere - Bugesera District, Burera District, Cyarubare District, Districts of Rwanda, Gakenke District ... find all
- 34 - wikt:favehotel - Archipelago International ... find all
- 32 - wikt:digibook - 'N Crugu Bradului, A Bit o' This & That, A Strange Thing to Say, Anoraknophobia, At the Arena ov Aion – Live Apostasy ... find all
- 31 - wikt:katuns - Battle of Kolašin, Bjelasica, Kelmendi (tribe), Komovi, Kriči ... find all
- 31 - wikt:gametype - 3D Tetris, Battlefield 1943, Battlefield 4, Defense of the Ancients, Devastation (video game) ... find all
- 31 - wikt:fanmeeting - After School (group), Apeace, B.O.Y, CLC (group), Cross Gene ... find all
- 31 - wikt:csexp - Canonical S-expressions ... find all
- 31 - wikt:byali - ELEAGUE Major 2017, ESL One Cologne 2016, FACEIT Major: London 2018, MLG Major Championship: Columbus, PGL Major: Kraków 2017 ... find all
- 31 - wikt:beastkin - BNA: Brand New Animal, BlazBlue Alter Memory ... find all
- 29 - wikt:dropsondes - Atmospheric sounding, Dropsonde, Economy of Columbus, Ohio, Eyewall replacement cycle, Global Positioning System ... find all
- 28 - wikt:funfactor - All-Star Baseball '97 featuring Frank Thomas, Battle Arena Toshinden 2, Brain Dead 13, Brandish (video game), Chrono Trigger ... find all
27 - wikt:hoodline- BMW Z1, Chevrolet Corvette (C5), Chevrolet Lumina APV, Chevrolet/GMC B series, Chrysler Concorde ... find all- 27 - wikt:alcippus - Danaus (butterfly), Danaus chrysippus, List of butterflies of Benin, List of butterflies of Burkina Faso, List of butterflies of Cameroon ... find all
- 26 - wikt:jihada - Glossary of Japanese swords, Japanese sword, Japanese sword polishing, List of National Treasures of Japan (crafts: swords) ... find all
- 26 -
wikt:hydropowered - Acoustic tag, Analog computer, Book of Ingenious Devices, Ferrous metallurgy, Grafton, Wisconsin... find all → already defined. - 26 - wikt:diplexed - In-band on-channel, KEAR (AM), KEST, KFWB, KQFN ... find all
- 26 - wikt:danceband - After the Ball (album), Andy Nye, Ansco Bruinier, Bris (disambiguation), Donnez ... find all
- 26 - wikt:bodykits - 2018 WeatherTech SportsCar Championship, Autodelta (UK), Brabus, Bōsōzoku, Citroën Saxo ... find all
- 25 - wikt:mysticker - Blazer Drive, List of Blazer Drive characters ... find all
- 25 - wikt:mprs - Allopregnanolone, Membrane progesterone receptor, Membrane steroid receptor, Pharmacodynamics of progesterone, Progesterone ... find all
- 25 - wikt:maavg - List of Mullard–Philips vacuum tubes ... find all
- 25 - wikt:istudy - Bored of Studies, Cottesmore School, McMaster Integrated Science, System Technology-i Co, Ltd ... find all
- 25 - wikt:bandform - Analogue filter, Composite image filter, Distributed-element filter, Electronic filter topology, Filter (signal processing) ... find all
- 24 - wikt:auroglaucin - Aspergillus aerius, Aspergillus appendiculatus, Aspergillus biplanus, Aspergillus brunneus, Aspergillus caperatus ... find all
- 24 - wikt:aeroengines - Avio, Bristol Filton Airport, CRAIC CR929, Fedden Mission, Gustav Otto ... find all
- 23 - wikt:degredados - 2nd Portuguese India Armada (Cabral, 1500), Barra (neighborhood), Cacheu, Colonial Brazil, Degredado ... find all
- 23 - wikt:counterlungs - Dräger Ray, Gordon Smith (inventor), Halcyon RB80, Human physiology of underwater diving, Lambertsen Amphibious Respiratory Unit ... find all
- 23 - wikt:biradaris - Bhishti, Bisati, Churihar, Dharhi, Doodwala ... find all
- 23 - wikt:afwc - 25th Bangladesh Infantry Regiment, Abdul Waheed Kakar, Army Golf Club, Bangladesh Coast Guard, Bangladesh Institute of Peace Support Operation Training ... find all
- 22 - wikt:mapeak - List of Mullard–Philips vacuum tubes ... find all
- 22 - wikt:godspoken - Gloriously Bright, List of Ender's Game characters, Xenocide ... find all
- 22 - wikt:everset - Kamen Rider Drive, Kamen Rider Drive: Surprise Future, Kamen Rider Fourze, Kamen Rider Gaim, Kamen Rider OOO ... find all
- 22 - wikt:bandname - 59 Times the Pain, Cute (Japanese idol group), Days N' Daze, Dynamite Boy, Ill Niño ... find all
- 21 - wikt:metepimeron - Anax immaculifrons, Copera marginipes, Copera vittata, Elattoneura souteri, Idionyx corona ... find all
- 21 - wikt:keytype - Comparison of programming languages (associative array), Postage stamps and postal history of Malta, Postage stamps and postal history of Nigeria, Revenue stamps of Aden, Revenue stamps of Bermuda ... find all
- 21 - wikt:geoviewer - Alder Brook (West Branch French Creek tributary), Bailey Brook (West Branch French Creek tributary), Baskin Run (South Branch French Creek tributary), Beaver Run (South Branch French Creek tributary), Beaverdam Creek (Crabtree Creek tributary) ... find all
- 21 - wikt:fireplan - 110th Siege Battery, Royal Garrison Artillery, 121st Siege Battery, Royal Garrison Artillery, 171st Siege Battery, Royal Garrison Artillery, 1st Lincolnshire Artillery Volunteers, 1st Midlothian Artillery Volunteers ... find all
- 21 - wikt:cocklestoves - Baia, Ceramic art, Confidencen, Hakkemose Brickworks, Hatu ... find all
- 21 -
wikt:allokotosaurs - Archosauromorpha, Azendohsaurus, Boreopricea, Elessaurus, Epipophyses... find all → defined - 21 -
wikt:alderpersons - Alderman, Amsterdam, Appleton, Wisconsin, De Pere, Wisconsin, DeKalb, Illinois... find all - 21 - wikt:akaval - Ciṟupāṇāṟṟuppaṭai, Five Great Epics, Indian epic poetry, Kalittokai, Kuṟiñcippāṭṭu ... find all
- 20 - wikt:longiconic - Ascoceratidae, Ascocerida, Basslerocerida, Campendoceras, Ellesmeroceratidae ... find all
- 20 - wikt:lockerboxes - Allermöhe station, Bahrenfeld station, Billwerder-Moorfleet station, Diebsteich station, Eidelstedt station ... find all
- 20 - wikt:isoleukotoxin - CYP2C18, CYP2C19, CYP2C8, CYP2C9, CYP2J2 ... find all
- 20 - wikt:insts - Darijan Božič, List of compositions by Constant Lambert, List of compositions by Heinrich Schütz, Peter Dickinson (musician) ... find all
- 20 - wikt:gsang - Benedikt Gletting, Guhyagarbha tantra, Guhyasamāja Tantra, Nyingtig Yabshi, Tam Shek-wing ... find all
- 20 -
wikt:governship - Alauddin Khalji's conquest of Ranthambore, Amarah, An Sishun, Benjamin Butler, Bera, Count of Barcelona... find all → fixed - 20 -
wikt:frontlit- Game Boy Advance, Game Boy Advance SP, Game Boy Advance family, Game Boy Micro, Game Boy family ... find all - 20 - wikt:endwall - Capt. Richard Strong House, Cheng Xu, Edward R. Wilson House, Expansion tube, Hersey-Duncan House ... find all
- 20 - wikt:datsans - Buddhism in Buryatia, Buddhism in Russia, Buryats, Damba Ayusheev, Datsan ... find all
- 20 - wikt:commlock - All That Glisters (Space: 1999), Dragon's Domain, Earthbound (Space: 1999), Guardian of Piri, Seed of Destruction (Space: 1999) ... find all
- 20 - wikt:bushdrive - Toyota Land Cruiser, Toyota Land Cruiser (J40) ... find all
- 20 -
wikt:autobid - 2014 National Invitation Tournament, Canisius Golden Griffins, College Hockey America, LIU Sharks women's ice hockey, Lindenwood Lions... find all→ defined - 20 - wikt:antepreparatory - Anna Maria Taigi, August Czartoryski, Domenico Lentini, Elena Guerra, Franz-Josef Rudigier ... find all
- 19 -
wikt:metalbending- Avatar: The Last Airbender – North and South, Avatar: The Last Airbender – The Promise, Avatar: The Last Airbender – The Rift, Bolin (The Legend of Korra), Korra ... find all- WT doesn't want it, thanks
- 19 - wikt:lipfire - Ethan Allen (armsmaker) ... find all
- 19 - wikt:lifestory - Buddhist texts, Cynesige, Helena Whitbread, Inniskeen, Johan Furåker ... find all
Likely new words by frequency, all languages (a-m)Edit
(updated from 2020-05-20 dump)
Good candidates for words to add to the English Wiktionary (which provides English definitions for words in all languages), as it seems English Wikipedia readers will frequently encounter them. For each run, only words from half of the alphabet are shown, to avoid duplicate work from when new dumps are being processed.
Most of the words are not from English. To get them off this list, you can either add an entry to the English Wiktionary (which provides English definitions for words in all languages) or tag all instances of the word on the English Wikipedia with {{lang}}. Wiktionary does not accept Romanizations for some languages, so those cases must be tagged as {{transl}} or {{lang}}.
- 165 - wikt:alangaram - Abirameswarar temple, Adhirangam Ranganathaswamy temple, Adi Jagannatha Perumal Temple, Adi Kumbeswarar Temple, Kumbakonam, Adikesava Perumal temple, Mylapore ... find all
- 162 - wikt:aradanai - Abirameswarar temple, Adhirangam Ranganathaswamy temple, Adi Jagannatha Perumal Temple, Adi Kumbeswarar Temple, Kumbakonam, Adikesava Perumal temple, Mylapore ... find all
- 140 - wikt:gandharam - Abhogi, Ahiri, Amritavarshini, Anandabhairavi, Andolika ... find all
- 121 - wikt:æftiʀ - Ardre image stones, Aringsås Runestones, Arkils tingstad, Asferg Runestone, Ballstorp Runestone ... find all
- 119 - wikt:farābād - Bagh-e Jafarabad, Jafarabad, Ahar, Jafarabad, Alborz, Jafarabad, Amol, Jafarabad, Andika ... find all
- 115 - wikt:стрелковая - 109th Rifle Division (Soviet Union), 10th Guards Motor Rifle Division, 114th Rifle Division (Soviet Union), 121st Guards Rifle Division, 12th Guards Rifle Division ... find all
- 101 - wikt:jangha - Aisanyesvara Siva Temple, Akhadachandi Temple, Arjunesvara Siva Temple, Astasambhu Siva Temples, Belesvara Siva Temple ... find all
- 92 -
wikt:dancethon - 1996 Chilean telethon, El Gran Show (season 1), El Gran Show (season 10), El Gran Show (season 11), El Gran Show (season 12)... find all - 91 - wikt:dhaivatham - Abheri, Abhogi, Anandabhairavi, Asampurna Melakarta, Bageshri ... find all
- 82 - wikt:λmax - 3-Hydroxyflavone, Amino radical, Aniline (data page), Astronomical spectroscopy, Azo violet ... find all
- 74 - wikt:chathusruthi - Abhogi, Anandabhairavi, Bageshri, Bahudari, Bhairavi (Carnatic) ... find all
- 71 - wikt:kiruthigai - Abirameswarar temple, Adi Kumbeswarar Temple, Kumbakonam, Agastheeswar Temple, Agnipureeswarar Temple, Thirupugalur, Aiyarappar temple ... find all
- 68 -
wikt:ballybetagh - Aghakinnigh, Aghnacally, Borim (Kinawley), Carn, Tullyhunco, Cloghoge... find all - 65 - wikt:kafanas - Balkan Cinema building, Belgrade, Belgrade, Belgrade Youth Center, Bora Spužić Kvaka, City Park, Zemun ... find all
- 60 - wikt:īlābād - Boneh-ye Esmail, Khuzestan, Eshqabad, West Azerbaijan, Esmailabad (28°20′ N 60°27′ E), Gowhar Kuh, Esmailabad (28°37′ N 60°25′ E), Gowhar Kuh, Esmailabad (30°01′ N 52°36′ E), Dorudzan ... find all
- 59 - wikt:lēah - Acklam, Ryedale, Adel, Leeds, Alderley, Gloucestershire, Aldersley, Alwoodley ... find all
- 59 - wikt:bafen - Chinese characters, Prince Cheng of the Second Rank, Prince Chun (醇), Prince Ding, Prince Dun ... find all
- 58 - wikt:dhaivatam - Ahiri, Amritavarshini, Anandabhairavi, Andolika, Asaveri ... find all
- 55 - wikt:književnosti - August Kovačec, Bogdan Popović, Božidar Petranović, Bratoljub Klaić, Croatian Language Corpus ... find all
- 54 - wikt:ispánate - Arnold II Hahót, Atyusz (genus), Atyusz Hahót, Atyusz III Atyusz, Bachaler Olaszkai ... find all
- 51 - wikt:localizated - Achiwib, Adventure, Guyana, Aishalton, Andújar, Anna Regina ... find all
- I cleaned up the instances. I also added a Wiktionary entry for it because it's rather common, but it's what Wiktionarins call wikt:Category:Non-native speakers' English, so any subsequent reoccurences should also be cleaned up, but I don't expect any, because almost all instances were one copy-pasted phrase regarding disputed Guyana/Venezuela-area territory. -sche (talk) 02:02, 4 August 2020 (UTC)
- 50 - wikt:σαυρος - Abelisauridae, Abelisaurus, Abrictosaurus, Abrosaurus, Acteosaurus ... find all
- 50 - wikt:adhyayas - Abel Bergaigne, Adi Parva, Aitareya Brahmana, Anushasana Parva, Ashramavasika Parva ... find all
- 49 - wikt:kagawads - 2013 Philippine local elections, Achila, Bohol, Ang Probinsyano (season 7), Ayala Alabang, Barangay ... find all
- 48 - wikt:гвардейская - 100th Guards Rifle Division, 10th Guards Motor Rifle Division, 10th Guards Uralsko-Lvovskaya Tank Division, 121st Guards Rifle Division, 126th Guards Rifle Division ... find all
- 48 - wikt:molodezhnaja - Foster Daddy, Tora!, Hearts and Flowers for Tora-san, Maid-Droid, Marriage Counselor Tora-san, Stage-Struck Tora-san ... find all
- 47 - wikt:īdābād - Aqeh Kheyl, Gorgabad, Ardabil, Kalateh-ye Seyyed Ali, South Khorasan, Mohammadabad-e Saidabad, Nematabad-e Ghar ... find all
- 47 - wikt:kaisiki - Ahiri, Andolika, Asaveri, Bageshri, Bahudari ... find all
- 46 - wikt:moughataa - Adrar Region, Assaba Region, Brakna Region, Dakhlet Nouadhibou Region, Gorgol Region ... find all
- 46 - wikt:maçkolik - 1963–64 Mersin İdmanyurdu season, 1964–65 Mersin İdmanyurdu season, 1965–66 Mersin İdmanyurdu season, 1966–67 Mersin İdmanyurdu season, 1967–68 Mersin İdmanyurdu season ... find all
- 45 - wikt:προσευχη - Codex Alexandrinus, Codex Augiensis, Codex Claromontanus, Codex Porphyrianus, Minuscule 1739 ... find all
- 42 - wikt:ghilmān - Abu'l-Najm Badr, Ahmad ibn Tulun, Al-Aziz Billah, Al-Mu'tadid, Al-Mu'tasim ... find all
- 42 - wikt:fänikor - 1st Life Grenadier Regiment (Sweden), 2nd Life Grenadier Regiment (Sweden), Dalarna Regiment, Fähnlein, Halland Regiment ... find all
- 41 - wikt:изд - Albena Stambolova, Boris Koyalovich, Church of St Demetrius, Boboshevo, Church of St Elijah, Boboshevo, Daniel Kluger ... find all
- 41 - wikt:βtrcp - Anaphase-promoting complex, BTRC (gene), FBXW11, SCF complex, Vpu protein ... find all
- 41 - wikt:efilmcritic - A Christmas Horror Story, Adrift in Tokyo, All the Boys Love Mandy Lane, Amy (1997 film), Belphegor, Phantom of the Louvre ... find all
- 41 - wikt:cheilos - Acheilognathus, Adenochilus, Ancistrochilus, Anoectochilus, Arthrochilus ... find all
- 40 - wikt:ŭnbyŏng - Goryeo coinage, Korean currency, Korean mun ... find all
- 39 - wikt:haplolepideous - Bruchia (plant), Bruchia elegans, Bruchiaceae, Calymperaceae, Campylopus ... find all
- 39 - wikt:draconium - Dragon Booster, List of dragons in Dragon Booster ... find all
- 39 - wikt:actinosiphonate - Acleistoceratidae, Actinomorpha, Adelphoceras, Augustoceras, Balashovia ... find all
- 38 - wikt:akarere - Bugesera District, Burera District, Cyarubare District, Districts of Rwanda, Gakenke District ... find all
- 37 - wikt:inquerendum - Alvania, Amauropsis, Antalis, Aplysia, Berthella ... find all
- 37 -
wikt:hibbertopterids - Campylocephalus, Eurypterid, Hibbertopteridae, Hibbertopterus, Mycteropoidea... find all - 37 - wikt:adelospondyls - Acherontiscus, Adelospondyli, Lepospondyli ... find all
- 36 - wikt:aswangs - Agimat: Ang Mga Alamat ni Ramon Revilla, Ang Panday (2017 film), Aso ni San Roque, Buntot Pagi, Darna, Kuno? ... find all
- 35 - wikt:hangavulu - Gela language ... find all
- 35 - wikt:audava - Abhogi, Amritavarshini, Bhupalam, Gambhiranata, Hamsadhvani ... find all
- 34 - wikt:θεου - Codex Alexandrinus, Codex Athous Lavrensis, Codex Basilensis A. N. IV. 4, Codex Glazier, Codex Vaticanus 2061 ... find all
- 34 - wikt:imirenge - Bugesera District, Burera District, Busengo, Rwanda, Districts of Rwanda, Gakenke District ... find all
- 34 - wikt:dzongpons - Bumthang Province, Daga Province, Districts of Bhutan, Dzongpen, House of Wangchuck ... find all
- 34 - wikt:benandante - Benandanti, The Night Battles ... find all
- 34 -
wikt:arrodissement - Bouassi, Boukanere, Chein, Danri, Daroukpara... find all - 32 - wikt:midare - Eijirō Tōno, Former Nine Years' War, Glossary of Japanese swords, Izumi Aki, Japanese bamboo weaving ... find all
- 32 - wikt:digibook - 'N Crugu Bradului, A Bit o' This & That, A Strange Thing to Say, Anoraknophobia, At the Arena ov Aion – Live Apostasy ... find all
- 31 - wikt:νηστεια - Codex Alexandrinus, Codex Augiensis, Codex Claromontanus, Codex Porphyrianus, Minuscule 1739 ... find all
- 31 - wikt:katuns - Battle of Kolašin, Bjelasica, Kelmendi (tribe), Komovi, Kriči ... find all
- 31 - wikt:fstnt - TNNI2, TNNT1, TNNT2, TNNT3 ... find all
- 31 - wikt:etatsråd - Anker Heegaard, Bolle Luxdorph, Carl Adolph Castenschiold, Carsten Anker, Christian Frederik Hansen ... find all
- 31 - wikt:dābād - Aliabad-e Jowhari, Asgarabad, Fars, Narmeh, Radabad, Sadabad, Anbarabad ... find all
- 31 - wikt:dcdn - Content delivery network interconnection ... find all
- 31 - wikt:byali - ELEAGUE Major 2017, ESL One Cologne 2016, FACEIT Major: London 2018, MLG Major Championship: Columbus, PGL Major: Kraków 2017 ... find all
- 31 - wikt:apetura - Primera División de Fútbol Profesional 1981, Primera División de Fútbol Profesional 1982, Primera División de Fútbol Profesional 1985, Primera División de Fútbol Profesional 1987–88, Primera División de Fútbol Profesional 1988–89 ... find all
- 31 - wikt:anuratha - Aisanyesvara Siva Temple, Arjunesvara Siva Temple, Astasambhu Siva Temples, Bata Mahadeva, Bhringesvara Siva Temple ... find all
- 30 - wikt:κυριος - Codex Laudianus, Codex Vaticanus 2061, Cotton Genesis, Cyril, Family Kr ... find all
- 30 - wikt:đồngs - Bình Phước Province, Bình Định Province, Bắc Giang Province, Bắc Kạn Province, Cao Bằng Province ... find all
- 30 - wikt:maechis - Ayya (Pali word), Maechi, Siladhara Order ... find all
- 30 - wikt:khoshuu - Aimag, Bayandelger, Töv, Chingünjav, Dashdorjiin Natsagdorj, Dulduityn Danzanravjaa ... find all
- 30 - wikt:cajunb - DreamHack Winter 2014, ELEAGUE Major 2017, ESL One Cologne 2016, MLG Major Championship: Columbus, PGL Major: Kraków 2017 ... find all
- 29 - wikt:δij - Archimedes' principle, Borel–de Siebenthal theory, Brownian motion, Buoyancy, Cartesian tensor ... find all
- 29 - wikt:ādatābād - Dashtabad, Narmashir, Saadatabad, Abadeh, Saadatabad, Arsanjan, Saadatabad, Bardsir, Saadatabad, Darab ... find all
- 29 - wikt:mihnah - Ahmad ibn Abi Du'ad, Harthamah ibn al-Nadr al-Jabali, Ishaq ibn Ibrahim al-Mus'abi, Ishaq ibn Yahya ibn Mu'adh, Kaydar Nasr ibn Abdallah ... find all
- 29 - wikt:dropsondes - Atmospheric sounding, Dropsonde, Economy of Columbus, Ohio, Eyewall replacement cycle, Global Positioning System ... find all
- 29 - wikt:bhikkhunīs - Buddhist Cultural Centre, Buddhist flag, Dhammadharini Vihara, Pāṭimokkha, Ānanda ... find all
- 29 -
wikt:alloted - 103d Attack Squadron, 1896 Western University of Pennsylvania football team, 319th Operations Group, Anne Evans (arts patron), Arms (video game)... find all → fixed - 28 - wikt:field - 2001 New Year Honours, 2007 New Year Honours, Battle of Halen, Bioelectromagnetic medicine, Bipolar magnetic semiconductor ... find all
- 28 - wikt:αυτω - Codex Athous Lavrensis, Codex Boernerianus, Codex Ephesinus, Lectionary 239, Matthew 1:24 ... find all
- 28 - wikt:musumeyaku - Asuka Tono, Ayane Sakurano, Mari Hanafusa, Natsuki Mizu, Risa Junna ... find all
- 28 - wikt:lăutărească - Ciocârlia (Romanian folk tune), Costi Ioniță, Damian Drăghici, George Nicolescu, Lăutari ... find all
- 28 - wikt:derebeys - Charter of Alliance, Derebey, Greeks in Georgia, History of the Laz people, Khimshiashvili ... find all
- 28 - wikt:chevaulegers - 1st Polish Light Cavalry Regiment of the Imperial Guard, Hanau order of battle, Étienne Marie Antoine Champion de Nansouty ... find all
- 27 - wikt:župans - Albanian nobility, Ban (title), Bulgarian–Serbian wars of 917–924, Byzantine–Bulgarian war of 913–927, Constantine Bodin ... find all
- 27 - wikt:mukhamantapa - Bankapura, Bhimeshvara Temple, Nilagunda, Chennakeshava Temple, Hullekere, Chennakeshava Temple, Turuvekere, Dah Parvatiya ... find all
- 27 - wikt:liveshows - Andrea Renzullo, Cẩm Ly, Deutschland sucht den Superstar, Deutschland sucht den Superstar (season 10), Deutschland sucht den Superstar (season 12) ... find all
- 27 - wikt:komēs - Anna (wife of Artabasdos), Artabasdos, Aëtius of Amida, Byzantine army, Chartoularios ... find all
- 27 - wikt:faʻafafine - Fa'afafine ... find all
- 27 - wikt:dacoz - Epimeria ... find all
- 27 - wikt:būta - Bunt (community), Buta Kola ... find all
- 27 - wikt:alcippus - Danaus (butterfly), Danaus chrysippus, List of butterflies of Benin, List of butterflies of Burkina Faso, List of butterflies of Cameroon ... find all
- 27 - wikt:achilid - Abas unipunctata, Achilidae, Achilus (planthopper), Acus (planthopper), Bunduica (planthopper) ... find all
- 26 - wikt:дивизија - 11th Air Defense Division, 13th Air Defense Division, 15th Air Defense Division, 21st Aviation Division, 29th Aviation Division (Socialist Yugoslavia) ... find all
- 26 - wikt:śikhā - Chudakarana, Nambudiri, Sikha, Sthanika Brahmins ... find all
- 26 -
wikt:muwaqqits - Muwaqqit... find all → defined - 26 - wikt:monocable - 1990 Tbilisi aerial tramway accident, 3S Cable Car, Awana Skyway, Denniston, New Zealand, Emirates Air Line (cable car) ... find all
- 26 - wikt:lyrium - Characters of Dragon Age: Inquisition, Dragon Age II, Dragon Age: Knight Errant, List of Dragon Age characters, Varric Tethras ... find all
- 26 - wikt:kurakas - Cacique, Calchaquí, Diaguita, Efraín Trelles, Indian reductions in the Andes ... find all
- 26 - wikt:fylkesordfører - Administrative divisions of Norway, Arnfinn Nergård, Audun Tron, County council (Norway), County municipality (Norway) ... find all
- 26 - wikt:diplexed - In-band on-channel, KEAR (AM), KEST, KFWB, KQFN ... find all
- 26 - wikt:cypsellae - Felicia aethiopica, Felicia amelloides, Felicia amoena, Felicia annectens, Felicia bellidioides ... find all
- 26 - wikt:chatushruti - Amritavarshini, Andolika, Atana, Devagandhari, Dheerashankarabharanam ... find all
- 25 - wikt:ваздухопловна - 1st Air Command, 21st Aviation Division, 29th Aviation Division (Socialist Yugoslavia), 32nd Aviation Division, 37th Aviation Division (Socialist Yugoslavia) ... find all
- 25 - wikt:σαρκα - Codex Alexandrinus, Codex Athous Lavrensis, Codex Boernerianus, Codex Claromontanus, Codex Glazier ... find all
m* 25 - wikt:mukhamandapa - Architecture of Karnataka, Arjuna Ratha, Baroli Temples, Chavundaraya Basadi, Group of Monuments at Mahabalipuram ... find all
- 25 - wikt:külliyye - Külliye ... find all
- 25 - wikt:kubbs - Kubb, The Amazing Race 6 ... find all
- 25 - wikt:kombonis - Komboni ... find all
- 25 - wikt:khaet - Administrative divisions of Cambodia, Banteay Meanchey Province, Battambang Province, Cambodia, Geography of Cambodia ... find all
- 25 - wikt:jatras - Banawadi, Barowari, Culture of West Bengal, Jatra (theatre), Kathmandu District ... find all
- 25 - wikt:hōlua - HaMerotz LaMillion 6, Hawaiian lava sledding, Holualoa Bay, Honokōhau Settlement and Kaloko-Honokōhau National Historical Park, Keauhou Holua Slide ... find all
- 25 - wikt:hrvatskoga - Croatian Vukovians, Croatian language, Eduard Hercigonja, Etymological dictionary, Franjo Marković ... find all
- 25 - wikt:haltijas - Finnish paganism, Haltija, Haltya ... find all
Cases with notes from older dumps:
- 45 - wikt:groundcolour - Anaxyrina cyanopa, Asura euprepioides, Carposina maritima, Choreutis porphyratma, Coptotelia margaritacea ... find all
- 25 (down from 53) - wikt:οτι - Lectionary 12, Lectionary 239, Lectionary 240, Matthew 28:5–6, Minuscule 2427 ... find all
- These all appear to be the Greek word 'οτι', which does not appear in wikt without breath marks. That is, see wikt:ότι, which then mentions forms wikt:τι, wikt:ὅτι, wikt:ό,τι.
- It would appear then that the proper action is to mark all these quoted Greek texts with {{lang}}? ::Also, I think I'll ask over at wikt if it would be reasonable for them to have an entry for wikt:οτι. They do have an entry for wikt:oti, which mentions at least wikt:ότι and wikt:ό,τι, but not wikt:ὅτι. (sigh) Oh what a tangled web we wind, when first we endeavor these defined. Shenme (talk) 04:30, 13 October 2019 (UTC)
- Additionally, many (all?) of these appear to be 'biblical' == classical == ancient Greek, which has ISO 639-2 code 'grc'. Modern Greek is ISO 639-1 code 'el', ISO 639-2 code 'gre'. Shenme (talk) 04:49, 18 October 2019 (UTC)
- Ah, but not all. Some found with search are modern Greek, so lang|el, and some 'oti' found having breath marks. Currently searching using "οτι" -insource:"lang|grc" -insource:"lang|el" -insource:"lang|gre" and working on labelling any form of Greek. Shenme (talk) 02:27, 20 October 2019 (UTC)
- 32 (down from 39) - wikt:āgamas - Anekantavada, Antakrddaasah, Anuttaraupapātikadaśāh, Aupapatika, Bhairava ... find all - see agama. It is normal to add s to pluralize many Sanscrit words. Johnbod (talk) 12:22, 29 January 2020 (UTC)
- 121 (down from 230) - wikt:æftiʀ - Ardre image stones, Aringsås Runestones, Arkils tingstad, Asferg Runestone, Ballstorp Runestone ... find all
- I don't think Old Norse entries with ʀ are allowed (they are either presented in Runic or normalized to r) on Wiktionary; the solution is to language-tag instances on here (generally as Old Norse although glancing at a few, it seems the articles/infoboxes helpfully specify which language it is in each case). -sche (talk) 20:13, 18 November 2018 (UTC)
- 133 - wikt:tetartos - Archon (Gnosticism), Byzantine music, Echos, Hagiopolitan Octoechos, Nana (echos) ... find all
- 26 - wikt:zeitlose - Count of St. Germain, Martin Werhand, Martin Werhand Verlag, St. Germain (Theosophy) ... find all
Likely new compounds by frequency (n-z)Edit
(Waiting for next dump; only words with manual notes are shown below.)
- 35 - wikt:woges - Blond Ambition (Grimm), Chupacabra (Grimm), Clear and Wesen Danger, Cold Blooded (Grimm), Death Do Us Part (Grimm) ... find all
Most common words with slashesEdit
This is a special manual report from the 2019-08-20 dump. -- Beland (talk) 00:37, 31 August 2019 (UTC)
Compound units of measure, probably eligible for Wiktionary:
- 59 - wikt:µmol/l - Aflatoxin B1, Apalutamide, Argininosuccinic aciduria, Bilirubin, Biogenic silica ... find all → I made a redirect for this on 9 April 2019, so why is this shoing up here? Wiktionary does not appear to welcome this kind of symbol with "/". Graeme Bartlett (talk) 07:51, 9 September 2019 (UTC)
- 27 - wikt:kwh/m² - Andasol Solar Power Station, Cost of electricity by source, Gavdos, Heinrich Böll Foundation, IEA Solar Heating and Cooling Programme ... find all
- 12 - wikt:kw/m² - Photovoltaic system, Solar constant, Solar thermal energy, Solar updraft tower, Sunlight ... find all
Probably need correcting or tagging in articles, per MOS:SLASH:
- 13 - wikt:ръ/рь - Belogradchik dialect, Botevgrad dialect, Breznik dialect, Central Balkan dialect, Dupnitsa dialect ... find all
For WiktionaryEdit
This is a special section; putting a Wiktionary link here will cause a word to be ignored by the spell checker everywhere it appears (on the assumption it will soon be added to Wiktionary.)
Vocab pagesEdit
- 61 - Boontling - wikt:bahlness, wikt:beelch, wikt:beemsch, wikt:beeljeck, wikt:belhoon, wikt:blooch, wikt:bloocher, wikt:breggo, wikt:borp, wikt:bowgley, wikt:burlapping, wikt:chigrel, wikt:cloddies, wikt:comoshe, wikt:condeal, wikt:crazeek, wikt:deeger, wikt:deejy, wikt:dehigged,wikt:dissies, wikt:donicker, wikt:donagher, wikt:dreek, wikt:dreeked, wikt:dreeking, wikt:dulcey, wikt:eeld, wikt:eesole, wikt:haireem, wikt:heelch, wikt:pockety, wikt:higged, wikt:higgied, wikt:hobneelch, wikt:keishbook, wikt:kimoshe, wikt:kingster, wikt:madging, wikt:modocker, wikt:moldune, wikt:moldunes, wikt:nettied, wikt:nonch, wikt:oshtook, wikt:peeril, wikt:pusseek, wikt:rawncher, wikt:seertail, wikt:sirtle, wikt:sharkin, wikt:shoveltooth, wikt:somersetting, wikt:steedos, wikt:teebow, wikt:tuddies, wikt:tuddish
- 43 - English words first attested in Chaucer - wikt:attourne, wikt:feminie, wikt:gigge, wikt:louke, wikt:emprent, wikt:enbaissing, wikt:ensampler, wikt:entach, wikt:entech, wikt:entalent, wikt:eschaufe, wikt:festivally, wikt:foleye, wikt:forline, wikt:formly, wikt:fortunel, wikt:fortunous, wikt:habitacule, wikt:hustlement, wikt:necess, wikt:overwhelve, wikt:plungy, wikt:portionable, wikt:presentary, wikt:previdence, wikt:purveyable, wikt:rhetorian, wikt:slead, wikt:troublabla, wikt:unbetide, wikt:undoubtous, wikt:unleeful, wikt:unmovablety, wikt:unparegal, wikt:unplite, wikt:unweened, wikt:vengeress, wikt:weeply, wikt:witnessfully
- 9 - Longest word in English - wikt:broughammed, wikt:subdermatoglyphic, wikt:gravedinously, wikt:shakalshas, wikt:galahads, wikt:leucocytozoans, wikt:quiaquia
- wikt:tragediously: I moved this one out of the list above because it seems to have only ever been used once, by Aston Cockayne, whereas Wiktionary only includes English words that have been used by three different people. -sche (talk) 21:00, 13 August 2020 (UTC)
- moved out from the list for English words first attested in Chaucer, this is apparently a misspelling of prentishood as Chaucer spelled it (wikt:prenticehood) --Xurizuri (talk) 12:55, 7 January 2021 (UTC)
- wikt:scorkle (from English words first attested in Chaucer list) apparently used to be on wikt then got RfD'd
0-9Edit
- 1 - 2008 Dublin Senior Football Championship - wikt:sline: Gaelic Football notation ('sideline') \\ this is on wikt but not for this meaning --Xurizuri (talk) 13:06, 7 January 2021 (UTC)
- 1 - 1830–1831 papal conclave - wikt:unvetoed - not a typo
- 3 - 1842 Wallachian princely election - wikt:sortitioned - past tense verb form of sortition
- 1 - 2000 and Whatever - wikt:auspOp: the name of a web site. Ira Leviton (talk) 16:21, 24 September 2019 (UTC)
- 1 - 2018 in Germany - wikt:indiologist - seems to be a real word
- 1 - 42 (dominoes) - wikt:renegger: a term used in the game and defined in the article. Ira Leviton (talk) 20:59, 26 September 2019 (UTC)
- I think this belongs in the dictionary, as a derivation of wikt:reneg -- Beland (talk) 01:48, 24 March 2020 (UTC)
- 1 - 2015 African Youth Athletics Championships - wikt:octathlete - competitor in an octathlon*
- 1 - 1607 - wikt:pallisadoed - a real word
- 1 - 17th Armored Engineer Battalion - wikt:chespaling- "chespaling mat" is a real term for a type of field matting
- 2 - 1980 Quebec referendum - (probably OK: wikt:regroupments) - conscious adaptation of a French word specifically in the context of the politics of Quebec
- 3 - 1854 Broad Street cholera outbreak - wikt:vibriones, wikt:vibriones, wikt:vibriones - a real word
- 1 - 1894 United States House of Representatives elections - wikt:silverist - if this is a real word, it means an American political faction
- 1 - 1938 NSWRFL season - wikt:trygetters - conceivably a real word (Australian)
- 2 - 1957 in jazz - wikt:sazabo - a Turkish musical instrument
- 2 - 1968–69 Mersin İdmanyurdu season - (probably OK: wikt:maçkolik) = maçkolik.com (Turkish sports website)
- 2 - 1st Aeromedical Evacuation Squadron - wikt:aeromedically, wikt:aeromedically - if "aeromedical" is an adjective, no reason why "aeromedically" can't be an adverb
- 1 - 2001 Taiwan legislative election - wikt:reunificationist - must surely be a real word = "supporter of reunification"
- 1 - 2003 Somaliland presidential election - wikt:mistabulation - a real word
- 1 - 2NU - wikt:synclaver - this is a common spelling, so I've left it, but it may nevertheless be a typo for "synclavier"
- 1 - 3x3 - wikt:unusabilities) - "unusability" is already in Wikt as an uncountable, but this may be some special IT use, so left as is for now
- 1 - 2014 in Costa Rica - wikt:unjournalistic: this word seems OK. Ira Leviton (talk) 02:05, 30 September 2019 (UTC)
- 2 - 2016 PSOE crisis - wikt:officialists, wikt:officialists: a name given to a faction in this crisis (opposed to the "critics". Ira Leviton (talk) 21:17, 2 October 2019 (UTC)
- 6 - 2008 Murshidabad beheading - wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi, wikt:shalishi = "shalishi court", which is a kangaroo court in India
- 1 - 2008 TC3 - wikt:polymict - a real word
- 2 - 2010–11 Reading F.C. season - wikt:backheeler - never seen it as a noun but no reason why not
- 1 - 2012 Ingleside, San Francisco homicide - wikt:undeportable - a real word
- 1 - 2006 Iranian Assembly of Experts election - wikt:provisionist: seems to be a term in Iranian politics. It comes up on Internet searches, but the citation in the article is in Persian. Ira Leviton (talk) 23:53, 29 September 2019 (UTC)
- 1 - 2006 Oregon Ballot Measures 46 and 47 - wikt:unobligated: OK, in an arcane financial way. Ira Leviton (talk) 23:53, 29 September 2019 (UTC)
- 1 - 20th Lancers (British Indian Army) - wikt:risallahs - word for an Indian cavalry unit - please add to Wikt
- 1 - 24/7 service - wikt:rehumanisation - apparently a word in the service sector
- 1 - 251st Cyberspace Engineering Installation Group - wikt:remissioning - cyberspace jargon
- 1 - 2017–18 Taça da Liga - wikt:repechaged - "repechage" is fine as a noun; "repechaged" is occasionally used, but is not necessarily correct, so leaving this here for a second opinion
- 4 - 3D cell culturing by magnetic levitation - wikt:adipospheres - real scientific term
- 1 - 1973 Soviet economic reform - wikt:derationalisation: seems OK in context and with British spelling. Ira Leviton (talk) 15:17, 29 September 2019 (UTC)
- 4 - 009-1 - wikt:cybernetized - intended for "adapted into a cybernetic form" but probably not a real word
AEdit
- 1 - A. carbonaria - wikt:varay: seems like a real word.
- This is part of the common name for the species, so I think that belongs in Wiktionary? If not, we would normally create a redirect from cotton varay to Albizia carbonaria and that would take care of it, but the latter hasn't been created yet. -- Beland (talk) 17:27, 13 October 2018 (UTC)
- 1 - Adana Center for Arts and Culture - wikt:ampire: possible Ottoman architectural style?
- 1 - Adeline's Dream - wikt:soddle - Possibly means 'soddy' a nickname for houses made of sod (earth and grass), potentially also Germanic slang for the same.
- 1 - African-American Vernacular English - wikt:fixina: dialect-specific
- Yes, but possibly too rare to meet Wiktionary Criteria For Inclusion in this specific spelling; we do have wikt:fixing to, wikt:finna and others. -sche (talk) 03:06, 28 November 2018 (UTC)
- 1 - Aina Wifalk - wikt:manuped: name for an invention.
- 1 - Alash Ensemble - wikt:limpi: old musical instrument?
- 1 - Allelopathy - wikt:allelo-: used as a prefix to explain a word root. (Wiktionary does have prefixes and suffixes.)
- 1 - Alice Holt Forest - wikt:hangra: an Old English word.
- 1 - Antas de Ulla - wikt:liscos: a Spanish word, possibly regional, for a type of dish of bacon bits.
- 2 - Amsterdamseweg - wikt:banpole, wikt:banpole: a type of monument or marker to indicate how far criminals were allowed to approach city. Western Europe, Netherlands. Not sure if it should be one or two words.
- 1 - Andrew Glover (composer) - wikt:aleotory: evidently a real word, although I can't define it. See https://aleacounterpoint.wordpress.com/2010/06/08/orpheus/
- 1 - ArmSCII - wikt:yiwn - normal Armenian ech (yech) and yiwn (vyun) small letters pair
- 1 - Arto Tunçboyacıyan - wikt:blul: an Armenian musical instrument, described as the same as or similar to a sring
- 1 - Arts and Science Center for Southeast Arkansas - wikt:seriographs: seems like a legitimate word. -> There's a Wikipedia article; Wiktionary needs the word and its plural.
These were checked for misspellings and determined to be OK. They need to be added to Wiktionary or an exclusion list:
- Apo (drink) - wikt:wiyu - checked, OK
- Apocalypse (Star Wars novel) - wikt:drochs - checked, OK
- Apodemia mormo langei - wikt:psychicola - checked, OK
- Apolinar's wren - wikt:twii (probably OK: wikt:tchorr) - checked, OK \\ w/o having checked, I'll bet all my savings that these are bird sounds --Xurizuri (talk) 13:24, 7 January 2021 (UTC)
- Aporia hippia - wikt:taupingi - checked, OK
- Aposthia - wikt:aposthic, wikt:aposthic, wikt:aposthic, wikt:aposthic - checked, OK
- Apotomops rhampha - wikt:rhamphos - checked, OK
- Apotropaic mark - wikt:trepein (probably OK: wikt:apotrepein) - checked, OK
- Appendix Probi - wikt:denasalised, wikt:numqua - checked, OK
- Appendix Vergiliana - wikt:keirein - checked, OK
- Appias ada - wikt:thasia (probably OK: wikt:tindalti) - checked, OK
- Apple Blossom Handicap - wikt:distaffers - checked, OK
- Apple of my eye - wikt:iyshown, wikt:iyshown, wikt:iyshown - checked, OK
- Application Enhancer - wikt:haxies, wikt:haxies, wikt:haxies - checked, OK
- April 2009 Moldovan parliamentary election protests - wikt:episodul - checked, OK
- April Daniels - wikt:andrn - checked, OK
- Aprosphylosoma - wikt:julidan - checked, OK
- Aptamer - wikt:aptabodies (probably OK: wikt:postranslational, wikt:trxA) - checked, OK
- Aptenia - wikt:ptenos - checked, OK
- Apulet - wikt:apulettes - checked, OK
- Aqraba, Nablus - wikt:khirbets - checked, OK
- Aqua Virgo - wikt:vinustas - checked, OK
- Aquatic garter snake - wikt:zaxanthus - checked, OK
- Aquilarhinus - wikt:palimentus - checked, OK
- Aquilino Ribeiro - wikt:encoiradas - checked, OK
- Arab Street - wikt:pukadai, wikt:sadkku - checked, OK
- Arabana people - wikt:wadlu, wikt:wagka (probably OK: wikt:woqka) - checked, OK
- Araeosoma - wikt:dactylous (probably OK: wikt:brunnichi) - checked, OK
- Aralez (mythology) - wikt:aralezes, wikt:aralezes, wikt:aralezes, wikt:aralezes - checked, OK
- Arancini - wikt:bburru - checked, OK
- Arapian - wikt:kavourma, wikt:loutza, wikt:caseri, wikt:chasapaki - checked, OK
- Araripedactylus - wikt:daktylos - checked, OK
- Araucaria biramulata - wikt:biramule - checked, OK
- Arbore people - wikt:kyrnat, wikt:qawots (probably OK: wikt:chirnan, wikt:morqo, wikt:qawot) - checked, OK
- Arboretum La Alfaguara - wikt:manleb (probably OK: wikt:euromericana) - checked, OK
- Arbostola - wikt:heuritica - checked, OK
- Arbutus unedo - wikt:kocimare (probably OK: wikt:komaròs) - checked, OK
- Arbuzov - wikt:arbooz - checked, OK
- Arca (bivalve) - wikt:kauaia (probably OK: wikt:koumaci) - checked, OK
- Arcadian League - wikt:myrioi - checked, OK
- Archaefructus - wikt:eoflora - checked, OK
- Archaeocyon - wikt:leptodus (probably OK: wikt:falkenbachi)- checked, OK
- Archaeocyte - (probably OK: wikt:collencytes) - checked, OK
- Archaeognatha - (probably OK: wikt:koryphē) - checked, OK
- Archaeoindris - (probably OK: wikt:collodiaphyseal) - checked, OK
- Archaeology of Qatar - wikt:rawdas - checked, OK
- wikt:archaios (from Archaeornithomimus, Archaeornithoides, Archaeoistiodactylus, Archaeoindris, Archaeognatha, Archaeocyte) used to be in wikt but was deleted for being a frivolous entry.
BEdit
- 1 - Balloon light - wikt:tuboid: legitimate word, meaning resembling or like a tube.
- 1 - Batog - wikt:batogs: legitimate word.
- 1 - Battered (band) - wikt:burkies: probable legitimate slang use of a word.
- 1 - Bathford - wikt:drayning: old English spelling of draining.
- 1 - Baths of Agrippa - wikt:quadran: a Roman bronze coin worth one quarter of an as.
- 1 - Bathtub boat - wikt:tubbers: somebody who races in bathtubs; a bathtub racer.
- 2 - Baju Kurung - wikt:sampin, wikt:sampin: probably a Malay word.
- 1 - Bruce Kiskaddon - wikt:creakin - many in' word endings: can they be special-cased?
- 1 - Buhler Group - wikt:gristing - needs to be in wikt (it's what grist mills do)
- 1 - Butts Up - wikt:savies - defined in artile. Slang/sports term. Elfabet (talk)
- 1 - Béton brut - wikt:shetting - defined in article with source, should be added, probably. Elfabet (talk)
- 1 - Brisket - wikt:brusket: Middle English.
- 1 - Bourgueticrinida - wikt:cirrals: part of a sea lily (plural).
- 1 - Bible and Orient Museum - wikt:ethnologica: old-fashioned, but OK.
- 2 - Big-bang firing order - wikt:twingles, wikt:twingles: plural for twingle, a type of engine re-engineered to have cylinders fire simultaneously instead of alternately.
- 1 - Birstall, West Yorkshire - wikt:byrh: Old English.
- 1 - Black Shuck - wikt:skuh: Old English.
- 1 - Blagdon - wikt:bloec: Old English, meaning 'black' or 'bleak'.
- 1 - Blasius Merrem - wikt:carinates: an term for flying birds, with a keeled sternum.
- 1 - Bellwether - wikt:bellewether: Middle English spelling, used as an example in the article.
- 1 - Berberis canadensis - wikt:glaucose: used correctly in the article - it's a word.
- 1 - Bergstedt - wikt:stedt: used as a suffix, as in -stedt.
- 1 - Bertram de Criol - wikt:constabularie: Old English spelling.
- 2 - Band government - wikt:treatied, wikt:untreatied: I'm not sure if this is proper use of the word treaty.
- 1 - Barbiturate - wikt:tooties: slang for barbiturate, as mentioned in the article.
- 1 - Bahaba - wikt:chaptis: a common name taken from a species name.
- 1 - Businessman (film) - wikt:flexies - unsure
- 1 - Breakscore - wikt:zouching: zouch is a term for a scoreless roll in the game, maybe in other games too.
- 3 - Book signing - wikt:ereading, wikt:ereading, wikt:ereading: short for "electronic reading".
- 1 - Barney Berlinger - wikt:septathlon: used in the newspaper article used as a reference.
- 2 - Bathtub racing - wikt:tubbers, wikt:tubbers: a bathtub racer.
- 1 - Battle of Byczyna - wikt:elears: a pretty obscure word. A type of cavalry fighter (and plural).
- 2 - Bay of Sielmönken - wikt:warfts, wikt:warfts: a type of Northern European artificial dwelling mound - see Terp
- 2 - Blood and Thunder (comics) - wikt:squigs, wikt:squigs: a type of character in a game or comic. Short for "squiggly beast".
- 1 - Belmond Las Casitas - wikt:colcas: Spanish or a local Indian word for the mud and stone granaries built into the cliffs or caves, and for which the Colca Valley is named. Plural.
- 1 - Battle-Pieces and Aspects of the War - wikt:outly: this appears to be a typo in a source copied online. I can't locate the original source, nor figure out what the word should be. I don't want to mark it with [sic] because I don't know if there's an error in the original source.
- 27 - wikt:adobong - Cabalen, Ipomoea aquatica, Kapamilya, Deal or No Deal, Philippine adobo, Squid as food ... find all
- A Filipino word derived from adobo meaning cooked in a marinade. I redirected, but if someone knows how to add these to Wiktionary, pleaes do it.
CEdit
- 2 - Capriccio (art) - wikt:quadratture: may be mispelled (one 't'). The same word applied to math and other fields has one 't', but I'm not sure about this. I'll leave it to the art experts.
- 1 - Catherine de' Medici's building projects - wikt:priants: correct, people kneeling in prayer.
- 1 - Chattanooga Choo Choo (film) - wikt:chuggin: in a movie tagline, as "chuggin' "
- 1 - Cheese on toast - wikt:choast: slang for cheese and toast, explained in the article.
- 1 - Cheltenham - wikt:cilta: word root explaining the etymology of the town name.
- 1 - Cardinal Vicar - wikt:vicegerens: I'm not sure if this is legit, or it should be vicegerent.
- 1 - Catopta saldaitisi - wikt:stroky: quoted correctly from a citation.
- 1 - Cusec - wikt:cufm - unit of flow rate. stands for 'cubit feet per minute'. Includable, though uncommon.
- 1 - Cyclist fatality rate in U.S. by year - wikt:trikkes - A company's name for their 3-wheeled, body-powered transport device that is not copyright, so isn't a proper noun, but almost should be.
- 1 - Cremlingen - wikt:deestablishment - "Until its deestablishment in 1974"
- 1 - Cybermind - wikt:asence - unsure. A made-up word by one author about the presence or lack there-of of people online. Not in common usage. Probably just {notatypo} it and call it day? Elfabet (talk)
- 1 - Current (mathematics) - wikt:comass - unsure. Mathematical term? Defined in article? -- seems to be 'co-mass' Jkgree (talk) 15:46, 20 February 2019 (UTC)
- 1 - Crime in Iran - wikt:toumans - "amounts to 10 trillion toumans a year (1 touman equals 10 rials)"
- 1 - Crinan Canal - wikt:foamin - "Them big foamin' breakers"
- 1 - Cricothyrotomy - wikt:crike - "performs her first “emergency crike” on Camille"
- 1 - Cernach mac Fergusa - wikt:subsept: a subdivision of a tribe. Legitimate use.
- 1 - Chaos (genus) - wikt:uroid: a subcellular portion of an amoeba.
- 1 - Conulariida - wikt:conulate: seems like a legitimate word, although restricted to science.
- 1 - Coregonus maraena - wikt:whitfishappears to be spelled correctlybobdog54 (talk) 19:44, 14 December 2018 (UTC)
- 4 - Coat of arms of Barcelona - wikt:paletts, wikt:paletts, wikt:fomer, wikt:paletts: diminutive of Pale (heraldry)?
- 1 - Coat of arms of the London Borough of Hammersmith and Fulham - wikt:pomels: a heraldic term.
- 1 - Coat of arms of the Prince of Asturias - wikt:bordured: a heraldic term - bordure
- 2 - Coat of arms of the Valencian Community - wikt:paletts, wikt:paletts: diminutive of Pale (heraldry)? - i.e., "pallets" (usual spelling)
- 1 - Ciconiae Nixae - wikt:regionaries: seems legit (plural).
- 1 - Cox Green, Tyne and Wear - wikt:coccs: Old English cocc, crest of a hill.
- 1 - Coagulin - wikt:proxins: a type of protein.
- 1 - Craver Farmstead - wikt:skipples - "In the 1790 lease The Millers agreed to a yearly rent of 24 1/2 skipples of winter wheat"
- Merriam-Webster says this an alternate spelling of wikt:schepel, a Dutch unit -- Beland (talk) 20:56, 14 September 2019 (UTC)
- 1 - Champion (apple) - wikt:shampion: alternative spelling of the Champion type of apple.
- 1 - Chest (furniture) - wikt:wakis short for "wagon-kist".
- 2 - Chester Zoo - wikt:mantellas, wikt:mantellas: plural of mantella, a type of frog.
- 2 - Cheuksin - wikt:jesas, wikt:jesas: a type of Korean ritual.
- 1 - Chhurpi - wikt:durkha: a Nepali type of cheese.
- 1 - Chimantaea - wikt:paramoid: correctly used, according to the page, meaning "like paramo."
- 1 - Chorus line - wikt:twirlies: a term used for girls in a chorus line.
- 2 - Christopher O'Hare - wikt:cremain: short for cremated remains.
- 1 - Chub (disambiguation) - wikt:chubbing: a legislative discussion among several members to waste time and/or block action. Similar to filibustering.
- 2 - Chung Do Kwan - wikt:guep: a level in Tang Soo Do, a Korean martial art.
- 1 - Church of St. James (Brno) - wikt:flanning: architectural term meaning "the internal splay or bevel of a window-jamb."
- 2 - Coat of arms of Nuuk - wikt:siminar, wikt:siminar: the name of a type of building in Nuuk.
- 3 - Cobbler (food) - wikt:cobeler, wikt:sonker, wikt:sonker. Cobeler explains the etymology of cobbler. Sonker is a local North Carolina variation, a cross between a cobbler and a pie.
- 1 - Cipriani Potter - wikt:valzers: in the title of a piece written for piano.
- 1 - Cornish currency - wikt:dynar: an old Cornish currencey.
- 1 - Cornish jack - wikt:labeos: a type of fish (plural).
- 1 - Corymbium - wikt:plampers: a local name for this plant.
- 1 - Cowboy poetry - wikt:quakie: cowboy vernacular, in a poem.
- 1 - Creedmoor Branch - wikt:demapped - "finally being torn up and demapped in the early 1970s."
- 1 - Chain conveyor - wikt:multiflexing
- 1 - Charles Dallas - wikt:mcrt: it's an abbreviation but I don't know for what. (It even has a period.)
- 1 - Cinema of the United States - wikt:photogenia: nots sure if it's a legit word. It's used to mean "the desire to make everything photogenic for social media impact".
- 2 - Cigu Niru - wikt:nirus: transliteration of a Chinese word (plural) of an army unit.
- 1 - Cleobury Mortimer - wikt:clifu: an Old English word, meaning a steep place.
PEdit
YEdit
- 1 - Yacambú National Park - wikt:caramerudo - this is the common name used in Venezuela for Odocoileus virginianus deer (see here) - I'm not sure what to do with it. DferDaisy (talk) 15:55, 4 August 2018 (UTC)
By wordEdit
- wikt:degradated - Southern American English
- 9 - wikt:lavwa - Bélé, Chanté mas, Chouval bwa, Music of Dominica, Music of Martinique ... find all →means "voice" in several; Caribbean creoles.
- wikt:vlně - Czeck (wool)
- wikt:vlne - Slovak
- According to User:Palmyrah on Template talk:Which lang#Patna, used in Horton Plains National Park:
- wikt:patna, (Sri Lankan) English, noun: a plain or, more usually, a hillside covered with patna grass
- wikt:patna grass - a particular kind of grass
- Possible etymology: a similar kind of grass grows in Patna, India, and brooms made from it are used all over India
- wikt:patana, Sinhalese, noun: patna
- Possible etymology: English patna
- patna and patana are both in wikt, but not with these meanings
Most common non-English, missing from English WiktionaryEdit
These words are commonly found in English Wikipedia, are present in a non-English Wiktionary, but are missing from English Wiktionary. Word counts are from English Wikipedia. This is a special report from the 2019-08-20 dump.
- 106 - wikt:стрелковая - 109th Rifle Division (Soviet Union), 10th Guards Motor Rifle Division, 114th Rifle Division (Soviet Union), 121st Guards Rifle Division, 137th Rifle Division (Soviet Union) ... find all
- 101 - wikt:īn - Ab Barik-e Sofla, Kermanshah, Ab Garmak-e Sofla, Khuzestan, Abbasabad-e Moin, Abd al-Mu'in ibn Musa'id, Alhashem-e Sofla ... find all
- 80 - wikt:ει - Ancient Greek verbs, Attic Greek, Axiotta, Celtic deities, Cernunnos ... find all
- 67 - wikt:pasangan - Ba (Javanese), Ca (Javanese), Da (Javanese), Dha (Javanese), Ga (Javanese) ... find all
- 66 - wikt:liveshow - Anders Kobro, Australian Idol 3: The Final 13 – Australian Made: The Hits, Big Brother (Swedish season 5), Bảo Thy, Bằng Kiều ... find all
- 52 - wikt:književnosti - August Kovačec, Bogdan Popović, Borisav Stanković, Božidar Petranović, Bratoljub Klaić ... find all
- 48 - wikt:jazyka - 1818 in literature, Aleš Klégr, Andrey Korolev, Andrey Zaliznyak, Bohemian ... find all
- 46 - wikt:изд - Albena Stambolova, Boris Koyalovich, Church of St Demetrius, Boboshevo, Demyanka River, Fedor Kapelyush ... find all
- 42 - wikt:гвардейская - 100th Guards Rifle Division, 104th Guards Airborne Division, 10th Guards Motor Rifle Division, 10th Guards Uralsko-Lvovskaya Tank Division, 121st Guards Rifle Division ... find all
- 40 - wikt:eskadrila - 122nd Hydroplane Liaison Squadron, 461st Light Combat Aviation Squadron, 462nd Light Combat Aviation Squadron, 463rd Light Combat Aviation Squadron, 464th Light Combat Aviation Squadron ... find all
- 38 - wikt:tunjos - Aguazuque, Bacatá, Eastern Hills, Bogotá, El Dorado, Epítome de la conquista del Nuevo Reino de Granada ... find all
- 38 - wikt:serambi - Al-Wustho Mangkunegaran Mosque, Grand Mosque of Bandung, Great Mosque of Banten, Great Mosque of Malang, Great Mosque of Surakarta ... find all
- 36 - wikt:ουκ - Codex Vaticanus 2061, Lectionary 12, Lectionary 225, Lectionary 240, Matthew 27:6 ... find all
- 36 -
wikt:alongwith - 2018 Rajasthan Legislative Assembly election, Alexander Daniell, Ali Azim, Aminath Rishfa, Apollo Intensa Emozione ... find all- This might be from Indian English? -- Beland (talk) 19:33, 31 August 2019 (UTC)
- Some was, often accompanied by punctuation and capitalisation problems. Just because some Indian editors do not know how to use English does not mean we should accept their spelling mistakes as words. Graeme Bartlett (talk) 21:28, 21 October 2019 (UTC)
- This might be from Indian English? -- Beland (talk) 19:33, 31 August 2019 (UTC)
- 32 - wikt:….. - Ado J. G. Muhammad, David Hobkirk, Devol, Oklahoma, Enhanced Partnership with Pakistan Act of 2009, Hatfield Forest ... find all
- 31 - wikt:ombiasy - Andrianampoinimerina, Antambahoaka, Antemoro people, Bara people, Culture of Madagascar ... find all
- 31 - wikt:kozane - Dō (armour), Japanese armour, Kimura Shigenari, Lamellar armour, Scale armour ... find all
Mineral wordsEdit
Several pages with lists of minerals are showing up as some of the pages with the most detected typos. Below is a list of words from these pages. I'm pretty sure some of them are misspelled, so they all require verification. I don't see anything in wikt:Wiktionary:CFI that would exclude these names; some but not all of them are IUPAC systematic. We could also add Wikipedia stubs or redirects as needed if Wiktionary doesn't want them. -- Beland (talk) 15:36, 30 May 2019 (UTC)
- wikt:aluminoctaoxotrisilicate try aluminum trisilicate octaoxide
- wikt:aluminodecaoxotrisilicate
- wikt:aluminodecaoxytetrasilicate
- wikt:aluminodisilicate
- wikt:aluminohexaoxodisilicate
- wikt:aluminohexaoxosilicate
- wikt:aluminotetraoxosilicate
- wikt:aluminotrisilicate
- wikt:alumotrisilicate
- wikt:berylloalumotrisilicate
- wikt:chloro-potassichastingsite
- wikt:decaoxodihydroxy
- wikt:decaoxotetrasilicate
- wikt:decaoxotriphosphate
- wikt:decaoxotrisilicate
- wikt:decaoxydihydroxy
- wikt:dialuminiosilicate
- wikt:dialuminoctaoxodisilicate
- wikt:dialuminodecaoxodisilicate
- wikt:dialuminodisilicate
- wikt:dialuminohexasilicate
- wikt:dialuminopentaoxosilicate
- wikt:dialuminotrisilicate
- wikt:dialumodisilicate
- wikt:diboro
- wikt:dihydroxoarsenate
- wikt:dihydroxophosphate
- wikt:dihydroxotellurate
- wikt:dioxoarsenate
- wikt:dioxoborate
- wikt:dioxochloride
- wikt:dioxodiarsenate
- wikt:dioxodichloride
- wikt:dioxodifluorine
- wikt:dioxodiphosphate
- wikt:dioxodiselenite
- wikt:dioxofluorine
- wikt:dioxohydroxy
- wikt:dioxophosphate
- wikt:dioxoselenite
- wikt:dioxosilicate
- wikt:dioxosulfate
- wikt:dioxotetrasulfate
- wikt:dioxotriarsenate
- wikt:dioxotriphosphate
- wikt:dioxydecahydroxy
- wikt:dioxydifluorine
- wikt:dioxydihydroxy
- wikt:dioxydodecahydroxy
- wikt:dioxyhydroxy
- wikt:diREE
- wikt:disulfa
- wikt:disulfarsenide
- wikt:ditetraoxosilicate
- wikt:docosahydroxy
- wikt:docosaoxide
- wikt:docosaoxotetrasilicate
- wikt:docosatantalum → valid but too rare for wiktionary
- wikt:dodecahydroxy
- wikt:dodecaoxotetrasilicate
- wikt:dodecaoxotrisilicate
- wikt:dodecaoxychloride
- wikt:dodecaoxytetrasilicate
- wikt:fluoro-potassichastingsite
- wikt:fluoro-potassicrichterite
- wikt:henicosahydrate
- wikt:heptaicosahydrate
- wikt:heptaicosaoxodisilicate
- wikt:heptaoxodivanadate
- wikt:heptaoxopentaborate
- wikt:heptaoxosilicate
- wikt:heptasilicon
- wikt:heptasulfadiarsenide
- wikt:heptatelluride → too rare for wiktionary but appears valid
- wikt:heptawater
- wikt:hexaaluminotetraicosaoxohexasilicate
- wikt:hexacontaoxide
- wikt:hexahydrogen
- wikt:hexahydroxide
- wikt:hexaicosahydroxy
- wikt:hexaoxodiborate
- wikt:hexaoxodisilicate
- wikt:hexaoxopentaborate
- wikt:hexaoxtellurate
- wikt:hexaoxy → needs changing
- wikt:hexaoxydihydroxy
- wikt:hexasulfa
- wikt:hexatricontahydrate
- wikt:hydrodioxoarsenate
- wikt:hydroheptaoxide
- wikt:hydrohexaoxodisilicate
- wikt:hydrophosphate
- wikt:hydrotrioxosilicate
- wikt:hydroxoarsenate
- wikt:hydroxophosphate
- wikt:hydroxyarsenate
- wikt:hydroxyhexaoxodisilicate
- wikt:hydroxypentaoxide
- wikt:hydroxytriborate
- wikt:hydroxytridecaoxodisilicate
- wikt:hydroxytrioxosilicate
- wikt:icosahydrate
- wikt:icosalead
- wikt:icosaoxide
- wikt:icosaoxo
- wikt:icosaoxooctasilicate
- wikt:icosaoxopentasilicate
- wikt:nonadecaoxoctasilicate
- wikt:nonaoxodiarsenate
- wikt:nonaoxodiborate
- wikt:nonaoxodivanadate
- wikt:nonaoxohexaborate
- wikt:nonaoxopentaborate
- wikt:nonaoxosilicate
- wikt:nonaoxotetravanadate
- wikt:nonaoxotrisilicate
- wikt:nonaoxyhydroxytetrasilicate
wikt:octadecaheptasilicate- wikt:octadecaoxide
- wikt:octadecaoxoheptasilicate
- wikt:octadecaoxohexasilicate
- wikt:octadecaoxopentasilicate
- wikt:octaoxodiborodisilicate
- wikt:octaoxoicosahydroxy
- wikt:octaoxopentaborate
- wikt:octaoxotetraborate
- wikt:octaoxotetrasilicate
- wikt:octaoxotrisilicate
- wikt:octaoxotritellurate
- wikt:octaoxydihydroxy
- wikt:octasulfa
- wikt:octasulfadiantimonide
- wikt:octatelluride
- wikt:octatriacontaoxide
- wikt:octauranyl
- wikt:oxoarsenate
- wikt:oxocarbonate
- wikt:oxochromate
- wikt:oxodecachloride
- wikt:oxodiarsenate
- wikt:oxodiborate
- wikt:oxodiphosphate
- wikt:oxodisulfate
- wikt:oxodisulfide
- wikt:oxohydrophosphate
- wikt:oxosulfate
- wikt:oxotetraoxosilicate
- wikt:oxotrisulfate
- wikt:oxydihydroxy
- wikt:oxydinitride
- wikt:oxyhydroxy
- wikt:oxyphosphate
- wikt:oxytrivanadate
wikt:pentadecaoxide- wikt:pentadecaoxohexasilicate
wikt:pentadecaselenidewikt:pentadecasulfidewikt:pentafluorine- wikt:pentaicosahydro
- wikt:pentaicosamanganese
wikt:pentantimonide- wikt:pentaoxodiarsenate
- wikt:pentaoxodiborate
- wikt:pentaoxodisilicate
- wikt:pentaoxotellurate
- wikt:pentaoxotetraborate
- wikt:pentaoxotrivanadate
- wikt:pentaoxoundecaborate
- wikt:pentasulfa
- wikt:pentatetracontaoxooctadecasilicate
- wikt:polytypoids
- wikt:potassic-aluminosadanagaite
- wikt:potassic-aluminotaramite
- wikt:potassicarfvedsonite
- wikt:potassic-chloropargasite
- wikt:potassic-ferrisadanagaite
- wikt:Potassic-jeanlouisite
- wikt:potassium-fluorrichterite
- wikt:protoferro-anthophyllite
- wikt:proto-ferro-suenoite
- wikt:stannotrisilicate
- wikt:stewardite
- wikt:sulfantimonide
- wikt:surkhobite
- wikt:tengerite
wikt:tetraantimonide- wikt:tetrabismuthide
- wikt:tetradecaborate
- wikt:tetradecalead
- wikt:tetradecaoxopentasilicate
- wikt:tetrahydroxoarsenate
- wikt:tetraicosaoxide
- wikt:tetraicosaoxodecasilicate
- wikt:tetraicosaoxotrisilicate
- wikt:tetraluminotetrasilicate
- wikt:tetraoxoarsenate
- wikt:tetraoxoborate
- wikt:tetraoxodichloride
- wikt:tetraoxodiphosphate
- wikt:tetraoxodisulfate
- wikt:tetraoxogermanate
- wikt:tetraoxomolybdate
- wikt:tetraoxoselenate
- wikt:tetraoxosulfate
- wikt:tetraoxotellurate
- wikt:tetraoxotetraphosphate
- wikt:tetraoxovanadate
- wikt:tetraoxozincate
- wikt:tetraoxy
- wikt:tetraoxysilicate
- wikt:tetraoxytetrabismuth
- wikt:tetraoxytriborate
- wikt:tetraselenite
- wikt:tetrastannide
- wikt:tetrasulfa
- wikt:tetrawater
- wikt:triacontaoxide
- wikt:triacontaoxoctasilicate
- wikt:triacontaoxydodecasilicate
- wikt:trialuminotrisilicate
wikt:triantimonidewikt:triarsenide- wikt:triberylohexasilicate
- wikt:triborododecasilicate
wikt:tricosahydratewikt:tridecahydratewikt:tridecaoxide- wikt:tridecaoxoditellurate
- wikt:tridecaoxoheptaborate
- wikt:tridecasulfa
- wikt:trihydronium
- wikt:triicosaoxotetrasilicate
- wikt:trilithiododecasilicate
wikt:trimolybdate- wikt:trioxoarsenate
- wikt:trioxoborate
- wikt:trioxosilicate
- wikt:trioxotellurate
- wikt:trioxotriborate
- wikt:triREE
- wikt:trisulfa
wikt:trisulfantimonide- wikt:triwater
- wikt:undecalead → too rare for Wiktionary
- wikt:undecaoxoheptasilicate
- wikt:undecaoxohexahydrohexaborate
- wikt:undecaoxotetrasilicate
- wikt:undecaoxotitanotetrasilicate
- wikt:zircono
Needs Wikipedia article instead?Edit
- 2 - Anacron - wikt:cronie, wikt:cronie
- 2 - Carpathian Large Carnivore Project - wikt:cntours, wikt:cntours - redlinked company in Romania needs an article.
- 1 - Club Penguin (franchise) - wikt:puffles: plural of a type of character in an online game.
- 33 hexipentisteriruncicantitruncated - a nest of specialized geometrical form names; what to do? → since this is a part of several compound names, it may need a set index or disambig page. If it has use in books, it could go in Wiktionary, but Wikipedia seems to be the source of these geometric terms.
Articles with the most possibly misspelled wordsEdit
These are likely to be lists using non-English-language or technical words.
- For articles that are just lists of species names, please link to the article from Wikispecies:Wikispecies:Requested articles#From_Wikipedia and delete the entry here. Those are now automatically suppressed.
- For non-English-language words, add {{lang}} around the foreign passages and delete the row. Articles that don't do this often have formatting of non-English words that is inconsistent either internally or with the Manual of Style, so this is an easy way to fix that at the same time as helping the spell checker and screen readers do the right things.
(Waiting for new dump; Beland mostly now just tags these with {{cleanup lang}}, {{you}}, or adds them to the Wikispecies request page directly, so you can find plenty of articles in those work queues if you like.)
Possible typos by lengthEdit
Longest or shortest in certain categories are shown, sometimes just for fun and sometimes because they form a useful group. Please use strikethrough (or leave a note) for this section rather than removing lines, to avoid repeating work done while the dumps were being processed. Thanks!
Likely chemistry wordsEdit
(updated from 2020-06-01 dump)
These need to be checked by a chemist and marked {{not a typo}}.
- 84 - wikt:trans-2-hydroxyisoxypropyl-3-hydroxy-7-isopentene-2,3-dihydrobenzofuran-5-carboxylic - Cāng zhú
- 79 - wikt:d-1,2,3,9,10,10a-hexahydro-6-methoxy-11-methyl-4h-10,4a-iminoethano-phenanthren - Controlled Drugs and Substances Act
- 73 - wikt:d-1,2,3,9,10,10a-hexahydro-11-methyl-4h-10,4a-iminoethanophenanthren-6-ol - Controlled Drugs and Substances Act
- 72 - wikt:l-11-allyl-1,2,3,9,10,10a-hexahydro-4h-10,4a-iminoethanophenanthren-6-ol - Controlled Drugs and Substances Act
- 70 - wikt:n-2'-hydroxyoctadecanoyl-2-amino-9-methyl-4,8-heptade-cadiene-1,3-diol - Ramaria botrytis
- 69 - wikt:n-methyl-l-alanyl-l-leucyl-n-methyl-trans-dehydrophenyl-alanyl-glycyl - Tetrapeptide
- 66 - wikt:dihydrofuran-2,5-dione,3β-hydroxy-5α,8α-epidioxyergosta-6,22-diene - Russula densifolia
- 65 -
wikt:tetrafluoroethylene-perfluoro-3,6-dioxa-4-methyl-7-octenesulfonic- Nafion - 65 - wikt:dl-1,2-anhydro-4,5-o-cyclohexylidene-1,2,3/4,5-cyclopentanepentol - 1,2,3,4,5-Cyclopentanepentol
- 63 - wikt:uridine-5'-diphospho-n-acetyl-2-amino-2-deoxy-3-o-lactylglucose - UDP-N-acetylmuramate dehydrogenase
- 59 - wikt:dihydroxy-21-oxa-21-chloromethylpregna-1,4-diene-3,20-dione - List of corticosteroids
- 59 - wikt:cis-5,6-dihydroxy-4-isopropylcyclohexa-1,3-dienecarboxylate - 2,3-dihydroxy-2,3-dihydro-p-cumate dehydrogenase
- 58 - wikt:decahydro-10-methoxy-3,6,9-trimethyl-3,12-epoxy-12h-pyrano - Artemether
- 58 - wikt:anti-7β,8α-dihydroxy-9α,10α-epoxy-7,8,9,10-tetrahydrobenzo - Benzo(j)fluoranthene
- 56 - wikt:hydroxy-17α,21-dimethyl-19-norpregna-4,9-dien-3,20-dione - Trimegestone
- 54 - wikt:s-adenosyl-l-methionine:3-hexaprenyl-4,5-dihydroxylate - Hexaprenyldihydroxybenzoate methyltransferase
- 54 - wikt:cis-11,12-dichloro-9,10-dihydro-9,10-ethano-2-anthroic - Field effect (chemistry)
- 53 -
wikt:n-butanoxy-5β,19-epoxycucurbita-6,23-diene-3β,25-diol- Cucurbitane - 53 -
wikt:alanyl-2-aminoethyl-2,3-dipalmitoylglycerylphosphoric- Mifamurtide - 52 -
wikt:n-ethyl-n-methyl-1-methyl-3,3-di-2-thienylallylamine- Ethylmethylthiambutene
Probable DNA sequencesEdit
If you're sure this is a DNA or RNA sequence, tag it {{DNA sequence}}.
(All fixed; waiting for 2020-06-20 dump!)
Repeating patternsEdit
For rhyme schemes, they probably need to be re-styled to follow Wikipedia:WikiProject Poetry#Style for rhyme schemes. If this ends up making them all-caps, they won't show up here on the next run. For mixed-case rhyme scheme notations, use {{not a typo}} after making sure dashes, commas, and spaces follow the recommended style.
(2020-05-20 dump all fixed; waiting for 2020-07-01 dump)
For Beland todoEdit
- Rhyme scheme hunting:
- Sync style for articles in Category:Stanzaic form and Category:Rhyme and add to rhyme scheme list if appropriate.
- Sync annotation style for articles that mark up poems line-by-line (use tables, not column divs or parens)
- Manually search for patterns like:
- a-b-a-b-a-b-c-c
- AB,CD,AB (internal rhyme)
- "aa", "ab", "aaa", "aab", "aba", "abb", "abc", "aaaa", "aaba", "aabb", "aabc", "abaa", "abab", "abba", "abca", "abcb", "abcc", "abcd" - probable rhyme sequences where there's an article present so it's not detected as a misspelling
False positivesEdit
Is there a word that is correctly used in an article, but which shouldn't be added to Wiktionary? List it here, and Beland will fix the problem.
Archived solutions: Wikipedia:Typo Team/moss/Archive
False negativesEdit
Is there a misspelled word in an article mentioned here that was not reported? Feel free to list it below and Beland will try to improve the code if appropriate.
These are currently over-ignored, but could be used to suggest correct spellings:
- Wikipedia articles with {{R from misspelling}}, {{R from incorrect name}}, {{R from miscapitalisation}}, and redirects to these templates
- Wiktionary entries that are known misspellings (e.g. wikt:anticiliary)
- In cases where there are variant spellings of the same word or phrase, Wikipedia should probably pick one and stick to it except to mention the variants. This happens with:
- Compound words - whether to use a space, dash, or nothing, as in "junebug" vs. "june bug" or "email" vs. "e-mail".
- Words with multiple transliterations from another language (often there are multiple systems, no particular system, or a modern system different from historical systems).
- Redirects with {{R from alternate spelling}} and redirects to that template.
- Article Ana Recio Harvey | detected misspelling: appoinment | additional, undetected misspelling: enterpreneur
- Looks like this was because of redirects with "enterpreneur" in the title. I have tagged them all {{R from misspelling}}, but I'll have to change the code to ignore those, as noted above. Thanks for catching that! -- Beland (talk) 23:52, 18 October 2018 (UTC)
Archived notesEdit
Mismatched markup and punctuationEdit
Errors in punctuation (mostly quotation marks) and wiki markup generally cause confusion for readers, and also prevent the spell checker from running on these articles.
Inches and feet should not use " and ', per Wikipedia:Manual of Style/Dates and numbers#Specific units; use letters instead. (See MOS:UNITS for general guidance.) Where conversions are needed, use {{convert}}, for example: 2 feet 3 inches (69 cm)
WORK IN PROGRESS
- Integrating these with main listings
- Filter only unmatched " for now
- Filter articles with non-ASCII quote marks to a separate list for JWB processing
- Filter \d" and \d' to a separate sublist for inch/feet style conversion
- Explain ✂ or skip snippets showing this
- Bracketbot web UI seems to be down
cquoteEdit
MOS:BLOCKQUOTE says {{cquote}} should be replaced by {{quote}} in articles.
Find all current instances. (35500 transclusions as of October 2019)
Gender-neutral languageEdit
MannedEdit
The word "manned" and related forms like "unmanned" are used in many articles, but is not gender-neutral as required by MOS:S/HE and the NASA style guide. Gender-neutral alternatives include:
- Crewed, uncrewed
- Staffed, unstaffed
- Human spaceflight
- Defended
Not all instances need to be changed.
- Proper nouns should remain the same, like Manned Orbiting Laboratory
- Titles of sources and quotes should remain unchanged.
- If the term itself is being discussed, for example to say that "manned spaceflight" is another way of saying human spaceflight.
- There seems to be consensus on unmanned aerial vehicle that this and related phrases (like unmanned aerial system) should remain intact, since it is much more frequent than "uncrewed aerial vehicle" at the moment. However, when using Wikipedia's voice it is preferred to describe a UAV as "uncrewed" when not using the whole phrase.
- Non-article pages that are retained for historical interest shouldn't be modified if they won't be visible to readers.
- Redirects with this title should be left alone if they are redirecting readers to a gender-neutral title
If the word is found the names of articles and categories (except those with names directly related to UAVs), those should be renamed, and the links changed. Many articles have already been renamed, and the links just need to be updated. (Remember that to rename a category, all the articles in that category must be edited to change their pointers.)
- Coming soon: moss report on "manned" that ignores references, page titles, proper nouns, and consensus-OK phrases.
- Find all instances of "manned" in articles
- Find all instances of "unmanned" in articles
- Find all instances of "manned" in Wikipedia:, File:, Category:, and Portal: (recommended for advanced editors only)
- Find all instances of "unmanned" in Wikipedia:, File:, Category:, and Portal: (recommended for advanced editors only)
Borderline casesEdit
These may need to be discussed before being potentially renamed.
These are generic terms, like Human mission to Mars, as opposed to proper names like Manned Orbiting Laboratory. -- Beland (talk) 19:41, 21 May 2019 (UTC)
- Manned Venus flyby - Based on the NASA style guide, NASA probably would now refer to this as "human Venus flyby" but historical sources say "manned Venus flyby" so that's what the majority of editors commenting on the talk page currently favor. There is some question as to whether the scope of the article concerns a specific mission or this type of mission in general, which is related to the proper name exception (but then the title would be "Manned Venus Flyby"). Compare Colonization of Venus and Human mission to Mars. -- Beland (talk) 19:41, 21 May 2019 (UTC)
Objections in specific cases:
MarriageEdit
Wikipedia:Writing about women § Marriage points out:
- "is the wife of" is less neutral than "is married to" - find all "is the wife of"
- "born to X and his wife Y" is less neutral than "born to X and Y" - approximate search
- "man and wife" is less neutral than "husband and wife", and to be fully neutral the order should be varied - find all "man and wife"
LadiesEdit
Wikipedia:Writing about women § Girls, ladies prefers "women" to "ladies" except where part of set phrases or traditional titles (like first lady). find all lowercase "ladies"
Instructional and presumptuous languageEdit
MOS:NOTE says to avoid the following phrases when they address the reader directly. Not all instances are problematic, such as those in direct quotations.
- remember that - find all "remember that"
- note that - find all "note that"
- of course - find all "of course"
- naturally - find all "naturally" (the meaning "related to nature" is not problematic)
- obviously - find all "obviously"
- clearly - find all "clearly"
- actually - find all "actually"
- rhetorical questions, especially in headings - find all questions in headings (some cases, like the names of works, are not problematic)
Internationally comprehensible spelling and vocabularyEdit
MOS:COMMONALITY advises the use of vocabulary and spellings that are shared across national varieties of English, where possible. This section collects instances where an unshared term is being used which could be improved. For proper nouns and direct quotes, a translation or re-spelling into another dialect may be helpful.
- "gaol" should be "jail"
- Disputed, discussion underway at Wikipedia talk:Manual of Style#Gaol vs. jail
Currency styleEdit
Per MOS:CURRENCY:
- For the UK, Irish, Australian, New Zealand, and South African pound, ₤ should be changed to £
- ₤ is OK to use with Italian lira. Changing e.g. ₤100,000 to [[Italian lira|₤]]100,000 will prevent legitimate uses from showing up in automated reports, and also help readers understand that this is not British pounds. (Mentions of Italian lira are increasingly rare because it has been replaced by the Euro.)
Caution: Not all problem pages show up reliably; if you do a search, fix all the pages in the results, and then do another search, you will probably get a fresh batch of problem pages. It may also take a minute or two for fixed pages to disappear from the results, due to lag updating the search index.
Work is in progress on detecting and fixing other MOS-related issues with numbers and currencies.
Small capsEdit
Per MOS:BCE, smallcaps are not to be used for years like "400 BC". Find all instances of known smallcaps issues...
HTML tagsEdit
Updated from 2020-09-20 dump.
You can do one of two things for these articles:
- Remove, repair, or convert the HTML markup to wiki markup yourself.
- Tag the article {{cleanup HTML}} and it will show up under Category:Articles with HTML markup but not on this list. Use the "tags" parameter to indicate which tags are present on the page; many editors find it hard to locate the offending HTML. For example: {{cleanup HTML|tags=table, cite}}
How to clean upEdit
See Category:Articles with HTML markup for instructions on how to find the offending tags and what to do about them.
Find all articles by tagEdit
Can't wait for the next database dump? Want to look for or fix all instances of a specific tag? Use the links below!
- <tt> - find all
- <li>, <ol>, and <ul> - find all
- <table>, <tr>, <td>, <th>, <caption> - find all
- <i> or <em> - find all
- <dd>, <dt>, and <dl> - find all
- <cite> - find all
- <p> - find all
- <strong> and <b> - find all
- <name=> - find all
- </br> - find all
- <hr> and <hr/> - find all
- <font> - find all
- <ins> - find all
- <samp> - find all
- <q> - find all
- <wbr> - find all and find ­
- <ruby>, <rt>, and <rp> - find all
- Elements and attributes obsoleted in HTML 5 have prefab searches linked from Wikipedia:HTML 5
Additional HTML problems are listed at Special:LintErrors.
Sometimes editors use angle brackets (< and >) for other purposes. Though these are not HTML markup, they often need to be fixed.
<<...>> find all can indicate:
- French quotation marks rendered as <<quoted text>>. These should be normalized to "quoted text" or 'quoted text', even in quotations, per MOS:CONFORM.
- A broken citation that should be converted to {{cite web}})
Other weirdness:
- <the> - find all - More French quoting style, bad linking, bad citation style, etc.
- <blockquote> sometimes shows up on the reports if it is capitalized or all-caps on the article page. It should be all lowercase.
Known bad HTML tags (HB)Edit
These are also included in the main listings.
- 1829 - <tt> - Briefcase (Microsoft Windows), CPAN, Chosen-plaintext attack, Cisco IOS, Comparison of file archivers ... find all
- 1642 - <li> - 2007 UST Growling Tigers men's basketball team, 2019 European Parliament election in Romania, 2019 U.S. Open Polo Championship, 2K Sports Classic, Absolutely convex set ... find all
- 769 - <i> - 2020 HA10, 2020 OV1, A. Ernest Fitzgerald, Adam Wilson, Alexei Verkhratsky ... find all
- 382 - <p> - 2020 Arizona Cardinals season, Abradable powder coatings, Ambiguities in Chinese character simplification, Arthur L. Selland, Assange v Swedish Prosecution Authority ... find all
- 370 - </em> - Alacritty, April 10 (Eastern Orthodox liturgics), April 11 (Eastern Orthodox liturgics), April 12 (Eastern Orthodox liturgics), April 13 (Eastern Orthodox liturgics) ... find all
- 257 - <b> - Affinity chromatography, African-American businesses, Ancient Egyptian royal titulary, Ang Mo Kio - Thye Hua Kwan Hospital, Arnold tongue ... find all
- 162 - <cite> - Commutative ring, Constructivism (philosophy of mathematics), David Lewis (philosopher), Evidentiality, German submarine U-553 ... find all
- 150 - <ol> - 2019 European Parliament election in Romania, Absolutely convex set, Battle of Grand Port, Bianchi classification, Biarc ... find all
- 101 - <strong> - HEPES, Haml, IUsask, Indian locomotive class WCAM-2, International Transtar ... find all
- 74 - </table> - 2K Management, Adventure Time (season 7), Adventure Time (season 8), Ang Probinsyano (season 1), Ang Probinsyano (season 2) ... find all
- 54 - </ins> - Bachelor of Homeopathic Medicine and Surgery, Executions of Kokkinia, Gadsby's Tavern, Guinea, Hemisphere GNSS ... find all
- 42 - <hr/> - Gasparín FC, Helioself, Index of sociology articles, Linha do Alentejo, List of Cleveland Gladiators seasons ... find all
- 41 - <hr> - Jacob Bidermann, Lady Molly of Scotland Yard, List of Air Service American Expeditionary Force aerodromes in France, MySQLi, Nonconvex great rhombicosidodecahedron ... find all
- 19 - </font> - Hayagreeva Rao, Ja'afar of Negeri Sembilan, Jalal Baba, Lefty O'Doul, List of German states by area ... find all
- 14 - <tr> - 2K Management ... find all
- 12 - <td> - 2K Management ... find all
- 9 - </th> - 2K Management ... find all
- 4 - <table> - 2K Management ... find all
- 4 - <q> - Projective line over a ring, The Importance of Being Earnest (1992 film) ... find all
- 4 - </q> - Hand washing, The Importance of Being Earnest (1992 film) ... find all
- 4 - </br> - Horse racing colours in Great Britain, Jacobite Syrian Christian Church, List of structural engineering companies ... find all
- 3 - </dl> - Flip-flop (electronics) ... find all
- (Mediawiki bug in nested table layout?)
Bad link formatting (HL)Edit
These are also included in the main listings. Angle brackets are not used for external links (per Wikipedia:Manual of Style/Computing § Exposed URLs); "tags" like <https> and <www> are actually just bad link formatting. See Wikipedia:External links#How to link for external link syntax; use {{cite web}} for footnotes.
- 123 - <https> - Cow protection movement, Cumberland Compact, Dalyan, Emergency response (museum), Environmental issues on Maury Island ... find all
- 106 - <http> - Charles Arthur Bissonette, Dmitry Gabrilovich, Enoch H. Pardee, Fauna of Montenegro, Glutamate—prephenate aminotransferase ... find all
- 57 - <http/> - E. C. Manning Provincial Park, Eakly, Oklahoma, Fauna of Montenegro, Health impact of light rail systems, In the Next World, You're on Your Own ... find all
- 55 - <https/> - Alfred Scharf, Buzzy Boop, Fauna of Montenegro, Gregory Victor Babic, Guinness Rishi ... find all
- 15 - <www> - Harry W. Brown, Inga alley cropping, John D'Amico Jr., John Kerr (sailor), Journal of West African Languages ... find all
Unsorted (H)Edit
Many of these can be replaced by {{var}} (for text to be replaced) or {{angbr}} (e.g. for linguistic notation).
- 27 - <c> - Bills C-1 and S-1, Cello Concerto No. 1 (Saint-Saëns), Energy development, Fatty alcohol, Lipps–Meyer law ... find all
- 26 - <n> - Gaj's Latin alphabet, Global cascades model, Ingrian language, Japaridze's polymodal logic, Merionethshire ... find all
- 22 - <lf> - HTTP message body, Hayes command set, Hypertext Transfer Protocol, JSON streaming, NMEA 0183 ... find all
- 22 - <e> - Cello Concerto No. 1 (Saint-Saëns), Eawy Forest, Epirote Greek, Grunewald, Hana Ichi Monme ... find all
- 22 - <d> - Cello Concerto No. 1 (Saint-Saëns), DT-Manie, Dataphor, Division algorithm, Experix ... find all
- 22 - <cr> - Carriage return, HTTP message body, Hayes command set, Hypertext Transfer Protocol, NMEA 0183 ... find all
- 21 - <m> - Certificate (complexity), Godfried-Willem Raes, Itô calculus, Kaingang language, Lee You-cheong ... find all
- 18 - <t> - Ancient Roman bathing, Anglo-Saxon riddles, Binary heap, Breitenau concentration camp, C++20 ... find all
- 18 - <cabela> - Cabela's Big Game Hunter 2005 Adventures, Cabela's Big Game Hunter 2006 Trophy Season, Cabela's Big Game Hunter: 2004 Season ... find all
- 17 - <k> - Distributed lag, Dividend policy, Evolution of a random network, Gompertz function, Hub (network science) ... find all
- 17 - </activision> - Cabela's Big Game Hunter 2005 Adventures, Cabela's Big Game Hunter 2006 Trophy Season, Cabela's Big Game Hunter: 2004 Season ... find all
- 16 - <x> - Davenport chained rotations, Epirote Greek, Ethernet over SDH, Head (Unix), Maxima and minima ... find all
- 16 - <no> - 430 Space Shuttle, 57th NHK Kōhaku Uta Gassen, Confederate privateer, Cricket statistics, Kinnikuman Muscle Grand Prix ... find all
- 15 - <y> - Elementary function arithmetic, History of Proto-Slavic, Ingrian language, Languages of Argentina, Meroitic language ... find all
- 15 - <the> - Ahn Bong-soon, American Dairy Association, Boris Palmer, Castillo del Príncipe (Havana), Edgar Ludlow-Hewitt ... find all
- 15 - <number> - Btrieve, Fasti Ostienses, GRAU, Geom raid5, Time control ... find all
- 15 - <l> - Anatolian hieroglyphs, Blind deconvolution, Colonia Tovar, Geometric design of roads, Litema ... find all
- 15 - <encore> - 10th Anniversary Tour Lead Upturn 2012: Now or Never, Lead 15th Anniversary Live Box, Lead Live Tour Upturn 2005, Lead Upturn 2009: Summer Day & Night Fever, Lead Upturn 2010: I'll Be Around ... find all
- 13 - <o> - Alias (TV series), Epirote Greek, Heteronym (linguistics), Immortal Beloved, Ingrian language ... find all
- 13 - <link> - Ars Magica, GIO General, IELTS Life Skills, Listia, MTELP Series ... find all
- 12 - <string> - Abstract Document Pattern, C++ Standard Library, Control flow, Generic programming, Is-a ... find all
- 12 - <pv> - Bamboo Collage, Love Paradox, Softly (song), Sympathy (Hitomi Takahashi album), Vanilla (Leah Dizon song) ... find all
- 11 - <j> - County Kilkenny, Fast wavelet transform, Glenmore, County Kilkenny, John (given name), Mo Bangfu ... find all
- 11 - <ch> - Basel German, Cuban Spanish, New Rumi Spelling, Nivaclé language, Old Saxon Baptismal Vow ... find all
- 10 - <statement> - C syntax, DG/L, XML for Analysis, Zahn's construct ... find all
- 9 - <interlude> - Hall Tour 2014: Bon Voyage, Live Tour 2007: Black Cherry, Live Tour 2015: Walk of My Life ... find all
- 9 - <filename> - Cross File Transfer, Data source name, Ddoc, Leet (programming language), PowerHouse (programming language) ... find all
- 8 - <tl> - Belizean Spanish, Costa Rican Spanish, Guatemalan Spanish, Nicaraguan Spanish ... find all
- 8 - <templatestyles/> - 2018 Joox Thailand Music Awards, 2019 Brit Awards, 2019 Joox Thailand Music Awards, 2020 Brit Awards, 2020 Joox Thailand Music Awards ... find all
- 8 - <personal> - Andrei Katkov, Doc Cheatham, Fort Dunlop, George Air Force Base, Hangar Theatre ... find all
- 8 - <mm> - LRC (file format) ... find all
- 8 - <lol> - Ala Boratyn, Before I'll Die..., Blog 27, LOL (Blog 27 album), Who I Am (Blog 27 song) ... find all
- 8 - <ll> - Languages of Argentina, Literary Welsh morphology, Lj (digraph), Paraguayan Spanish, Spanish verbs ... find all
- 8 - <is> - Gellish, Information model, Modeling language, Semantic data model ... find all
- 8 - <in> - Alexander Bogomazov, Austric languages, Ilocano numbers, Karl Spencer Lashley Award, Kathy Flores ... find all
- 8 - <gallery> - Abada railway station, Flag of Réunion, Hardcover, Immaculate Heart of Mary College-Parañaque, Plakatstil ... find all
- 7 - <z> - Dutch-language literature, General Chinese, Heteronym (linguistics), Leiden Willeram, New Mexican Spanish ... find all
- 7 - <yod> - Modern Standard Tibetan grammar ... find all
- 7 - <year> - AMD Radeon Software, Animecon (Netherlands), Constitutional Court of Korea, Date and time notation in Catalonia, Madras High Court ... find all
- 7 - <us> - HyTelnet, Pinnacle Valley, Yūki Kaneko ... find all
- 7 - <red> - Modern Standard Tibetan grammar ... find all
- 7 - <random> - C++ Standard Library, Sality, Swen (computer worm), Voyager (computer worm) ... find all
- 7 - <iostream> - C++ Standard Library, Criticism of C++, Database Management Library, Microsoft Windows library files, Standard streams ... find all
- 7 - <date> - Battle honour, Carus Publishing Company, Charles E. Fraser, Opera Dragonfly, Radiocarbon dating ... find all
- 6 - <w> - Meroitic script, Myles Goodwyn, Old Saxon Baptismal Vow, South-West Irish English, Uyghur phonology ... find all
- 6 - <video> - Firefox 3.5, List of features in Android, Love Paradox, Unreal Media Server, Vanilla (Leah Dizon song) ... find all
- 6 - <username> - Home directory, MobileMe ... find all
Need debuggingEdit
- 18 - <pre> - Arena (web browser) (ASCII art breaking parsing?), Back-to-back user agent (ASCII art breaking parsing), BagIt, Bally Astrocade, Call graph ... find all
- (These look legit, probably a moss bug. Beland note to self: Run these on wikitext_util functions in an interactive window to find parse breakage.)
Notification of new dumpsEdit
"Most likely misspellings by articles" should always have work to do (if not, ping Beland to add more from the current dump). Some of the other sections are occasionally waiting for a new dump to get a useful list, either because they are ranked by frequency or a code change has been made to clean up noise in the next run. New runs are generally posted twice a month. The database snapshot from the first day of the month generally takes about 9-13 days to process, and the snapshot from the twentieth day of the month might take 4-6 days until it can be posted.
All that said, if you want to get a ping when results from a new dump are posted, you can add your name to the list below. If you are only interested in a particular section, include a note to that effect.
- (add your username to this list)
- Jake The Great!📞talk! 01:40, 18 December 2019 (UTC)
- Sun Creator(talk) 22:21, 19 November 2019 (UTC)
- Puddleglum2.0 (talk) 20:31, 13 October 2019 (UTC)
- Schazjmd (talk) 18:25, 21 December 2018 (UTC)
- bradleyagin (talk) 04:08, 12 January 2019 (UTC)
- Darylgolden(talk) Ping when replying 00:50, 11 February 2019 (UTC)
- MarkZusab (talk) 03:52, 15 February 2019 (UTC)
- Amiodarone talk 20:52, 2 April 2019 (UTC)
- Zojomars (talk) 17:48, 31 May 2019 (UTC)
- Anarhistička Maca (talk) 06:25, 30 June 2019 (UTC)
- Clovermoss (talk) 00:46, 27 October 2019 (UTC)
- JaAlDo (talk) 14:18, 11 March 2020 (UTC)
- Creativecreatr Creativecreatr (talk) 09:56, 26 May 2020 (UTC)
- Voidify (talk) 06:12, 9 June 2020 (UTC)
- Doghouse09 (talk) 20:52, 8 September 2020 (UTC)
- -- spazure (contribs) 09:24, 2 December 2020 (UTC)
- Idell (talk) 21:26, 23 October 2020 (UTC)
- -- *Fehufangą ♮ ✉ Talk page ♮ 12:16, 28 December 2020 (UTC)
moss source codeEdit
moss is written in Python, and is available on github at: https://github.com/cdbeland/moss
Data is obtained from XML database backup dumps.