Differences between versions

Item | Version of 28 May 2013, 15:00 by user Matěj Suchánek | Version of 28 May 2013, 16:59 by user Matěj Suchánek
Basic information
Notes:
Version of 28 May 2013, 15:00:
long words are suspicious

attempt to speed up the filter

damn, there are too many people experimenting; lcase in some cases

"čau" and "sex..." and ":D"

capital letters with an exclamation mark; there must be at least two of them, otherwise table headers would be checked
--
further common experiments
--
regular expression adjustments
--
emoticons only in the main namespace
--
detection of "nuda", but only for small changes
--
new detections
--
detections adjusted
--
"0" excluded from repeated characters
--
detection of "ahoj"-type experiments
--
warning enabled
--
import from en:

Version of 28 May 2013, 16:59:
split into filters #5 and #41, notes pruned
--
"0" excluded from repeated characters
--
warning enabled
--
import from en:

Imported notes, identical in both versions:


Catch copy/paste vandalism like hahahahahahahahahahahahaha. --Wknight94

This doesn't need to be private. -BJ

This has far too many false positives to be set to disallow... Not to mention that it seems to block people from using "..." and "'''''" (for bold/italic), amongst other things. --Conti

It can at least warn.  "'''''''" problem is fixed now.  Added curly brace exception too.  Need example of "..." since at least 7 occurrences are needed to trigger. --Wknight94

Changed title to something more neutral. --Conti

Too many false positives - mostly in reference URLs.  Unless someone has a clever way to fix, this is causing more harm than good IMHO.

Agreed, please do not enable this filter without substantial changes, and substantial log only testing. -Prodego

removing the quote, it was triggered by '''xxxx''' in certain cases -- Luk
The filter should ignore urls. - Ruslik

Ignoring all edits with URLs in them isn't a good idea. I've made it a bit more complex, actually checking if a link contains repeating chars. If any added link contains added chars, it stops. Theoretically we could do all sorts of counting to make sure there isn't a link with repeating chars AND other repeating chars, but I think that would just eat away at needless conditions, considering it is likely very rare. -Prodego

Turning out not to be too useful and the filter is overwrought now.  Disabling.  -Wknight94
Merged testing edit summary from 292 - Shirik 12 Feb 2010
Optimized summary block - Shirik 12 Feb 2010
ignoring if repeated chars was already there, per request -- Jun 18, Soap

Merge filter 366 into this. – Oct 28, EdoDodo
Filter tripped the rate limit. I reset it; I didn't see any FPs. -- Shirik 7 Nov 2010
Reset rate limit. - KoH 10 Mar 2011
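
Several of the notes above turn on how many repetitions are needed before the filter fires and which characters are exempt. The following is a minimal Python sketch of that behaviour, assuming Python's `re` module treats this pattern the same way AbuseFilter's `rlike` does. The pattern is copied from the summary check in the filter conditions on this page; `has_repeated_run` is a hypothetical helper name used only for illustration.

```python
import re

# Assumption: Python's `re` stands in for AbuseFilter's `rlike`.
# Pattern copied from the summary check in the filter conditions:
# a unit of 1-9 allowed characters followed by 7 further copies of itself.
REPEATED = re.compile(r"([^_:*'|=}{-]{1,9})\1{7}")


def has_repeated_run(text: str) -> bool:
    """Return True if some 1-9 character unit occurs 8 times in a row."""
    return REPEATED.search(text) is not None


if __name__ == "__main__":
    for sample in ("ha" * 8, "ha" * 7, "'" * 20, "{" * 20, "." * 8):
        print(repr(sample), has_repeated_run(sample))
```

With `\1{7}` the captured unit must be followed by seven more copies, so eight consecutive occurrences are needed in total. Apostrophes and curly braces never match because they are excluded from the character class, which is how the bold/italic and template false positives mentioned in the notes are avoided.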
Filter conditions
Conditions:
Version of 28 May 2013, 15:00:
article_namespace != 2
&! 'confirmed' in user_groups
& article_prefixedtext != 'Wikipedie:Pískoviště'
&! removed_lines rlike "([^_:*'|=}{-]{1,9})\1{6}" &
(add := lcase(added_lines);
  (add rlike '\bbl([aá]bol|bost)' & edit_delta < 30 |
  add rlike '\b[cč]au\b' & edit_delta < 20 |
  add rlike '\bn[uúů]da\b' & edit_delta < 10 |
  add rlike '\bsex' & edit_delta < 15 & length(add) < 200 |
  add rlike '\bsmrd(ějí?|íš?|í[mt]e)\b' & edit_delta < 30 |
  add rlike '\bne[cč]um(te)?\b' & edit_delta < 30 |
  add rlike '\bredtube\.com\b' |
  add rlike 'bla ?bla' |
  add rlike '\baho+j' & edit_delta < 10 |
  add rlike '[a-z]{50}' |
  added_lines rlike '(\bx|[^tT]:)D+\b' & article_namespace == 0 |
  added_lines rlike '[:;]-?\)+' & article_namespace == 0 |
  added_lines rlike '\b(LO+L|lo+l)\b'|
  added_lines rlike '<3+\b' |
  add rlike '\b(twl|nwm)\b' |
  (rmwhitespace(added_lines) rlike "([^\.0_:*'|=}{-]{1,9})\1{7}"|
    (added_lines rlike "[bcdfghjklmnpqrstvwxz]{7,}" &
    !removed_lines rlike "[bcdfghjklmnpqrstvwxz]{7,}")
  )
  & !(added_lines rlike "\[[Hh]ttps?:\/\/\w*([^_:*'|=}{-]{1,9})\1{7}.*?\]")
) | (rmwhitespace(summary) rlike "([^_:*'|=}{-]{1,9})\1{7}")
  & !("http" in summary)
)

Version of 28 May 2013, 16:59:
! 'confirmed' in user_groups
& article_namespace != 2
& article_prefixedtext != 'Wikipedie:Pískoviště'
&! removed_lines rlike "([^_:*'|=}{-]{1,9})\1{6}"
& (added_lines rlike '\w{50}'
| (rmwhitespace(added_lines) rlike "([^\.0_:*'|=}{-]{1,9})\1{7}" |
(added_lines rlike "[bcdfghjklmnpqrstvwxz]{7,}"
&! removed_lines rlike "[bcdfghjklmnpqrstvwxz]{7,}")
&! added_lines rlike "\[[Hh]ttps?:\/\/\w*([^_:*'|=}{-]{1,9})\1{7}.*?\]"
| (rmwhitespace(summary) rlike "([^_:*'|=}{-]{1,9})\1{7}"))
&! 'http' in summary)
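
The individual text checks that remain in the 16:59 version can be tried out in isolation. The sketch below is a hedged Python approximation, not the filter itself: Python's `re` only approximates `rlike` (for example, `\w` is Unicode-aware in Python), `rmwhitespace` is re-implemented here as a stand-in for the AbuseFilter function of the same name, and the helper names are hypothetical. The way the checks are combined with `&`, `|` and `!` is deliberately left out, since AbuseFilter's operator precedence differs from Python's.

```python
import re

# Patterns transcribed from the conditions above (16:59 version).
LONG_WORD = re.compile(r"\w{50}")
REPEATED = re.compile(r"([^\.0_:*'|=}{-]{1,9})\1{7}")
CONSONANT_RUN = re.compile(r"[bcdfghjklmnpqrstvwxz]{7,}")


def rmwhitespace(text: str) -> str:
    """Stand-in for AbuseFilter's rmwhitespace(): drop all whitespace."""
    return re.sub(r"\s+", "", text)


def has_long_word(added: str) -> bool:
    """At least 50 word characters in a row."""
    return LONG_WORD.search(added) is not None


def has_repeated_run(added: str) -> bool:
    """A 1-9 character unit repeated 8 times; '.' and '0' are exempt.

    Whitespace is removed first, so "ha ha ha ..." counts as one run.
    """
    return REPEATED.search(rmwhitespace(added)) is not None


def has_new_consonant_run(added: str, removed: str) -> bool:
    """Seven or more lowercase consonants in a row that the edit introduced,
    i.e. the removed lines did not already contain such a run."""
    return (CONSONANT_RUN.search(added) is not None
            and CONSONANT_RUN.search(removed) is None)
```

For example, `has_repeated_run("ha ha ha ha ha ha ha ha")` is true because the spaces are stripped before matching, while a long row of zeros or dots is ignored, in line with the note that "0" was excluded from repeated characters.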