User:EranBot - Wikipedia
User:EranBotFrom Wikipedia, the free encyclopedia
|
This
bot
runs on
Wikimedia Toolforge
.
Administrators: If this bot needs to be blocked due to a malfunction, please remember to disable autoblocks so that other Toolforge bots are not affected.
|
EranBot
|
This is a bot account owned by
User:ערן
(aka Eran).
|
|
Copy & Paste detection
|
This is a copy and paste detection bot based on the multi year efforts here
WP:Turnitin
. The bot runs as the background service for CopyPatrol.
How it works
[
edit
]
-
All recent edits to the English Wikipedia over a certain size (+500 [after removing wikicode]) are scanned (that wasn't there in the previous and previous-previous revisions). The text is sent to plagiarism detection service
iThenticate
.
-
Edits with similar text to external sources are consider as possible copyright violations, and are reported in the
CopyPatrol tool
.
-
If the external source is mirror of Wikipedia, it is either removed by iThenticate itself or afterwards by the bot (based on
blacklist
)
-
If the source is broken link the bot removes it
-
Each entry in the report page have the following fields: Title of the edited page, Diff with link to the relevant edit diff and page history, Editor, Source - link to report page in iThenticate (titled "report") and links to possible sources of the edit (titled "compare"), Status - Should be filled manually with TP/FP. The bot adds hints for possible good edits:
-
citation
- the added text mention the source. For short text it is OK (in copyright sense) and for long text it is a violation (see also
Wikipedia:Close paraphrasing
).
-
Mirror?
- the added text comes from a source that may be a possible mirror site of Wikipedia. E.g the source seems to be unknown mirror (that doesn't appear in our blacklist, but have attribution to Wikipedia). Editors can add such sites to
blacklist
, so they don't appear in future.
-
(CC)
- the added text comes from a source that probably have creative commons license.
Page triage
[
edit
]
-
Special:NewPagesFeed
(aka PageTriage) interacts with Copyright bots, such as Eranbot.
-
Use Set Filter => "Copyvio" to review pages with pontential copyright issues
-
Use "Copyvio" link to review the edit in
toollabs:copypatrol
-
Edits that aren't new pages:
Current state
[
edit
]
Currently it works only for the en/es/fr/cs Wikipedias (There may be potential for it to expand to other languages - just ask!). It has been a great help for medical articles. Efforts to make it more functional are ongoing. The results are being placed at
CopyPatrol tool
and the bot runs 24/7.
There is NO plan for this bot to make edits to mainspace. The concept has been discussed with the WMF legal team who are happy with it.
Who runs the bot
[
edit
]
It is run by Hebrew Wikipedian
User:Eran
.
Doc James
and
User:Ocaasi
have been guiding its development.
Source
[
edit
]
The bot is based on pywikibot and you can find its source code in
github
. It is possible to run the bot in other Wikipedia languages, but to run the bot you have to request an account for iThenticate.
|
Feel free to edit this page

Retrieved from "
https://en.wikipedia.org/w/index.php?title=User:EranBot&oldid=914989564
"
Categories
:
Wikipedia bots running on Wikimedia Toolforge
Active Wikipedia bots
All Wikipedia bots
Navigation menuPersonal toolsNot logged in
Talk
Contributions
Create account
Log in
Namespaces
User page
Talk
VariantsViews
Read
Edit
View history
MoreSearch
Navigation
Main page
Contents
Current events
Random article
About Wikipedia
Contact us
Donate
Contribute
Help
Learn to edit
Community portal
Recent changes
Upload file
Tools
What links here
Related changes
User contributions
User logs
View user groups
Upload file
Special pages
Permanent link
Page information
Print/export
Download as PDF
Printable version
Languages This page was last edited on 10 September 2019, at 15:33 (UTC).Text is available under the
Creative Commons Attribution-ShareAlike License
;
additional terms may apply. By using this site, you agree to the
Terms of Use
and
Privacy Policy
. Wikipedia® is a registered trademark of the
Wikimedia Foundation, Inc.
, a non-profit organization.
Privacy policy
About Wikipedia
Disclaimers
Contact Wikipedia
Mobile view
Developers
Statistics
Cookie statement