Return-Path: Delivered-To: apmail-incubator-rat-dev-archive@minotaur.apache.org Received: (qmail 44733 invoked from network); 16 Jun 2009 20:04:11 -0000 Received: from hermes.apache.org (HELO mail.apache.org) (140.211.11.3) by minotaur.apache.org with SMTP; 16 Jun 2009 20:04:11 -0000 Received: (qmail 91793 invoked by uid 500); 16 Jun 2009 20:04:23 -0000 Delivered-To: apmail-incubator-rat-dev-archive@incubator.apache.org Received: (qmail 91750 invoked by uid 500); 16 Jun 2009 20:04:23 -0000 Mailing-List: contact rat-dev-help@incubator.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: rat-dev@incubator.apache.org Delivered-To: mailing list rat-dev@incubator.apache.org Received: (qmail 91740 invoked by uid 99); 16 Jun 2009 20:04:23 -0000 Received: from athena.apache.org (HELO athena.apache.org) (140.211.11.136) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Jun 2009 20:04:23 +0000 X-ASF-Spam-Status: No, hits=2.2 required=10.0 tests=HTML_MESSAGE,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (athena.apache.org: domain of maka82@gmail.com designates 209.85.221.179 as permitted sender) Received: from [209.85.221.179] (HELO mail-qy0-f179.google.com) (209.85.221.179) by apache.org (qpsmtpd/0.29) with ESMTP; Tue, 16 Jun 2009 20:04:15 +0000 Received: by qyk9 with SMTP id 9so3324380qyk.32 for ; Tue, 16 Jun 2009 13:03:54 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=gamma; h=domainkey-signature:mime-version:received:date:message-id:subject :from:to:content-type; bh=668CevlQFpRBa8194eMY6pDYAX1EDU8s+Kk64nDjFNQ=; b=E5YCsLIBTo+PVgGz5q50nAqcPAKOznfz5BShiJ2kSmqXooBm7gDOIEzcNxsMwsXxgd /MwICSYofn07p90pyDKGilQQNS2VccsJjbsBMigr8jTe0XxTIwx3UkbCC27hVehpcrGt 8+BwmBSPblMR8cwRsYLWyO/xYhcTFCT42THGE= DomainKey-Signature: a=rsa-sha1; c=nofws; d=gmail.com; s=gamma; h=mime-version:date:message-id:subject:from:to:content-type; b=N+ht5klVcG2FJyKcHvCqKChuBR+fC4oQUbxtaRoOKQKSrr26Zb4ojNu0qCeUBf3O0V ZCGm+xfl81uOduBfKHdLsAZHeIM9kc/A/kXl2Ympd8wmpBb1WOVXYn6oUJbxxhsgXjKd bZ5L4W06lGZX88ofcfou5STPTFPA2KKYGFVWo= MIME-Version: 1.0 Received: by 10.220.90.199 with SMTP id j7mr6156708vcm.57.1245182612237; Tue, 16 Jun 2009 13:03:32 -0700 (PDT) Date: Tue, 16 Jun 2009 22:03:32 +0200 Message-ID: <5b553b550906161303i5678f901v8de6f410077cf6e8@mail.gmail.com> Subject: apache-rat-pd From: =?ISO-8859-2?Q?Marija_=A9ljivovi=E6?= To: rat-dev@incubator.apache.org Content-Type: multipart/alternative; boundary=0016e645b9be4b6359046c7cab31 X-Virus-Checked: Checked by ClamAV on apache.org --0016e645b9be4b6359046c7cab31 Content-Type: text/plain; charset=ISO-8859-1 Content-Transfer-Encoding: 7bit Hi! I am working on copy&paste(plagiarism) detector. You can see information about project and reports of my progress on this locations: http://wiki.apache.org/general/MarijaSljivovic/SoC2009ApacheRatProposal https://issues.apache.org/jira/browse/RAT-45 or get source code and binary distributions on: http://code.google.com/p/apache-rat-pd/ I think now to make some misspellings heuristic checkers. This algorithms will be able to notice some misspelled words in source code. Then this part of code will be sent to some of code search engines(GoogleCodeSearch for example) to check if it can find any similar misspellings in public code bases. On that way we can check possibility if code part is plagiarised. Now i search for an open source library which can be used for this task. I found one: jazzy ( http://jazzy.sourceforge.net/ ) and I think that it is good for this purpose. Any suggestion for other solution that is better then jazzy? Work on apache-rat-pd(plagiarism detector) is continuing. If you have any suggestions or advice, please say. Best regards, Marija --0016e645b9be4b6359046c7cab31--