Return-Path: X-Original-To: apmail-community-commits-archive@minotaur.apache.org Delivered-To: apmail-community-commits-archive@minotaur.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 72C0018BE7 for ; Wed, 6 Jan 2016 12:56:22 +0000 (UTC) Received: (qmail 9504 invoked by uid 500); 6 Jan 2016 12:56:22 -0000 Delivered-To: apmail-community-commits-archive@community.apache.org Received: (qmail 9478 invoked by uid 500); 6 Jan 2016 12:56:22 -0000 Mailing-List: contact commits-help@community.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: dev@community.apache.org Delivered-To: mailing list commits@community.apache.org Received: (qmail 9469 invoked by uid 99); 6 Jan 2016 12:56:22 -0000 Received: from Unknown (HELO spamd2-us-west.apache.org) (209.188.14.142) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 06 Jan 2016 12:56:22 +0000 Received: from localhost (localhost [127.0.0.1]) by spamd2-us-west.apache.org (ASF Mail Server at spamd2-us-west.apache.org) with ESMTP id C6C2F1A0955 for ; Wed, 6 Jan 2016 12:56:21 +0000 (UTC) X-Virus-Scanned: Debian amavisd-new at spamd2-us-west.apache.org X-Spam-Flag: NO X-Spam-Score: 0.446 X-Spam-Level: X-Spam-Status: No, score=0.446 tagged_above=-999 required=6.31 tests=[KAM_LAZY_DOMAIN_SECURITY=1, RP_MATCHES_RCVD=-0.554] autolearn=disabled Received: from mx1-us-east.apache.org ([10.40.0.8]) by localhost (spamd2-us-west.apache.org [10.40.0.9]) (amavisd-new, port 10024) with ESMTP id 38oGoH_T3RPa for ; Wed, 6 Jan 2016 12:56:21 +0000 (UTC) Received: from mailrelay1-us-west.apache.org (mailrelay1-us-west.apache.org [209.188.14.139]) by mx1-us-east.apache.org (ASF Mail Server at mx1-us-east.apache.org) with ESMTP id 00D22439C3 for ; Wed, 6 Jan 2016 12:56:21 +0000 (UTC) Received: from svn01-us-west.apache.org (svn.apache.org [10.41.0.6]) by mailrelay1-us-west.apache.org (ASF Mail Server at mailrelay1-us-west.apache.org) with ESMTP id 830FAE015D for ; Wed, 6 Jan 2016 12:56:20 +0000 (UTC) Received: from svn01-us-west.apache.org (localhost [127.0.0.1]) by svn01-us-west.apache.org (ASF Mail Server at svn01-us-west.apache.org) with ESMTP id 822913A00A4 for ; Wed, 6 Jan 2016 12:56:20 +0000 (UTC) Content-Type: text/plain; charset="utf-8" MIME-Version: 1.0 Content-Transfer-Encoding: 7bit Subject: svn commit: r1723308 - /comdev/projects.apache.org/scripts/cronjobs/parsecommitteeinfo.py Date: Wed, 06 Jan 2016 12:56:20 -0000 To: commits@community.apache.org From: sebb@apache.org X-Mailer: svnmailer-1.0.9 Message-Id: <20160106125620.822913A00A4@svn01-us-west.apache.org> Author: sebb Date: Wed Jan 6 12:56:20 2016 New Revision: 1723308 URL: http://svn.apache.org/viewvc?rev=1723308&view=rev Log: Detect syntax error in rdf:about for PMC RDFs Modified: comdev/projects.apache.org/scripts/cronjobs/parsecommitteeinfo.py Modified: comdev/projects.apache.org/scripts/cronjobs/parsecommitteeinfo.py URL: http://svn.apache.org/viewvc/comdev/projects.apache.org/scripts/cronjobs/parsecommitteeinfo.py?rev=1723308&r1=1723307&r2=1723308&view=diff ============================================================================== --- comdev/projects.apache.org/scripts/cronjobs/parsecommitteeinfo.py (original) +++ comdev/projects.apache.org/scripts/cronjobs/parsecommitteeinfo.py Wed Jan 6 12:56:20 2016 @@ -84,7 +84,7 @@ pmcs = {} pmcDataUrls = {} # id -> url # get PMC Data from /data/committees.xml -print("reading PMC Data (/data/committees.xml)") +print("Reading PMC Data (/data/committees.xml)") with open("../../data/committees.xml", "r") as f: xmldoc = minidom.parseString(f.read()) f.close() @@ -101,6 +101,8 @@ for loc in xmldoc.getElementsByTagName(' rdfxml = ET.fromstring(rdf) rdfdata = rdfxml[0] committeeId = rdfdata.attrib['{http://www.w3.org/1999/02/22-rdf-syntax-ns#}about'] + if re.match("https?:", committeeId): + print("ERROR: unexpected rdf:about value '%s' in '%s'" % (committeeId, url), file=sys.stderr) pmcDataUrls[committeeId] = url # transform PMC data RDF to json