Return-Path: X-Original-To: apmail-jmeter-user-archive@www.apache.org Delivered-To: apmail-jmeter-user-archive@www.apache.org Received: from mail.apache.org (hermes.apache.org [140.211.11.3]) by minotaur.apache.org (Postfix) with SMTP id 050BA10138 for ; Wed, 4 Sep 2013 16:20:31 +0000 (UTC) Received: (qmail 53959 invoked by uid 500); 4 Sep 2013 16:20:30 -0000 Delivered-To: apmail-jmeter-user-archive@jmeter.apache.org Received: (qmail 53858 invoked by uid 500); 4 Sep 2013 16:20:25 -0000 Mailing-List: contact user-help@jmeter.apache.org; run by ezmlm Precedence: bulk List-Help: List-Unsubscribe: List-Post: List-Id: Reply-To: "JMeter Users List" Delivered-To: mailing list user@jmeter.apache.org Received: (qmail 53848 invoked by uid 99); 4 Sep 2013 16:20:23 -0000 Received: from nike.apache.org (HELO nike.apache.org) (192.87.106.230) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Sep 2013 16:20:23 +0000 X-ASF-Spam-Status: No, hits=1.5 required=5.0 tests=HTML_MESSAGE,RCVD_IN_DNSWL_LOW,SPF_PASS X-Spam-Check-By: apache.org Received-SPF: pass (nike.apache.org: domain of jordicarretero@gmail.com designates 209.85.217.173 as permitted sender) Received: from [209.85.217.173] (HELO mail-lb0-f173.google.com) (209.85.217.173) by apache.org (qpsmtpd/0.29) with ESMTP; Wed, 04 Sep 2013 16:20:17 +0000 Received: by mail-lb0-f173.google.com with SMTP id o14so615564lbi.18 for ; Wed, 04 Sep 2013 09:19:57 -0700 (PDT) DKIM-Signature: v=1; a=rsa-sha256; c=relaxed/relaxed; d=gmail.com; s=20120113; h=mime-version:in-reply-to:references:date:message-id:subject:from:to :content-type; bh=aBvrnS6AC6mPvsYKIlqVOo+GfEjma0s0+dBJHcaHcQE=; b=jC+yf7ECZ0WZhNoSTnIn3Kx/cTgZP5vLVMsjd+IRsRKWdt5Q0h3TPyun098lLk0quD fzEegKM+zR/fU7Ft0veRkoWamPix5uI2/NBAMMt4J5+fnzjomcbTXKLkEYH+b4CPyzOk mTlDDBomKwG8cx5aIm818QZ152ZW7lOp5k1wST2EHYuYvkyGQSrObKUrdA3wnUoCiXb5 Nrs4yYdkilOt7wUiug7GHMMGQXUwdK3Psdm21MOIR/4zk0vzcoVfVYxyh6zWihfXdEvU zKiqyL5m/2dRfSiWQyjoBzAJGo1XpI6HFJmjzgel0m9QbJAQD6Ub63TdTggVtcxCg5bC 6exA== MIME-Version: 1.0 X-Received: by 10.112.128.166 with SMTP id np6mr3129925lbb.7.1378311596985; Wed, 04 Sep 2013 09:19:56 -0700 (PDT) Received: by 10.112.161.100 with HTTP; Wed, 4 Sep 2013 09:19:56 -0700 (PDT) In-Reply-To: References: Date: Wed, 4 Sep 2013 18:19:56 +0200 Message-ID: Subject: Re: Regular expression extractor for spider From: Jordi Carretero To: JMeter Users List Content-Type: multipart/alternative; boundary=047d7b343ef220d13e04e5912dc1 X-Virus-Checked: Checked by ClamAV on apache.org --047d7b343ef220d13e04e5912dc1 Content-Type: text/plain; charset=ISO-8859-1 Thanks Sebb, That vas very ilustrative for me and helped to find the solution: wrote: > On 3 September 2013 19:08, Jordi Carretero > wrote: > > Hi > > > > I'm building a spider using a regular expression extractor and a > for-each- > > controller and works pretty well but.. > > > > I'm using well > > to extract links like: > > > > > > > but I can not find any expression that will work at the same time for > > expressions found in some sites like: > > > > > > > that include the full domain at the beginning (and has to be removed) > > > > It's a matter of working with the perl expression but after some days I > > could not manage to make it work, so any help will be appreciated > > If you want to ignore an optional string, use something like: > > (?:http://www\.mysite\.es)? > > The form (abc)? means abc or nothing; the (?:) form means don't save > the contents. > > In your case, if you want to ignore both ".", ".." and > "http:/www.mysite.es" you could use: > > (?:http://www\.mysite\.es|\.\.?)? > > BTW, rather than use "[.]" to escape the meta-character ".", the usual > method is "\.". > > > Thanks > > --------------------------------------------------------------------- > To unsubscribe, e-mail: user-unsubscribe@jmeter.apache.org > For additional commands, e-mail: user-help@jmeter.apache.org > > --047d7b343ef220d13e04e5912dc1--