From: John Mueller
Date: Tue, 12 May 2020 11:56:55 +0200
Subject: Re: Critical issue on forum.openoffice.org and Google Search
To: Peter Kovacs
Cc: "dev@openoffice.apache.org"

Hi Peter

It looks like Google's infrastructure for crawling the web can't access any URLs at all from forum.openoffice.org, including the homepage. Sometimes this is due to a firewall or abuse protection system recognizing these requests as malicious. Over time, as we attempt to update the pages in the search results by crawling URLs from the site, if we see that we can't access them at all, they generally get removed from our search results. In practice, this means that users won't be able to find your pages in Google Search.
Sometimes websites do that on purpose, if they don't want to be found in search; I suspect it's more of an accident here.

A simple way to test is to use https://search.google.com/test/mobile-friendly to check URLs from your site (better would be to use https://support.google.com/webmasters/answer/9012289 , though that would require verification of the site in Google Search Console first).

Hope this helps!
John

On Tue, May 12, 2020 at 10:33 AM Peter Kovacs wrote:

> Hello Mr Mueller,
>
> The forum.openoffice.org is our support forum. When people have issues
> they are often directed to this page for solutions.
>
> Do you have a list of URLs Googlebot has not been able to crawl? We can then
> check if the behavior is intended or not and we can tell you the reason for
> this measure.
>
> I am not particularly skilled with the Google search engine. I do not
> understand the sentence:
>
> This will cause those pages to drop out of Google's search results, and
> will prevent new pages from being picked up for Search.
>
> Can you explain this with an example please?
>
> Thanks for the support.
>
> All the best
>
> Peter
>
> On 11.05.20 at 13:37, John Mueller wrote:
>
> Dear webmaster of forum.openoffice.org
>
> I'm an analyst at Google in Switzerland. We wanted to bring your attention
> to a critical issue with your website, and how it's available for Google's
> web search.
>
> In particular, Googlebot has been unable to crawl URLs from
> https://forum.openoffice.org/ . This will cause those pages to drop out
> of Google's search results, and will prevent new pages from being picked up
> for Search. If you're not aware of this issue, you may be accidentally
> blocking these pages from Google Search due to a server issue. If you need
> to block Googlebot from crawling pages on your website, we'd recommend
> using the robots.txt file instead.
>
> Should you need to recognize IP addresses of Googlebot requests, you can
> use a reverse IP lookup to do so:
> https://support.google.com/webmasters/answer/80553
>
> Should you have any questions, feel free to contact me directly. For
> verification purposes, we are sending a copy of this message to your site's
> Search Console account.
>
> Thank you,
> John Mueller (johnmu@google.com)
> Webmaster Trends Analyst
>
> --
> John Mueller, He/Him, Search Relations Team - go/search-rel
> WTA is now Search-Rel (info)
> *Time-critical? Resend with "URGENT" in the subject.*
>
> Google Switzerland GmbH
> Gustav-Gull-Platz 1, 3. Stock
> 8004 Zurich, Switzerland
>
> Identification number: CH-020.4.028.116-1
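
For anyone checking the firewall or abuse-protection rules on the forum server, the reverse IP lookup mentioned in the quoted message could be sketched roughly as follows. This is a minimal sketch under stated assumptions, not Google's own tooling: the function names, the injectable lookup parameters, and the accepted hostname suffixes (`.googlebot.com` and `.google.com`) are illustrative choices, so please check the suffix list against the help page linked above before relying on it. The idea is to resolve the IP to a hostname, check the hostname's domain, then resolve that hostname forward again and confirm it maps back to the same IP.

```python
import socket

# Assumption: hostname suffixes treated as Google crawler domains.
# Verify against the current help page before using this in a firewall rule.
GOOGLE_CRAWLER_SUFFIXES = (".googlebot.com", ".google.com")


def hostname_is_google(hostname: str) -> bool:
    """Return True if the hostname ends in one of the assumed Google suffixes."""
    return hostname.rstrip(".").lower().endswith(GOOGLE_CRAWLER_SUFFIXES)


def is_verified_googlebot(
    ip: str,
    reverse_lookup=socket.gethostbyaddr,
    forward_lookup=socket.gethostbyname_ex,
) -> bool:
    """Reverse-resolve the IP, check the domain, then forward-confirm it.

    The lookup functions are parameters only so the logic can be exercised
    without network access; the defaults are the standard library calls.
    Note that socket.gethostbyname_ex handles IPv4 only.
    """
    try:
        hostname, _aliases, _addresses = reverse_lookup(ip)
    except OSError:
        # No PTR record or resolver failure: cannot verify.
        return False

    if not hostname_is_google(hostname):
        return False

    try:
        _name, _aliases, addresses = forward_lookup(hostname)
    except OSError:
        return False

    # The forward lookup must map back to the IP we started from.
    return ip in addresses
```

A request whose User-Agent claims to be Googlebot but whose source address fails this check can be treated like any other client; one that passes should probably be exempted from the blocking rule that is currently rejecting the crawler.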