Soft 404's Webmaster Tools

Get help with installation and running phpBB 3.0.x here. Please do not post bug reports, feature requests, or MOD-related questions here.
Scam Warning
Forum rules
END OF SUPPORT: 1 January 2017 (announcement)
sk8rgui
Registered User
Posts: 32
Joined: Tue Nov 09, 2010 6:06 pm

Soft 404's Webmaster Tools

Post by sk8rgui » Fri Apr 13, 2012 9:22 pm

I am trying to track down and find out what is causing soft 404 errors to appear in my Google Webmaster tools. Here is an example of one of the URLs throwing this error http://www.dadsdivorce.com/father_divor ... p?p=231973

Any ideas on how to fix this? Also, I have been searching and searching for a list of url query string parameters. What does the ?p= relate to? Does the p= denote a link to a particular post in the thread? Is there a list of what each parameter type is for?

Thanks for the help.

User avatar
stevemaury
Support Team Member
Support Team Member
Posts: 50561
Joined: Thu Nov 02, 2006 12:21 am
Location: The U.P.
Name: Steve
Contact:

Re: Soft 404's Webmaster Tools

Post by stevemaury » Fri Apr 13, 2012 10:53 pm

Works fine for me.
For REALLY good and VERY inexpensive hosting CLICK HERE

I can stop all your spam. I can upgrade or update your Board. PM or email me. (Paid support)

sk8rgui
Registered User
Posts: 32
Joined: Tue Nov 09, 2010 6:06 pm

Re: Soft 404's Webmaster Tools

Post by sk8rgui » Fri Apr 13, 2012 11:25 pm

stevemaury wrote:Works fine for me.
Yes the page loads fine, but why is it showing a soft 404 in webmaster tools?

Also, do you have any answer to my other question. What does the p= in the url stand for? Is there a list of what different variables mean? I know an f= in the url means forum = id#, but not sure about a lot of the others.

User avatar
stevemaury
Support Team Member
Support Team Member
Posts: 50561
Joined: Thu Nov 02, 2006 12:21 am
Location: The U.P.
Name: Steve
Contact:

Re: Soft 404's Webmaster Tools

Post by stevemaury » Fri Apr 13, 2012 11:44 pm

"p" is the post_id. Your other issue you need to ask wherever support for webmaster tools is.
For REALLY good and VERY inexpensive hosting CLICK HERE

I can stop all your spam. I can upgrade or update your Board. PM or email me. (Paid support)

CaNNon_
Registered User
Posts: 392
Joined: Wed Apr 29, 2009 2:07 am

Re: Soft 404's Webmaster Tools

Post by CaNNon_ » Sat Apr 14, 2012 12:23 am

Try the url with "Fetch as Googlebot tool", might give up a clue.
You'll find soft 404 mentioned in the help in the tools page not much info but worth a read. ;)

Pony99CA
Registered User
Posts: 4783
Joined: Thu Sep 30, 2004 3:13 pm
Location: Hollister, CA
Name: Steve
Contact:

Re: Soft 404's Webmaster Tools

Post by Pony99CA » Sat Apr 14, 2012 12:53 am

sk8rgui wrote:Also, I have been searching and searching for a list of url query string parameters. What does the ?p= relate to? Does the p= denote a link to a particular post in the thread? Is there a list of what each parameter type is for?
To elaborate on what Steve said, the number in p=[number] is the post_id (from the phpbb_posts table). It's often followed by #p[number] (at the end of the URL) to force the browser to jump to that post on the page.

That number is independent of the topic (not "thread"). In other words, if you have you have a post_id of X in multiple topics, they'll be the exact same post. (I'm not sure if that can happen, but with copying or moving and leaving a shadow, it might.)

Steve
Silicon Valley Pocket PC (http://www.svpocketpc.com)
Creator of manage_bots and spoof_user (ask me)
Need hosting for a small forum with full cPanel & MySQL access? Contact me or PM me.

sk8rgui
Registered User
Posts: 32
Joined: Tue Nov 09, 2010 6:06 pm

Re: Soft 404's Webmaster Tools

Post by sk8rgui » Sat Apr 14, 2012 2:44 am

Pony99CA wrote:
sk8rgui wrote:Also, I have been searching and searching for a list of url query string parameters. What does the ?p= relate to? Does the p= denote a link to a particular post in the thread? Is there a list of what each parameter type is for?
To elaborate on what Steve said, the number in p=[number] is the post_id (from the phpbb_posts table). It's often followed by #p[number] (at the end of the URL) to force the browser to jump to that post on the page.

That number is independent of the topic (not "thread"). In other words, if you have you have a post_id of X in multiple topics, they'll be the exact same post. (I'm not sure if that can happen, but with copying or moving and leaving a shadow, it might.)

Steve
So, would it be a bad idea to noindex, nofollow pages that contain a url with "p=" in it? It seems like it could raise a duplicate content issue with search engines, but I'm not 100% sure.

User avatar
Oyabun1
Former Team Member
Posts: 23162
Joined: Sun May 17, 2009 1:05 pm
Location: Australia
Name: Bill

Re: Soft 404's Webmaster Tools

Post by Oyabun1 » Sat Apr 14, 2012 3:12 am

Since the post_id for each post is unique if you have 100 links to 100 different topics they would all be unique with no duplication.

All the major search engines seem to have no problem correctly indexing phpBB.

I would say you would be better to spend your time on creating unique content for your site rather than worrying about tweaks that are of questionable SEO value and far less benefit for members.
                      Support Request Template
3.0.x: Knowledge Base Styles Support MOD Requests
3.1.x: Knowledge BaseStyles SupportExtension Requests

sherya6
Registered User
Posts: 21
Joined: Sat May 21, 2011 8:11 pm
Contact:

Re: Soft 404's Webmaster Tools

Post by sherya6 » Mon Jul 02, 2012 11:22 am

I too have got the same message from GWT. So somebody please tell me what all parameters can be set to be ignored by Google. I know of two: p and sid.

User avatar
Oyabun1
Former Team Member
Posts: 23162
Joined: Sun May 17, 2009 1:05 pm
Location: Australia
Name: Bill

Re: Soft 404's Webmaster Tools

Post by Oyabun1 » Mon Jul 02, 2012 12:25 pm

As previously stated, for what parameters to set in a third party tool you need to ask on a support site for that tool.
                      Support Request Template
3.0.x: Knowledge Base Styles Support MOD Requests
3.1.x: Knowledge BaseStyles SupportExtension Requests

User avatar
AmigoJack
Registered User
Posts: 5599
Joined: Tue Jun 15, 2010 11:33 am
Location: グリーン ヒル ゾーン
Contact:

Re: Soft 404's Webmaster Tools

Post by AmigoJack » Mon Jul 02, 2012 12:29 pm

sk8rgui's link gives a HTTP status 200 when visiting with a Googlebot/ user agent, saying that authorization is missing. In other words: the board group Bots has no view permissions - this is what Google might classify as "soft 404".

Since I see this as a bug (a HTTP status 403 should be issued) I created ticket 10961.
The worst thing about censorship is ███████████
Affin wrote:
Tue Nov 20, 2018 9:51 am
The problem is probably not my English but you do not want to understand correctly.
...
We will not come anybody anyway, nevertheless, it's best to shit this.

CaNNon_
Registered User
Posts: 392
Joined: Wed Apr 29, 2009 2:07 am

Re: Soft 404's Webmaster Tools

Post by CaNNon_ » Mon Jul 02, 2012 2:18 pm

Nice catch AmigoJack

User avatar
SidV
Registered User
Posts: 16
Joined: Sun Aug 01, 2004 12:13 am

Re: Soft 404's Webmaster Tools

Post by SidV » Thu Aug 01, 2013 12:40 am

AmigoJack wrote:sk8rgui's link gives a HTTP status 200 when visiting with a Googlebot/ user agent, saying that authorization is missing. In other words: the board group Bots has no view permissions - this is what Google might classify as "soft 404".

Since I see this as a bug (a HTTP status 403 should be issued) I created ticket 10961.
Hello AmigoJack, I don't think this is the case.
Google don't put 404 because has no view permissions here.
Google puts it because see that header code as server response.

Take this example in this forum:
If you (or someone else) go to:
viewtopic.php?f=26&t=2192160
Will see a real page (don't 404) that said:
The requested topic does not exist.
But; if Googlebot/ goes to the same URL, will see 404 as server response.
Why? Well, I don't know. If you don't believe me, you could see it as yourself (use internet tools to see servers response). I attach my screenshoot:
CS1.jpg
How GoogleBot see a deleted topic
CS1.jpg (50.93 KiB) Viewed 2214 times
Well, about codes.
I found an interesting thread in other forum that said (that) the problem it's in the viewforum.php file:

Code: Select all

// Make sure $start is set to the last page if it exceeds the amount
if ($start < 0 || $start > $topics_count)
{
   $start = ($start < 0) ? 0 : floor(($topics_count - 1) / $config['topics_per_page']) * $config['topics_per_page'];
}
And the solution will be replace it with:

Code: Select all

// Make sure $start is set to the last page if it exceeds the amount
if ($start < 0 || $start > $topics_count)
{
    header("HTTP/1.0 404 Not Found");
    trigger_error('The page does not exist');
}
What do you think?
Can you test it?

Source; not spam:

Code: Select all

http://www.phpbb-seo.com/en/phpbb-seo-mods/article8723.html#p41361
Regards,
Sid

User avatar
Oyabun1
Former Team Member
Posts: 23162
Joined: Sun May 17, 2009 1:05 pm
Location: Australia
Name: Bill

Re: Soft 404's Webmaster Tools

Post by Oyabun1 » Thu Aug 01, 2013 2:29 am

SidV wrote:Hello AmigoJack, I don't think this is the case.
Google don't put 404 because has no view permissions here.
Google puts it because see that header code as server response.

Take this example in this forum:
If you (or someone else) go to:
viewtopic.php?f=26&t=2192160
Your example is a different situation. If you or a bot goes to a page that doesn't exist the board correctly returns a 404 Not Found status. The user agent could correctly connect to the server, but the server could not find the requested item, so 404 status is returned, that is as it should be.

In the OP's case the topic existed, but the Bots group did not have permission to see it and phpBB is, incorrectly, returning a 200 OK status, rather than a 403 Forbidden status.
                      Support Request Template
3.0.x: Knowledge Base Styles Support MOD Requests
3.1.x: Knowledge BaseStyles SupportExtension Requests

User avatar
SidV
Registered User
Posts: 16
Joined: Sun Aug 01, 2004 12:13 am

Re: Soft 404's Webmaster Tools

Post by SidV » Thu Aug 01, 2013 4:04 am

Thanks Oyabun1 for your quick replay.
Now I could understand better the 404 response case.

But what about the solution idea posted?
If phpbb trigger_error('The page does not exist');
will google fetch anyway ?

What do you think?

Locked

Return to “[3.0.x] Support Forum”