Moz Q&A is closed.
After more than 13 years, and tens of thousands of questions, Moz Q&A closed on 12th December 2024. Whilst we’re not completely removing the content - many posts will still be possible to view - we have locked both new posts and new replies. More details here.
Help with facet URLs in Magento
-
Hi Guys,
Wondering if I can get some technical help here...
We have our site britishbraces.co.uk , built in Magento. As per eCommerce sites, we have paginated pages throughout.
These have rel=next/prev implemented but not correctly ( as it is not in is it in ) - this fix is in process.
Our canonicals are currently incorrect as far as I believe, as even when content is filtered, the canonical takes you back to the first page URL. For example,
http://www.britishbraces.co.uk/braces/x-style.html?ajaxcatalog=true&brand=380&max=51.19&min=31.19
Canonical to...
http://www.britishbraces.co.uk/braces/x-style.html
Which I understand to be incorrect.
As I want the coloured filtered pages to be indexed ( due to search volume for colour related queries ), but I don't want the price filtered pages to be indexed - I am unsure how to implement the solution?
As I understand, because rel=next/prev implemented ( with no View All page ), the rel=canonical is not necessary as Google understands page 1 is the first page in the series.
Therefore, once a user has filtered by colour, there should then be a canonical pointing to the coloured filter URL? ( e.g. /product/black )
But when a user filters by price, there should be noindex on those URLs ? Or can this be blocked in robots.txt prior?
My head is a little confused here and I know we have an issue because our amount of indexed pages is increasing day by day but to no solution of the facet urls.
Can anybody help - apologies in advance if I have confused the matter.
Thanks
-
Hi Lewis,
Firstly thank you for taking your time to respond in depth to my question.
Since reading your response, I have done the following...
Identified the parameters that should NOT be indexed, these are; 'brand=', 'min=' and 'max='
The colour filter 'colour=' is to be kept indexed. I have reviewed the website and found that users cannot currently select to filter more than on colour, which eliminates Google from indexing multiple colour filters in one URL.
However, users can still filter by colour and brand, hence why I have requested ours devs to meta noindex any URL that contains the 'brand=' parameter as well as any URLs that have the 'min/max=' parameters as these are price filters.
I have also requested rel=next/prev to be implemented correctly.
The above should drastically reduce our indexed content.
As well as this, I have added the following parameters into Search Consoles' URL Parameter tool as 'No Crawl', 'brand, min, max' - although I understand this is not a guaranteed fix, it was my first option with no immediate dev time over the weekend.
Now the only URLs in need of a canonical is the colour filtered URLs as 'brand, min max' are all noindex. I have asked dev to ensure the canonical points back to page 1 for now, however I am looking into a view-all page option so the canonical would point to that.
A good learning curve all of this!
-
There is a big difference between robots.txt and no index
"Therefore, once a user has filtered by colour, there should then be a canonical pointing to the coloured filter URL? ( e.g. /product/black )
But when a user filters by price, there should be noindex on those URLs Or can this be blocked in robots.txt prior?"
See http://i.imgur.com/114BHcR.png
You need to use a no index tag not robots.txt ideally with a secular canonical pointing to the product.
Please see references one and two below. There are larger versions of the photos below as well
You need to run your site through deep crawl and or screaming frog SEO spider If you would be kind enough to give me the URL privately or publicly I will run a deep crawl and SEO spider
** This topic is difficult to explain without using the ability to show videos and images inside the box while describing this. That's why I recommend you view this YouTube video and slide share.**
Deep crawl is fantastic at solving these issues it has done this for other magenta clients of mine, and I strongly recommend utilizing what you've learned from that webinar and the other references below.
please see one and two below
- https://www.deepcrawl.com/knowledge/webinars/masterclass-webinar-faceted-navigation-for-seo/
- https://www.stonetemple.com/seo-tags-virtual-keynote-with-gary-illyes-and-eric-enge/
-
https://webmasters.googleblog.com/2014/02/faceted-navigation-best-and-5-of-worst.html
-
https://moz.rainyclouds.online/blog/building-faceted-navigation-that-doesnt-suck
-
http://searchengineland.com/google-offers-advice-faceted-navigation-infinite-scroll-web-pages-184232
larger versions of the images
I agree with Lewis's recommendation for an extension and have added a couple more.
- http://www.mageworx.com/magento-2-seo-extension.html
- https://ecommerce.aheadworks.com/magento-extensions/ultimate-seo-suite.html
- https://ecommerce.aheadworks.com/magento-2-extensions/layered-navigation
I Hope this helps,
Thomas
78tExl8.png nMrYeUWlslY xJeFTbY.jpg wOHxaEE.jpg QprPUyk.jpg 114BHcR.png
-
Hi!
We do a lot of consultancy for Magento projects and this is a question that comes up quite regularly as it can't really be handled perfectly straight out of the box with Magento.
Every implementation is a little bit different, but I'll put together some recommendations below based on the information available at the moment.
For your faceted navigation, you ideally don't want to index any of these pages, unless you believe that you'll rank in your own right for specific filters (e.g. Colour, like you pointed out in your last message).
That then comes with some additional complications. In Magento, if you have 3 colours available in the faceted nav, you'll have all the different variations indexed in each combination.
For example:
Blue
Black
RedBlue + Black
Blue + Red
Black + Red
Black + Blue
Red + Blue
Red + BlackMagento as standard doesn't always keep the filters in the same order, so you can end up with literally thousands of pages ending up in the index for a relatively small number of attributes being shown on your pages.
There are a few recommendations here:
- Go and look at the MageWorx Ultimate SEO Suite Plugin - http://www.mageworx.com/seo-suite-ultimate-magento-extension.html - For $249, it solves a lot of issues Magneto has straight out of the box and gives you ultimate control over your meta titles.
What you want to do is set all of your facets to 'NOINDEX,FOLLOW' where possible. This will reduce the number of URLs in the index gradually. An example of this would be adding ?min=* and mode=* etc (grid/list variants).
- For your canonicals, you're probably best setting the canonical to the current filtered page (for example, if you're on a category page with colour = blue selected in your faceted nav, you'd have this URL as your canonical). Some sites we work on have it setup so the canonical points to the category URL (like you currently have).
Finally, you probably want to build an extension to allow you to inject content into the filtered content pages. If you're using an extension like ManaDev for your facet navigation, this can be achieved fairly easily and allows you to add a block of text to each filter applied on a page.
You should also look to request each of the incorrectly indexed URLs is removed from the index (although this does take a long time if you have a lot!).
We wrote a really long guide around launching a Magento website last month which may be of interest - https://www.pinpointdesigns.co.uk/the-definitive-guide-to-launching-a-magento-website/. We've also done a guide on Common Magento SEO Issues here - https://www.pinpointdesigns.co.uk/common-magento-seo-issues/ and I previously wrote a guide on setting Magento up for Search Engines on Moz - https://moz.rainyclouds.online/ugc/setting-up-magento-for-the-search-engines (Although this is likely to be a little outdated now)
I hope this helps!
Lewis
Got a burning SEO question?
Subscribe to Moz Pro to gain full access to Q&A, answer questions, and ask your own.
Browse Questions
Explore more categories
-
Moz Tools
Chat with the community about the Moz tools.
-
SEO Tactics
Discuss the SEO process with fellow marketers
-
Community
Discuss industry events, jobs, and news!
-
Digital Marketing
Chat about tactics outside of SEO
-
Research & Trends
Dive into research and trends in the search industry.
-
Support
Connect on product support and feature requests.
Related Questions
-
My url disappeared from Google but Search Console shows indexed. This url has been indexed for more than a year. Please help!
Super weird problem that I can't solve for last 5 hours. One of my urls: https://www.dcacar.com/lax-car-service.html Has been indexed for more than a year and also has an AMP version, few hours ago I realized that it had disappeared from serps. We were ranking on page 1 for several key terms. When I perform a search "site:dcacar.com " the url is no where to be found on all 5 pages. But when I check my Google Console it shows as indexed I requested to index again but nothing changed. All other 50 or so urls are not effected at all, this is the only url that has gone missing can someone solve this mystery for me please. Thanks a lot in advance.
Intermediate & Advanced SEO | | Davit19850 -
Will disallowing URL's in the robots.txt file stop those URL's being indexed by Google
I found a lot of duplicate title tags showing in Google Webmaster Tools. When I visited the URL's that these duplicates belonged to, I found that they were just images from a gallery that we didn't particularly want Google to index. There is no benefit to the end user in these image pages being indexed in Google. Our developer has told us that these urls are created by a module and are not "real" pages in the CMS. They would like to add the following to our robots.txt file Disallow: /catalog/product/gallery/ QUESTION: If the these pages are already indexed by Google, will this adjustment to the robots.txt file help to remove the pages from the index? We don't want these pages to be found.
Intermediate & Advanced SEO | | andyheath0 -
URL Injection Hack - What to do with spammy URLs that keep appearing in Google's index?
A website was hacked (URL injection) but the malicious code has been cleaned up and removed from all pages. However, whenever we run a site:domain.com in Google, we keep finding more spammy URLs from the hack. They all lead to a 404 error page since the hack was cleaned up in the code. We have been using the Google WMT Remove URLs tool to have these spammy URLs removed from Google's index but new URLs keep appearing every day. We looked at the cache dates on these URLs and they are vary in dates but none are recent and most are from a month ago when the initial hack occurred. My question is...should we continue to check the index every day and keep submitting these URLs to be removed manually? Or since they all lead to a 404 page will Google eventually remove these spammy URLs from the index automatically? Thanks in advance Moz community for your feedback.
Intermediate & Advanced SEO | | peteboyd0 -
URL mapping for site migration
Hi all! I'm currently working on a migration for a large e-commerce site. The old one has around 2.5k urls, the new one 7.5k. I now need to sort out the redirects from one to the other. This is proving pretty tricky, as the URL structure has changed site wide. There doesn't seem to be any consistent rules either so using regex doesn't really work. By and large, the copy appears to be the same though. Does anybody know of a tool I can crawl the sites with that will export the crawled url and related copy into a spreadsheet? That way I can crawl both sites and compare the copy to match them up. Thanks!
Intermediate & Advanced SEO | | Blink-SEO0 -
If I own a .com url and also have the same url with .net, .info, .org, will I want to point them to the .com IP address?
I have a domain, for example, mydomain.com and I purchased mydomain.net, mydomain.info, and mydomain.org. Should I point the host @ to the IP where the .com is hosted in wpengine? I am not doing anything with the .org, .info, .net domains. I simply purchased them to prevent competitors from buying the domains.
Intermediate & Advanced SEO | | djlittman0 -
301 Redirection and apostrophes in URLs
Hi I am experiencing trouble getting any redirects with apostrophes in the URLs to 301 redirect in order to eliminate 404 errors. I have tried replacing the instance of the apostrophe in the source URL field to %27 and variations of this but to no avail. The site is a wordpress site (the old URLS are legacies from the old Business Catalyst site) and I am using the redirection plug in. I have gone into some detail with a helpful soul here http://wordpress.org/support/topic/how-to-deal-with-apostrophes-in-source-url but unfortunately to no result. If anyone has any idea how to solve this puzzle I would be grateful for the help. Example: http://www.tesselaars.com/blog/Inside_Flowers/post/Online_Marketing_for_Florists_Part_1%E2%80%93_A_Website_You_Won%27t_Regret/
Intermediate & Advanced SEO | | Seamoose0 -
Crazy long weird URLs... help
I have a HTML website, mysite1.com, and I placed a link on the home page to another one of my sites, mysite2.com Today I checked the links to mysite2.com in Majestic and noticed 24 links coming from the mysite1.com instead of just one link. The URLs from mysite1.com that are showing in Majestic are like this mysite1.com/?epl=4donafvFK3fMXxZXMWQRQLodmPchoXCK5C7-kbBv_agkwlkJrZAoaSDVUlhqFmUqt0f8c2Q6jF6GO6DNMnbidqRsikriF-IEBEt5okmICLEB0FxP36GrsxoPGQ3SGBo1PVR7itDUA4CYmjypn5gi mysite1.com,was inherited from a friend and I believe that it was originally built in Frontpage. Can you tell me how I can get rid of these multiple links as I only want 1 showing from the home page Thanks in advance
Intermediate & Advanced SEO | | JohnPeters0 -
Magento: URLs for Products in Multiple Categories
I am working in Magento to build out a large e-commerce site with several thousand products. It's a great platform, but I have run into the issue of what it does to URLs when you put a product into multiple categories. Basically, "a book" in two categories would make two URLs for one product: 1) /books/a-book 2) author-name/a-book So, I need to come up with a solution for this. It seems I have two options: Found this from a Magento SEO article: 'Magento gives you the ability to add the name of categories to path for product URL's. Because Magento doesn't support this functionality very well - it creates duplicate content issues - it is a very good idea to disable this. To do this, go to System => Configuration => Catalog => Search Engine Optimization and set "Use categories path for product URL's to "no".' This would solve the issues and be a quick fix, but I think it's a double edged sword, because then we lose the SEO value of our well named categories being in the URL. Use Canonical tags. To be fair, I'm not even sure this is possible. Even though it is creating different URLs and, thus, poses a risk of "duplicate content" being crawled, there really is only one page on the admin side. So, I can't go to all of the "duplicate" pages and put a canonical tag, because those duplicate pages don't really exist on the back-end. Does that make sense? After typing this out, it seems like the best thing to do probably will be to just turn off categories in the URL from the admin side. However, I'd still love any input from the community on this. Thanks!
Intermediate & Advanced SEO | | Marketing.SCG0