As we already know, Google tends to call "duplicate content" those blocks with content of considerable size that completely coincide, or in any case, are very similar to others that are in the same domain or in any other website. For the most part, it is not malicious, including those that we will indicate below:
1 - In those Forums that serve for debate that generate both standard or simplified pages for mobile devices.
2 - Those stored elements that are displayed or that appear linked by URLs that are different.
3 - Different versions to print of the web pages.
It should be noted that if your website has multiple pages with largely identical content, different methods exist and are available to tell Google your preferred URL (this practice is called "canonicalization"). You can click here to get more information about it.
Although we have already clarified one of the main points that Google takes into account for its search engine, we must also highlight that in some cases the content is deliberately duplicated in several domains as an attempt to manipulate the positioning of a website in different search engines or to achieve increased traffic. Google recognizes this type of deceptive practices and knows that they harm the experience of users, since they will see the same content repeated in different search results causing the quality of the content to be not good. It is therefore that it is tried that the pages that sample and index have different content.
Through this filtering, for example, if your site has a version marked as "normal" and for printing of each article, and none of these versions is blocked with a noindex meta tag, the engines will choose one of them to include it in their index. For those cases in which the engines detect that duplicate content is being displayed to manipulate a positioning in the search terms and deceive users, the appropriate adjustments will also be applied to the indexing and positioning of the sites that are involved. This practice will result in a clear impact on positioning. It is also very likely that a site will be removed from the Google index so that it no longer appears among the different search results.
Anyway, nothing is lost as it is known that there are some steps the creator can take to address duplicate content issues in advance and ensure that users who visit the website can see the content that the webmaster wants to display.
1 - First we must Use 301 redirects: if your site has been restructured, you must use the 301 redirects ("RedirectPermanent") in the .htaccess file that will be very useful to intelligently redirect both users and Googlebot and other spiders .
2 - Google also tells us to "be consistent": you must ensure that internal links "are consistent". To serve as an example, you should not link to http://www.example.com/pagina/, http://www.example.com/pagina and http://www.example.com/pagina/index .htm.
3 - Use top-level domains: it is extremely important, whenever possible, to use this type of domains to manage content directed to specific countries and thus you will be helping search engines to show the version of the documents most appropriate for each case. For example, Google is more likely to know that http://www.example.de contains content focused on Germany than using http://www.example.com/de or http://de.example.com.
4 - It is important to distribute the content with extreme caution: if the creator is distributing their content on other sites, the engines will always show the version that they consider most appropriate for users in each specific search. We must know that it is useful to ensure that each site that distributes your content (usually Social Networks) includes a link that points to the original entry. It would also be good to advise those who use the distributed material to use the noindex meta tag to prevent search engines from indexing their version of that content.
5 - Reduce the repetition of templates: this is good to use in those cases that we are going to include a long text on copyright at the bottom of each page, it would be more strategic to put a brief summary and a link to a page that contains "more information". Also, you can use a parameter handling tool to specify how you want Google to handle the URL parameters.