I am doing SEO ( Search Engine Optimization ) for this site. Since what I have now is only about an average of 100 visitors from Google Daily. And upon checking some Google stuff and just found out some unusual thing happening at my pages.
Anybody experiencing this kind of unusual string added at the end of your post Url? [ ?wpcf7=json ] at Google result . According to Google it has indexed about 57 pages out of 357 Indexed by Google is having an unusual string like this [ ?wpcf7=json ]. Below figure shows how I get the details of this. ( I just type site:techathand.net + ?wpcf7=json ) as shown in below captured image.
Actually there are no much specific information about this string but I believe there are lots of blog infected by this scenario.
Upon searching I and research I found the following information which might be of help to those who have the same problem.
What is wpcf7 ?
Upon googling the word wpcf7 most of the associated result is showing from the plug-in Contact Form 7 although it is not installed at my site at the moment it was one of the plug-in that I tested for this site. And it does not work well with me that I have decided to change it for a different plug-in. [ Contact Form 7 ]
What is json ?
According to wikipedia
JSON (JavaScript Object Notation) (pronounced /?d?e?s?n/, like Jason) is a lightweight computer data interchange format. It is a text-based, human-readable format for representing simple data structures and associative arrays (called objects). The JSON format is specified in RFC 4627 by Douglas Crockford. The official Internet media type for JSON is
application/json
.The JSON format is often used for transmitting structured data over a network connection in a process called serialization. Its main application is in Ajax web application programming, where it serves as an alternative to the traditional use of the XML format.
What about WordPress support ?
Upon searching I found that there is only one thread about this topics here. The thread does not show the cure for this thing so I am thinking that based on one of its comment that it is not harmful.
Is it harmful to SEO ?
I believe it is, since it is taken as a duplicate content . An example is https://www.techathand.net and https://www.techathand.net/?wpcf7=json both of them is cached by google and both of them show same content and we all know that it is not good to have a duplicate content from Google point of view.
Is it bad for my site ?
In an SEO point of view yes it is. But other than that I don’t have any idea. Actually I don’t have particular answer for this question. But I did a little experiment and it goes like this :
I choose two site which I know is also doing SEO in their site and I tried typing their URL + ?wpcf7=json string and find what will be the result. And the result is as follows :
I tried http://www.yugatech.com/blog/?wpcf7=json and it leads me to Yugatech Home page
I tried http://www.macuha.com/blog/?wpcf7=json and it leads me to Marhgil Home Page
I tried searching if any of this above site mentioned has an indexed page by google with the same string and I found out that there is none. I did the same exercise as above.
Conclusion
I believed that there is some leak at my theme, and this I don’t know where. I will continue to check if this indexed page by google will try to increase and after that I have to decided on my next step. I just make a “nofollow” for my feeds the other day.
What can you do to Help Me ?
If any of you has any idea about this problem or do you have any solution It would be glad to know your opinion. BTW there is not much answer in the web regarding this problem, So it is a great opportunity for you to have visitors at you site if you can give a good answer.
[…] once site. If you are a regular reader here, You know that I am having some problem with those Unusual string showing at my Google Search Engine Results and this is the reason why I studied this matter, and […]
I am not familiar about WordPress Platform and the plug-in that you’ve mentioned on your post but I guess that you can clean this malicious script by changing your theme or re-installing the same theme again.
@Fibonacci,
It is not a problem of removing the malicious script but changing the result of the serp..
Thanks! nice post!
koloms last blog post..Libro Guiness de los Récords Especial Videojuegos 2009
@kolom,
Thanks
I accidentally add on my header. When i check today my site not in top 25 for my keyword anymore.
I already remove that line. Should i request reconsideration or just let google bots check later.
[…] content – My post on removing duplicate content that was indexed by google was a […]
nice to see you back france.. Well Google webmaster just report the latest problems encountered and not the total. Diminishing values means that Google see less problem as time pass by
hello it is me
just to let you know that it is working….robots.txt has got rid of duplicate contents. I have another question anyway. Till some days ago if i looked in my google webamster tools I saw that my robots.txt excluded about 220 duplicate contents with the famous wpcf7=json string. Today i checked it again and I found out that now robots.txt has excluded just 130 duplicate contents…what about the others 90??…why is that? does it happen to you too?
france’s last blog post..Wii Tv Guide Channel
@ france,
Just continue to check the number of indexed post in Google webmaster under sitemap. Mine has a 85% indexed by Google.
i think to wait for google to crawl my not indexed posts. I have about 200 posts…it is a little hard to browse them one by one. if google will index those posts anyway…well, i will wait
thanks my friend!
@ France,
Yup it is in the process
The best thing to do in order for your other post to be indexed is identify those post that is not yet indexed and try linking to them from your new post.
The hardest part is to know which site is not yet indexed. If you have less content it will be easier but if you have lots.. It is means more work.
hi there dexter
the code i wrote on the robots.txt are working and in fact many duplicated contents with the string have disappeard. Now anyway i have seen that some posts are not indexed by google. I think that this is due to the fact that google indexed the posts with the “wpcf7json†string while the it didnt index the orginal posts. did it happen to you? I think that in the end google will index all the posts anyway. please tell me if i am right!
thanks
fp
@ France,
Yup and I made it specifically for Google Bot. Because even you write “?” in the title it will not go to the Post Slug.
Our conversation will be of great help to others. Since lot of site is infected with this.
ok, I studied your robots.txt
you were smart because you used
Disallow: /*?*
with the * before and after. in this way you got rid of the string right away!
very good dexter!
thanks for write a comment in my post! 🙂
@ France,
Thanks for asking that.. Well actually, it might work for you, But I believe not for me since my post slug contains “wpcf7json”, this post itself might be in trouble if I am going to implement it.
Most of the index site start with ” /?wpcf7=json ”
The reawson I added * at the end is sometimes I am seeing duplicated ?wpcf7=json with in the link.
I have never seen any indexed url with other string in the front.
[…] tell you the truth, I didn’t find anything relevant, apart from the good article you can see here. One thing is for sure. When Google bots crawl your website and discover duplicated contents or […]
hi dexter…so i just finished study a long article about robots.txt
i think yuo made a mistake. in fact you should also include in your robot file this
Disallow: /*?wpcf7=json
in fact you want to block any webaddress containing
?wpcf7=json after but also before any link of your website….i check this on Google webmaster tool and it worked….waiting for your reply
@ France,
It takes about 2 weeks for those links to be removed from Google index.. Slowly it will be pushed at the farthers SERP then will be removed..
one last question…
I applied the changes you suggested in the robots.txt a couple of days ago, but i still see the string in google. how long does it take before google deletes all the ?wpcf7=json links????
thanks dexter for your great help!
@ france,
yes it is, not only that it also increase click rate
did you experienced any improvement by using robots.txt??
does your website have more visits???
@ France,
Yes the robts.txt wors.. and i experience the same thing before
look at this page of mine
http://www.webtlk.com/category/web-sites/?wpcf7=json&wpcf7=json&wpcf7=json&wpcf7=json
hi dexter,
i hope you solved the issue…on google some pages of mine are indexed like this
http://www.webtlk.com……..json&wpcf7=json&wpcf7=json
what can i do and why the plugin author doesnt solve the issue??? can you confirm that your robot.txt is working? do you experience the above string?
@ DomainPubber
try my method here >> https://www.techathand.net/2008/01/robotstxt-how-important-it-is/
When I change my robot.txt this weird json is not being indexed and shown in google anymore..
I am successful in implementing this
Hello,
So I installed the new version of the plugin (contact-form-7.1.7.5) but that weird wpcf7=json query string is still there on all my blog posts.
Can somebody tell me how to remove it please?
Thanks!
@ murphyz
Ok I will let you know if this robots.txt will eliminate those unusual string.
Dexter,
I use the Contact Form 7 on my site and have noticed this string on my firestats today – though today is the first time it’s happened.
It’s only happening for one post – and this post has 12217 hits under the direct URL, and now 157 under the ?wpcf7=json URL. It’ll be interesting to monitor the hits as they increase, to see if this latest string prevents the correct URL from increasing.
Please let me know how your robots.txt testing goes.
Thanks
Mike
[…] once site. If you are a regular reader here, You know that I am having some problem with those The Unusual string showing at my Google Search Engine Results which is the reason why study this matter. And later […]
@ France and to all have the same problem sa mine…
I am now testing some changes in the robots.txt as per Marhgil recommendation.. I tried searching information about it.. and I believe it will not be only benefecial for this problem but beneficial for the site in totality
@ France,
So you are also one of the user of that plug in. I just hope that this plugin author can do soemthing or explain something about this matter.
hi there…i enjoyed your article since i am concerned about ?wpcf7=json
i hope the author of the plugin is gonna take it seriously and fix it…i would hate to get rid of it plugin since i raelly like it!!
@ Marhgil
Thanks for that information. The information you provide will help not only in this particular problem on some other problem as well.
@ AJ Batac
As far as I know, a duplicate content in the web harms the SEO of a certain page. Could you please give me more Idea on your basis that it will not harm your SEO.
an example Seach results shows
techathand.net/2007/02/online-photo-editing/ is shown in search result and
techathand.net/2007/02/online-photo-editing/?wpcf7=json
is also shown in search result.. although the later is only a supplemental page.
No, it’s not going to harm your SEO effort. 😉
I don’t know if it has a negative effect. However, if you want it to be deindexed, or any other URL that has question mark in it, you can modify the robots.txt file as suggested by Google here.