Even with extensive manual blocking I’ve still been manually deleting up to hundreds of seemingly-hand-entered spams a day. I’m trying yet another new filter. The good news is 99% of the time your real comments should go through unchallenged, and the 1% of the time they are you’ll be asked to fill out a Captcha.
Let me know in comments if you can’t, comment. Um… no, wait, let me know in email (address here) if you can’t comment.
Apologies all around.
figleaf
So I’ve just confirmed with another blogger that comments I leave on their blog are being automatically spam-filtered. And as she said
I did search for a recent comment from you …and found it amidst my spam. And not just comments awaiting moderation, but the ‘bad’ spam. LOL
Craziness. Every time I catch a comment of yours, I mark it as ‘not spam’. Which should mean it doesn’t keep snagging your comments.
I already knew blogging systems that use the popular (and otherwise generally excellent) Akismet kick me to the curb- and some other systems drag me down too. Now it looks like [Blogger’s correction: Wordpress! —fl] is doing it as well. That’s ok, sort of… something about a URL with text “real adult sex” in it maybe? And, after all, my original idea when I saw the URL for sale years ago was to start a straight-ahead progressive political blog with a minor in 1st Amendment issues precisely because I knew it would get filtered and nanny-netted.
What’s bugging me a bit more, though, is that I might be getting filtered when I comment using my alt-URL on Blogger, figleaf.blogspot.com and alt-email address (“talkingfigleaf” on the gmail system.)
Anyway, two questions:
- Anyone else with sexually oriented but non-spam content think their comments are getting filtered? – Has anyone else checked their moderation lists for incorrectly marked comments? (This might not be worth your while and I wouldn’t blame you if you didn’t bother.)
I promise I might be irked but I’m not offended if my main URL is being spam-filtered, although I’d like to politely ask policy administrators on the big filtering teams to add me to their white-lists. Unless their algorithms just don’t work that way or, of course, if they honestly believe I really am some kind of spammer or my content is really that inappropriate.
(Note: As chance would have it I’ve been so busy for the last three months that I’ve barely had time to keep up with the comments on my blog, let alone comment heavily on other people’s. Yet sometime in those three months I seem to have been plunged into black-list oblivion.)
Ok, so I’ve added a “Captcha” style comment verification to my posts.
The good news? My hosting service has re-enabled comments.
The bad news? You have to guess what the smudgy text says and type it in.
The good news? When you retype the smudgy text you’re re-typing, and confirming, text that’s been scanned from old books using the reCaptcha collaborative web service.
Here’s how that works, according to their website:
About 60 million CAPTCHAs are solved by humans around the world every day. In each case, roughly ten seconds of human time are being spent. Individually, that’s not a lot of time, but in aggregate these little puzzles consume more than 150,000 hours of work each day. What if we could make positive use of this human effort? reCAPTCHA does exactly that by channeling the effort spent solving CAPTCHAs online into “reading” books.
To archive human knowledge and to make information more accessible to the world, multiple projects are currently digitizing physical books that were written before the computer age. The book pages are being photographically scanned, and then, to make them searchable, transformed into text using “Optical Character Recognition” (OCR). The transformation into text is useful because scanning a book produces images, which are difficult to store on small devices, expensive to download, and cannot be searched. The problem is that OCR is not perfect.
eCAPTCHA improves the process of digitizing books by sending words that cannot be read by computers to the Web in the form of CAPTCHAs for humans to decipher. More specifically, each word that cannot be read correctly by OCR is placed on an image and used as a CAPTCHA. This is possible because most OCR programs alert you when a word cannot be read correctly.
The process is actually pretty cool. Get the whole non-technical rundown here.
At any rate, while I regret having to subject you to comment confirmation at least I was able to find something that actually does something besides strain your eyes.
Update: If you missed a chance to comment on one of my posts from earlier this week those should work now too. Even the egregious Lonesome Folksinger HNT post that, for poetic reasons if nothing else, I should have left closed. :-)
Update #2: Also, in the interest of getting more books scanned, I’ve added a “reply to this comment” feature so that you can reply directly to each other’s comments as well as to my posts. (Just be polite to each other — I’m way nicer to people who are mean to me than to my important guests… and I feel strongly that every one who visits, let alone comments, is a very important guest!)
Oh botheration! Looks like my comments are fouled up again — possibly because I’m now getting thousands of spam hits a day and that consumes resources that my ISP doesn’t like.
If so I may need to either upgrade my service, move to a new host, or otherwise reconfigure my blog to make it harder for spammers to access it. (Movable-type back-end consultants are welcome to drop me a line!)
Update: The good news (such as it is) is that I’m getting to learn even more (than I wanted to :-)) about my blogging tools. Plus java-script (which back when I was a web professional I went to a lot of trouble never to learn.)
Meanwhile…