Learning Poster
- Included Free with ScrapeBox
- Multi-Threaded Connections
- Post to +30 Platforms
- Trainable Platforms
- Captcha Support
- Proxy Support
- Build Backlinks
- Muti-Step Form Capability
- Advanced Tokens
- Automator Support
The ScrapeBox v2 Learning Poster is a new trainable Fast Poster included with the latest version of ScrapeBox.
It’s an extremely flexible poster capable of being trained to post to dozens of platforms including Blogs, Guestbooks, Contact Forms, Image Comments, Trackbacks and even some Forum and BBS sites.
The poster is fast, multi-threaded and can process sites with image or text captcas, it also has the ability to use keyword blacklists or whitelists to ensure you don’t post on pages with bad words or you only post on pages containing specific keywords.
If like flexibility and power, then you are free to train your own platforms or enhance existing platforms due to the use of fairly easy to understand platform definition files which can be built or modified using a simple text editor.
But if not that’s ok, it comes pre-trained to post to the following platforms:
It’s an extremely flexible poster capable of being trained to post to dozens of platforms including Blogs, Guestbooks, Contact Forms, Image Comments, Trackbacks and even some Forum and BBS sites.
The poster is fast, multi-threaded and can process sites with image or text captcas, it also has the ability to use keyword blacklists or whitelists to ensure you don’t post on pages with bad words or you only post on pages containing specific keywords.
If like flexibility and power, then you are free to train your own platforms or enhance existing platforms due to the use of fairly easy to understand platform definition files which can be built or modified using a simple text editor.
But if not that’s ok, it comes pre-trained to post to the following platforms:
- 4image
- Advanced Guestbook
- AkoBook
- Ard Guestbook
- Aska BBS
- ASP Blog
- Basti Guestbook
- BeepWorld
- Bella Guestbook
- Blogengine
- Burning Book
- Chinese Blog
- cms2day
- CoderWorld
- Coppermine
- DedeIms
- DRB Guestbook
- e107 Forum
- EasyBook Reloaded
- GA Guestbook
- Gallery2 Image
- Icybook Guestbook
- Jambook Guestbook
- Jax Guestbook
- Joomla Comment
- K2 Blog
- Pixelpost
- Plogger
- Serendipity Blog
- Sitebuilder Guestbook
- TextCube Guestbook
- WordPress
- WPTrackback
With the release of ScrapeBox v1.16.0 and the new Learning Poster v2,
platforms that the new commenter is able to post to are also available
from the ScrapeBox harvester.
You can now select multiple platforms to harvest from at the same time, and each time new platforms are released you are able to download the new platforms and footprints right from the harvesters Platforms window.
Also all platforms files are fully editable, so you can modify the default footprints or you can even add multiple footprints for each individual platform. This is ideal when a platform has a number of possible footprint variations, or is available in different languages.
This is synced to the comment poster, so platforms you can harvest here you can also post comments to.
You can now select multiple platforms to harvest from at the same time, and each time new platforms are released you are able to download the new platforms and footprints right from the harvesters Platforms window.
Also all platforms files are fully editable, so you can modify the default footprints or you can even add multiple footprints for each individual platform. This is ideal when a platform has a number of possible footprint variations, or is available in different languages.
This is synced to the comment poster, so platforms you can harvest here you can also post comments to.
Decaptcher Setup
The new Learning poster has the ability to use multiple captcha services such as OCR software like Captcha Sniper, as well as online services like De-Captcher.com, Death By Captcha and Image Typerz.
A primary and secondary service can be setup with adjustable retries for each services, so for instance you can setup Captcha Sniper as the first service with 3 retries to try and solve the captcha using free OCR solving then if this fails to solve the captcha it can be sent to a service for human solving.
Using this method you can solve the maximum amount of captchas possible, with the least cost involved.
The poster has an internal math question solver to automatically solve various anit-spam math questions such as “10 + 6 – 2 =” giving it the ability to successfully post to a wide range of urls. It’s capable of handling words like “ten plus six minus two equals”, it can use multiple math symbols like : / * x + – ÷ and it can also compute math questions in German using mal, weniger, geteilt, und as well as words numeric words like eins, zehn etc.
The poster also has the ability to use both Blacklists and Whitelists to control where posts are made. Using the Blacklist you are able to add “bad” words and when any page is encountered that contains any one of your bad words the poster will skip this URL. Likewise with the Whitelist,you are able to add keywords related to your domain or niche and the poster will only make comments on pages where one or more of your whitelisted words appear. This gives you great control over where your links are placed.
To use the new poster, ensure you have updated to ScrapeBox v1.16.0 then install the Learning Mode Poster V2 addon from the Addons menu. Once installed you will be able to download more platforms via the “Platforms” button on the harvester.
To post to the new platforms with the new poster, simply use the regular Fast Poster in ScrapeBox like normal and the new poster will launch in it’s place.
A primary and secondary service can be setup with adjustable retries for each services, so for instance you can setup Captcha Sniper as the first service with 3 retries to try and solve the captcha using free OCR solving then if this fails to solve the captcha it can be sent to a service for human solving.
Using this method you can solve the maximum amount of captchas possible, with the least cost involved.
The poster has an internal math question solver to automatically solve various anit-spam math questions such as “10 + 6 – 2 =” giving it the ability to successfully post to a wide range of urls. It’s capable of handling words like “ten plus six minus two equals”, it can use multiple math symbols like : / * x + – ÷ and it can also compute math questions in German using mal, weniger, geteilt, und as well as words numeric words like eins, zehn etc.
The poster also has the ability to use both Blacklists and Whitelists to control where posts are made. Using the Blacklist you are able to add “bad” words and when any page is encountered that contains any one of your bad words the poster will skip this URL. Likewise with the Whitelist,you are able to add keywords related to your domain or niche and the poster will only make comments on pages where one or more of your whitelisted words appear. This gives you great control over where your links are placed.
To use the new poster, ensure you have updated to ScrapeBox v1.16.0 then install the Learning Mode Poster V2 addon from the Addons menu. Once installed you will be able to download more platforms via the “Platforms” button on the harvester.
To post to the new platforms with the new poster, simply use the regular Fast Poster in ScrapeBox like normal and the new poster will launch in it’s place.
Training Platforms Adding new Platforms to ScrapeBox
Platform Guide
- Train new Platforms
- Enhance Existing Platforms
- Detect Different Captchas
- Modify Search Footprints
- Multi-step Forms
- Change Success/Fail Footprints
Since 2011 ScrapeBox has had the ability to learn new platforms, and
ScrapeBox can post to virtually any platform or form that doesn’t
require a user account to be created on the site website. So it can post
to blog platforms, guestbooks, contact forms, trackbacks, some open
forums and wiki’s.
In order to work with a platform, you will need to create a definition file which is just a plain text file using the Microsoft .ini format like the screenshot above. This consists of [Sections] which contain a number of Name=value keys. The first section in the ScrapeBox platform files is…
[Setup]
The setup decides the basics on what footprints are used in the
harvester to find the platform, how ScrapeBox can identify this platform
once it loads a page, how can ScrapeBox detect if a comment to this
platform was successful or failed and things like how to handle the URL’s and navigating the pages. Below are the available Name= entries that are valid for the setup.
FriendlyName= Any name you want to call the platform, will be used in the GUI.
UseBlackList= Values can be 1 to use a blacklist or 0 not not use the blacklist. This is the bad words list you can edit in the poster.
UseWhiteList= Values can be 1 to use a whitelist or 0 not not use the whitelist. This is the bad words list you can edit in the poster.
Platform= This is the type of platform it is, such as Blog, GuestBook, Image, Forum, Contact Form, Trackback and used used to group similar platforms.
Markup= How to handle links and code, values can be HTML or BB
PageMustContain= If any of the given strings can be found in the pagecode, the page is valid. | is interpreted as OR, * is interpreted as AND
Success= If any of the given strings can be found in the resultpage after post, the submission was a success. | is interpreted as OR, * is interpreted as AND
Failed= If any of the given string can be found in the resultpage after post, the submission failed. | is interpreted as OR, * is interpreted as AND
UseBlackList= Values can be 1 to use a blacklist or 0 not not use the blacklist. This is the bad words list you can edit in the poster.
UseWhiteList= Values can be 1 to use a whitelist or 0 not not use the whitelist. This is the bad words list you can edit in the poster.
Platform= This is the type of platform it is, such as Blog, GuestBook, Image, Forum, Contact Form, Trackback and used used to group similar platforms.
Markup= How to handle links and code, values can be HTML or BB
PageMustContain= If any of the given strings can be found in the pagecode, the page is valid. | is interpreted as OR, * is interpreted as AND
Success= If any of the given strings can be found in the resultpage after post, the submission was a success. | is interpreted as OR, * is interpreted as AND
Failed= If any of the given string can be found in the resultpage after post, the submission failed. | is interpreted as OR, * is interpreted as AND
All platform definition files should have the above fields added and
set. The below are not required, but often must be used to perform more
advanced functions.
PageMustNotContain= If any of the given strings can be found in the pagecode, the page is invalid. | is interpreted as OR, * is interpreted as AND
Enctype= The Encoding type if you wish to override the forms default encoding such as application/x-www-form-urlencoded
LoadUrl= Locate the given url and load the target page. Will skip RemovefromUrl, RemoveFromUrlAfter, and ModifyUrl
LoadUrlFromAnchor= Locate the given anchor, grab the url and load the target page. Will skip RemovefromUrl, RemoveFromUrlAfter, and ModifyUrl
RemoveFromUrl= Remove given strings from the baseurl. Multiple strings are separated with |
RemoveFromUrlAfter= Remove everything from the position of given strings in the baseurl. Multiple strings are separated with |
ModifyUrl= Add something to the baseurl. variables %host% and %path% can be used to rebuild the baseurl.
DeleteCookies= List of cookie names to delete
Enctype= The Encoding type if you wish to override the forms default encoding such as application/x-www-form-urlencoded
LoadUrl= Locate the given url and load the target page. Will skip RemovefromUrl, RemoveFromUrlAfter, and ModifyUrl
LoadUrlFromAnchor= Locate the given anchor, grab the url and load the target page. Will skip RemovefromUrl, RemoveFromUrlAfter, and ModifyUrl
RemoveFromUrl= Remove given strings from the baseurl. Multiple strings are separated with |
RemoveFromUrlAfter= Remove everything from the position of given strings in the baseurl. Multiple strings are separated with |
ModifyUrl= Add something to the baseurl. variables %host% and %path% can be used to rebuild the baseurl.
DeleteCookies= List of cookie names to delete
Here you can see a basic example of the [Setup] for Bella Guestbook.
For the PageMustContain, PageMustNotContain, Success and Failed
values this scans the page contents for the markers you add so you can
add text, html, javascript or anything in the page content.
This platform also uses 2 optional values RemoveFromUrl and
ModifyUrl. This tells ScrapeBox when it lands on the guestbook, no
matter what the page it should trim index.php and sign.php and everything after these like querystrings from the URL, then load %host%%path%sign.php
so if it landed on scrapebox.com/guestbook/index.php?page=123 it would
strip the last part and load scrapebox.com/guestbook/sign.php
This is used when the page you need to post the comment on is different then the page you load. So you can train ScrapeBox to navigate to the correct page to make the post.
This is used when the page you need to post the comment on is different then the page you load. So you can train ScrapeBox to navigate to the correct page to make the post.
[Step] which deals with making the post.
DoStepIf= Process this step only when any of the
given strings can be found in the page code. | is interpreted as OR, *
is interpreted as AND. If not set, the step will be processed always.
FormMustContain= The form is valid when any of the given strings can be found in the form. | is interpreted as OR, * is interpreted as AND
FormMustNotContain= If the form contains any of the given strings, the form is invalid. | is interpreted as OR, * is interpreted as AND
PostUrl= A | separated list of url parts used to grab the post url. It looks between <form and >
AddToPostUrl= A value added to post url. Masks (%…%) can be used.
DelayPost= Delay post by the given number of seconds. The variable %rndnum-x-y% can be use too.
DelayPostIf= Only delay the post when any of the listed strings can be found. Multiple strings are separated with |
AddToPostDataIfInpage= Will add all AddToPostData= fields when any of the with | separated strings is found in the pagecode.
AddToPostData= fieldname=variable will be added to the postdata when the AddToPostdataIfInPage condition is true. When no AddToPostDataIfInpage if set, AddToPostData will be added always.
EncodeFieldNames= 1 will url encode fieldnames.
Fieldnames can contain * as a wildcard. So if fieldname is captcha_code123 where 123 is different on each blog/post then captcha_code*=%captcha% will match.
Variables:
All ini setting using variables allow spintax, for example thename={%rnd-name%|%rnd-email%} is valid. Values assigned to variables also allow spintax.
%host% Represents the host name of the target url
%path% Represents the path of the target url
%rnd-name% Returns a random name from the file ~cpn.txt. Spintax allowed.
%rnd-email% Returns a random email from the file ~cpe.txt Spintax allowed.
%rnd-website% Returns a random website from the file ~cpw.txt Spintax allowed.
%rnd-comment% Returns a random comment from the file ~cpc.txt Spintax allowed.
%rnd-option% Return a random option. Values are grabbed from the <select/option tags of the form
%rnd-location% Spintax allowed.
%rndnum-x-y% Returns a random number between x and y.
%ignore% Just use the original value represented in the form.
%user-domain% Extract the domain of the user’s website generated previously by %rnd-website%
%user-name% Previously by %rnd-name% generated username
%user-email% Previously by %rnd-email% generated email
%user-comment% Previously by %rnd-comment% generated comment
%user-location% Previously by %rnd-location% generated location
%user-website% Previously by %rnd-website% generated website
%wphashcash% Result of WPHashCash processing (internal code)
%captcha% Image captcha result
%question% text captcha result
%serverstatus-200% Represents server status code 200
%serverstatus-302% Represents server status code 302
%header-xxxx% Checking the post header for the presence of xxxx in it.
%unixtimestamp% returns the current unix timestamp
%unixtimestampms% returns the current unix timestamp in milli seconds
%xxxxxx% Executing a section with the name xxxxx
You can have multiple [Step] configured for multi-step forms that may require you to fill out info on 2 or more pages.
We have hundreds of video tutorials for ScrapeBox.
https://www.youtube.com/user/looplinescrapebox/videos
FormMustContain= The form is valid when any of the given strings can be found in the form. | is interpreted as OR, * is interpreted as AND
FormMustNotContain= If the form contains any of the given strings, the form is invalid. | is interpreted as OR, * is interpreted as AND
PostUrl= A | separated list of url parts used to grab the post url. It looks between <form and >
AddToPostUrl= A value added to post url. Masks (%…%) can be used.
DelayPost= Delay post by the given number of seconds. The variable %rndnum-x-y% can be use too.
DelayPostIf= Only delay the post when any of the listed strings can be found. Multiple strings are separated with |
AddToPostDataIfInpage= Will add all AddToPostData= fields when any of the with | separated strings is found in the pagecode.
AddToPostData= fieldname=variable will be added to the postdata when the AddToPostdataIfInPage condition is true. When no AddToPostDataIfInpage if set, AddToPostData will be added always.
EncodeFieldNames= 1 will url encode fieldnames.
Fieldnames can contain * as a wildcard. So if fieldname is captcha_code123 where 123 is different on each blog/post then captcha_code*=%captcha% will match.
Variables:
All ini setting using variables allow spintax, for example thename={%rnd-name%|%rnd-email%} is valid. Values assigned to variables also allow spintax.
%host% Represents the host name of the target url
%path% Represents the path of the target url
%rnd-name% Returns a random name from the file ~cpn.txt. Spintax allowed.
%rnd-email% Returns a random email from the file ~cpe.txt Spintax allowed.
%rnd-website% Returns a random website from the file ~cpw.txt Spintax allowed.
%rnd-comment% Returns a random comment from the file ~cpc.txt Spintax allowed.
%rnd-option% Return a random option. Values are grabbed from the <select/option tags of the form
%rnd-location% Spintax allowed.
%rndnum-x-y% Returns a random number between x and y.
%ignore% Just use the original value represented in the form.
%user-domain% Extract the domain of the user’s website generated previously by %rnd-website%
%user-name% Previously by %rnd-name% generated username
%user-email% Previously by %rnd-email% generated email
%user-comment% Previously by %rnd-comment% generated comment
%user-location% Previously by %rnd-location% generated location
%user-website% Previously by %rnd-website% generated website
%wphashcash% Result of WPHashCash processing (internal code)
%captcha% Image captcha result
%question% text captcha result
%serverstatus-200% Represents server status code 200
%serverstatus-302% Represents server status code 302
%header-xxxx% Checking the post header for the presence of xxxx in it.
%unixtimestamp% returns the current unix timestamp
%unixtimestampms% returns the current unix timestamp in milli seconds
%xxxxxx% Executing a section with the name xxxxx
You can have multiple [Step] configured for multi-step forms that may require you to fill out info on 2 or more pages.
Comment Poster Tutorial
View our video tutorial showing the Comment Poster in action. This feature is included with ScrapeBox, and is also compatible with our Automator Plugin.We have hundreds of video tutorials for ScrapeBox.
https://www.youtube.com/user/looplinescrapebox/videos
My Profile Page On BHW:
http://www.blackhatworld.com/blackhat-seo/members/921588-joseph-lich.html