YOS News Crawler is a program that helps you get articles from other websites. The steps below explain how to set it up.
--- Step 1: Enter the crawler link and define the page type
- Enter the link of the website you want to get articles from. YOS News Crawler supports HTML, Feed, RSS, and Atom pages.
- Published: Choose 'yes' if you want to enable this link.
- Click the test button if you want to view the site.
- Plug-in: If you have a plug-in that gets articles for you, you can choose it in this step.
--- Step 2: If your page is HTML, define the regexp that gets the detail link of each article
- RegExp: Enter a regular expression that matches the detail link.
- Sub Pattern: In the text box beside the RegExp box, enter the sub pattern of the RegExp to use. Default: 0.
- Click the test button to check your regexp.
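As a rough illustration of how a RegExp and its Sub Pattern work together, the sketch below uses Python's `re` module; the component's own regex engine may differ in details. It assumes that the sub pattern is the number of a capture group, with 0 meaning the whole match. The sample HTML, the pattern, and the `extract` helper are made up for this example.

```python
import re

# Hypothetical listing-page HTML; a real page will look different.
html = """
<div class="item"><a class="title" href="/news/101-first.html">First story</a></div>
<div class="item"><a class="title" href="/news/102-second.html">Second story</a></div>
"""

# Group 1 captures the detail link, group 2 captures the link text.
pattern = r'<a class="title" href="([^"]+)">([^<]+)</a>'

def extract(pattern, text, sub_pattern=0):
    """Return the chosen sub pattern (capture group) of every match."""
    return [m.group(sub_pattern) for m in re.finditer(pattern, text)]

whole = extract(pattern, html, 0)   # the complete matched <a> tags
links = extract(pattern, html, 1)   # only the href values
titles = extract(pattern, html, 2)  # only the link text
```

Under that assumption, sub pattern 0 returns the whole matched tag, while sub pattern 1 returns just the URL, which is usually what a detail link needs. The same idea applies to any other RegExp and Sub Pattern pair in the later steps.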
--- Step 3: If your page is HTML, define the regexp that gets the title of each article
- RegExp: Enter a regular expression that matches the title.
- Sub Pattern: In the text box beside the RegExp box, enter the sub pattern of the RegExp to use. Default: 0.
- Click the test button to check your regexp.
--- Step 4: Choose how to get the introtext and fulltext
- MainIntro and Fulltext: The component gets the introtext from the main page and the fulltext from the detail page.
- Introtext and Fulltext: The component gets both the introtext and the fulltext from the detail page.
- Introtext and Cut text: The component gets the introtext from the detail page. You can cut part of that text into the fulltext.
- MainIntro and Cut text: The component gets the introtext from the main page. You can cut part of that text into the fulltext.
--- Step 5: Define the regexp that gets the introtext
- Get Introtext: Enter a regular expression that matches the content.
- Sub Pattern: In the text box beside the RegExp box, enter the sub pattern of the RegExp to use. Default: 0.
- Auto generate fulltext: This feature cuts part of the introtext into the fulltext. If you choose 'yes', enter the number of words to cut.
- Find and Replace: This feature replaces HTML code in each crawled article. Define the regexps to replace, one regexp per line. For example: /joomla/iyopensource
- Find and Replace after encode: This feature replaces HTML code in each article after the component has encoded some special tags (a, script, img, etc.). Define the regexps to replace, one regexp per line. For example: /joomla/iyopensource
- Click the test button to check the result.
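The "Auto generate fulltext" option can be pictured with the short sketch below. It is only a guess at the behaviour: it assumes the first N words stay in the introtext and the remainder becomes the fulltext, and it works on plain text without looking at HTML tags. The `cut_text` name is hypothetical.

```python
def cut_text(text, word_count):
    """Split text after `word_count` words and return (introtext, fulltext)."""
    words = text.split()
    intro = " ".join(words[:word_count])
    full = " ".join(words[word_count:])
    return intro, full

intro, full = cut_text("YOS News Crawler gets articles from other websites", 3)
```

A real implementation would also have to avoid cutting inside an HTML tag, which this sketch does not attempt.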
--- Step 6: If you chose option 1 or 2 in Step 4 (MainIntro and Fulltext, or Introtext and Fulltext), the component takes you to this step to define the regexp that gets the fulltext
- Get Fulltext: Enter a regular expression that matches the content.
- Sub Pattern: In the text box beside the RegExp box, enter the sub pattern of the RegExp to use. Default: 0.
- Find and Replace: This feature replaces HTML code in each crawled article. Define the regexps to replace, one regexp per line. For example: /joomla/iyopensource
- Find and Replace after encode: This feature replaces HTML code in each article after the component has encoded some special tags (a, script, img, etc.). Define the regexps to replace, one regexp per line. For example: /joomla/iyopensource
- Click the test button to check the result.
--- Step 7: Some articles span several pages. You can define a regexp that gets the link to the next page.
- Next page: Enter a regular expression that matches the next-page link.
- Sub Pattern: In the text box beside the RegExp box, enter the sub pattern of the RegExp to use. Default: 0.
- Click the test button to check the result.
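The idea behind the next-page regexp can be sketched as a loop that keeps following the matched link until no further link is found. This is an illustration in Python, not the component's own code: the `pages` dictionary stands in for real HTTP requests, and the patterns, URLs, and the `collect` helper are all invented for the example.

```python
import re

# Hypothetical multi-page article, keyed by URL, standing in for real fetches.
pages = {
    "/story?page=1": '<p>Part one.</p><a class="next" href="/story?page=2">Next</a>',
    "/story?page=2": '<p>Part two.</p><a class="next" href="/story?page=3">Next</a>',
    "/story?page=3": "<p>Part three.</p>",
}

next_page = r'<a class="next" href="([^"]+)">'  # sub pattern 1 is the link
body = r"<p>(.*?)</p>"                          # sub pattern 1 is the text

def collect(start_url, max_pages=20):
    """Follow next-page links and join the body text of every page."""
    parts, url, seen = [], start_url, set()
    # Stop on an unknown URL, a repeated URL, or after max_pages pages.
    while url in pages and url not in seen and len(seen) < max_pages:
        seen.add(url)
        html = pages[url]
        parts.extend(re.findall(body, html))
        match = re.search(next_page, html)
        url = match.group(1) if match else None
    return " ".join(parts)

fulltext = collect("/story?page=1")
```

The `seen` set and the `max_pages` limit guard against a site whose last page links back to an earlier one, which would otherwise loop forever.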
--- Step 8: If you chose a plug-in in Step 1, the component takes you to this step immediately, because the plug-in does everything else for you.
- Find and Replace: This feature replaces HTML code in each crawled article. Define the regexps to replace, one regexp per line. For example: /joomla/iyopensource
- Find and Replace after encode: This feature replaces HTML code in each article after the component has encoded some special tags (a, script, img, etc.). Define the regexps to replace, one regexp per line. For example: /joomla/iyopensource
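To show what a list of one-per-line replace rules does to crawled HTML, here is a minimal Python sketch. The exact rule syntax the component expects is not spelled out here, so the sketch assumes a hypothetical format: a `/pattern/flags` expression, then whitespace, then the replacement text. The `parse_rule` and `apply_rules` helpers are invented for the example.

```python
import re

# Flag letters accepted by this sketch (an assumption, not the component's list).
FLAGS = {"i": re.IGNORECASE, "s": re.DOTALL, "m": re.MULTILINE}

def parse_rule(line):
    """Parse a hypothetical '/pattern/flags replacement' rule line."""
    m = re.match(r"/((?:\\.|[^/\\])*)/([a-z]*)\s+(.*)$", line)
    if m is None:
        raise ValueError(f"unrecognised rule: {line!r}")
    pattern, flag_chars, replacement = m.groups()
    flags = 0
    for ch in flag_chars:
        flags |= FLAGS[ch]
    return re.compile(pattern, flags), replacement

def apply_rules(rules_text, html):
    """Apply every non-empty rule line to the HTML, in order."""
    for line in rules_text.splitlines():
        if line.strip():
            regex, replacement = parse_rule(line.strip())
            html = regex.sub(replacement, html)
    return html

rules = r"""/joomla/i yopensource
/<b>(.*?)<\/b>/ <strong>\1</strong>"""

result = apply_rules(rules, "<b>Joomla</b> news from JOOMLA")
```

Rules run from top to bottom, so a later rule sees the output of the earlier ones. That ordering matters when two rules touch the same piece of HTML.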
--- Step 9: Configure the link (where to save, publishing, number of articles, etc.)
- Time update: The time delay between fetches of content.
- Get number content: The number of articles you want to crawl.
- Section/Category: Where you want to save the content.
- Publish time delay: The publish-up time, in hours.
- Publish down time: The publish-down time, in hours.
- Access Level: Which users have permission to read the content.
- Readmore: Link to the original article.
- Keyword: Keywords that define the position where articles are placed. Enter a keyword and double-click the select box beside it to generate the keyword code for you. For example: [joomla=>1,1][yopensource=>3,5]
- Translate From: Choose the language you want to translate from.
- Translate To: Choose the language you want to save in your database.
- Author: Choose the author of the articles.
- Copyright: This text will be added to the footer of the articles.
- Get Images: Choose 'Yes' if you want to save images to your site.
- Resize Images: Choose 'Yes' if you want to resize the images stored on your site.
- Images's pixel: The number of pixels to resize images to.
- Keyword (Filter): Use keywords to filter the data received. For example: [joomla][yopensource]
- Position of Ads: Choose the position where ads are displayed.
- Advertising content: Put your ad content here.
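The two bracketed keyword formats in Step 9 can be read with a few lines of Python, shown here only as a sketch. The meaning of the two numbers after `=>` is not explained in this help text, so they are parsed but left uninterpreted. The matching rule (keep an article if it contains any keyword, ignoring case) is an assumption, and all three function names are hypothetical.

```python
import re

def parse_filter(code):
    """'[joomla][yopensource]' -> ['joomla', 'yopensource']"""
    return re.findall(r"\[([^\]\[]+)\]", code)

def parse_keyword_code(code):
    """'[joomla=>1,1]' -> {'joomla': (1, 1)}; the numbers stay uninterpreted."""
    pairs = re.findall(r"\[([^\]=]+)=>(\d+),(\d+)\]", code)
    return {word: (int(a), int(b)) for word, a, b in pairs}

def matches_filter(text, keywords):
    """True if the text contains any keyword (case-insensitive, an assumption)."""
    lowered = text.lower()
    return any(k.lower() in lowered for k in keywords)

keywords = parse_filter("[joomla][yopensource]")
mapping = parse_keyword_code("[joomla=>1,1][yopensource=>3,5]")
```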
Son - YOS Team