AWStats 6.9发布: 补充中文搜索引擎定义和配置样例下载


Eldy赶在新年前把AWStats 6.9发布了: 主要的蜘蛛定义和搜索引擎定义修改以及本站的配置样例我已经打包在这里;AWStats虽然是perl写的,但是基本上要用起来不需要对perl熟悉,主要是配置的修改,并且可以适用于于大部分网站的流量结合Google Analytic统计作为网站状况的轻量级基础统计解决方案;

相关的更新也已经提交: 欢迎各位补充,争取在下一个版本中发布
AWStats - Patches - 4 items
1569229 Simplified Chinese language file update
1569201 top Chinese browser and robot update
1569151 TOP Chinese local search engines update
2499455 robots.txt: clfmerged log files maybe not start with /
AWStats - Feature Requests - 2 items
2498163 configurable $LIMITFLUSH and increase default value to 50000
706297 IIS timezone:change the timeline instead of change time

完整的diff附后: 包含了awstats.pl本身的2个小修改;
1 针对大量URL:增大$LIMITFLUSH减少临时文件I/O;
2 针对泛域名型应用的修改:使用clfmerge -b合并后的日志无法匹配"GET /robots.txt" (因为被clfmerge拼上域名,变成了 "GET http://foo.example.com/robots.txt");

cvs diff: Diffing .
Index: awstats.pl
===================================================================
RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/awstats.pl,v
retrieving revision 1.926
diff -r1.926 awstats.pl
46c46
<   5000;   # Nb of records in data arrays after how we need to flush data on disk
---
>   50000;   # Nb of records in data arrays after how we need to flush data on disk
10385c10385
<       my $regrobot       = qr/^\/robots\.txt$/i;
---
>       my $regrobot       = qr/\/robots\.txt$/i;
cvs diff: Diffing lang
cvs diff: Diffing lang/tooltips_f
cvs diff: Diffing lang/tooltips_m
cvs diff: Diffing lang/tooltips_w
cvs diff: Diffing lib
Index: lib/robots.pm
===================================================================
RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/lib/robots.pm,v
retrieving revision 1.53
diff -r1.53 robots.pm
356a357
> 'yahoo!\sslurp\schina',
476a478
> 'soso',
677a680,688
> 'blogbus',
> 'QBikSpider',
> 'Zwl.Rss.Spider',
> 'BBReaderFeedSpider',
> 'yeti',
> 'doubanbot',
> 'moreoverbot',
> 'xianguo',
> 'blogvibebot',
998a1010
> 'yahoo!\sslurp\schina','<a href="http://misc.yahoo.com.cn/help.html" title="Bot home page [new window]" target="_blank">Yahoo! Slurp China</a>',
1115a1128
> 'soso','<a href="http://help.soso.com/webspider.htm" target="_blank">Soso Spider</a>',
1319a1333,1341
> 'blogbus','<a href="http://www.blogbus.com" title="BlogBus Home [ new window ]" target="_balnk">BlogBus</a>',
> 'QBikSpider','QBikSpider',
> 'Zwl.Rss.Spider','Zwl.Rss.Spider',
> 'BBReaderFeedSpider','BBReaderFeedSpider',
> 'yeti','<a href="http://help.naver.com/robots/" target="_blank">Yeti</a>',
> 'doubanbot','<a href="http://www.douban.com" target="_blank">doubanbot</a>',
> 'moreoverbot','<a href="http://www.moreover.com" target="_blank">Moreoverbot</a>',
> 'xianguo','<a href="http://www.xianguo.com/" target="_blank">XianGuo</a>',
> 'blogvibebot','BlogVibeBot',
Index: lib/search_engines.pm
===================================================================
RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/lib/search_engines.pm,v
retrieving revision 1.46
diff -r1.46 search_engines.pm
288a289
> 'image\.baidu\.com',     # baidu image search
296a298
> '\.youdao\.com',       # powered by NetEase
503a506
> 'image\.baidu\.com','baidu_image',
508a512
> '\.youdao\.com','yodao',
768a773
> 'baidu_image','(wd|word)=',
774c779,780
< 'soso','q=',
---
> 'yodao','q=',
> 'soso','w=',
996a1003
> 'baidu_image','<a href="http://image.baidu.com/" target="_blank">Baidu Image</a>',
1001a1009
> 'yodao', '<a href="http://www.youdao.com/" target="_blank">YoDao</a>',
作者:车东 发表于:2009-01-12 12:01 最后更新于:2009-02-11 18:02
版权声明:可以转载,转载时请务必以超链接形式标明文章 的原始出处和作者信息及本版权声明

Comments

大大你好,在awstats的
參考資訊:
來源網址
... 由那些搜尋引擎轉介
會看到 eg
1)Yahoo
2)Google,
這是否包括從
com.cn or .com or .hk的yahoo,

還是可分出由什麼地區的Yahoo 轉介?

謝謝

WIN平台下用GNUWIN32打PATCH打不上,怎么办呢?
youdao的蜘蛛手工怎么加呢?
谢谢
Index: search_engines.pm
===================================================================
RCS file: /cvsroot/awstats/awstats/wwwroot/cgi-bin/lib/search_engines.pm,v
retrieving revision 1.46
diff -r1.46 search_engines.pm
288a289
> 'image\.baidu\.com', # baidu image search
296a298
> '\.youdao\.com', # powered by NetEase
503a506
> 'image\.baidu\.com','baidu_image',
508a512
> '\.youdao\.com','yodao',
768a773
> 'baidu_image','(wd|word)=',
774c779,780
'yodao','q=',
> 'soso','w=',
996a1003
> 'baidu_image','Baidu Image',
1001a1009
> 'yodao', 'YoDao',

发表一个评论

(如果你此前从未在此 Blog 上发表过评论,则你的评论必须在 Blog 主人验证后才能显示,请你耐心等候。)

相关文章

关于

此页面包含了发表于2009年01月12日 下午12时59分的 Blog 上的单篇日记。

此 Blog 的前一篇日记是 关于Google FREE Webhosting !的欺诈邮件 200∞

此 Blog 的后一篇日记是 使用Google analytics的 _trackPageview()对网页进行重新命名统计

更多信息可在 主索引 页和 归档 页看到。

Creative Commons License
此 Blog 中的日记遵循以下授权 Creative Commons(创作共用)授权.
Powered by
Movable Type 3.36