Archive for December, 2006

中国博客2006

截止到2006年11月3日,全球中文博客站点数量达到5230万,博客(Blogger)用户数达到 1987万,平均每个博客(Blogger)用户拥有大约2.6个博客,博客站点数和博客用户数均比去年有一定程度的增长,人均拥有博客数与去年相比也略有上升。

在近二千万的中国博客用户中,每个用户平均每7.6天更新一次博客,活跃的博客用户数(一周内有更新的博客)达到 302万,约15.2%的用户每周更新博客,同时,只有大约4.6%的用户每天更新博客。在用户更新的博客中,约42%的博客文章在500个汉字 (1000个字节)之内,500~1000个汉字(1000~2000个字节)的博客文章占约16.5%,越是长篇大论的文章在博客中所占比例则越少。

用户更新博客的时间大部分集中在白天,约48.5%的用户选择在工作时间(上午10点~下午6点)更新自己的博客, 其中,在临近下班前(下午4点~6点)博客更新数达12.4%,而约16.3%的用户选择在晚上7点~10点更新博客,为全日最高峰,晚上10点之后时间 更新博客的数量逐渐减少。

2006年,拥有独立域名的博客站点,在博客站点总数中占约0.43%,这表明,绝大多数的用户将博客服务商作为他们的博客站点首选。而这一年,中国大陆博客服务商(BSP)持续大幅增长,博客服务商数量达到1460家,与去年同期相比增长近55%;大型网络公司如搜狐、百度纷纷推出相应的博客服务。

过去一年中,中国博客发展速度整体趋缓,博客用户增长数较之去年有小幅上升,博客服务商之间的市场竞争日趋激烈,与早期提供博客服务的运营商相比,传统门户及大型公司的博客服务已占据市场主导地位。总体而言,2006年中国博客发展呈现5大特点:

  • 专业博客如医药类、教育类等增长较快,博客圈成为社区发展新方向
  • 博客服务商(BSP)死亡比例逐渐增高,2005年Top100服务商中,近20%的站点已经关闭或终止服务
  • 博客服务商(BSP)开始逐渐支持手机访问和发贴,发展迅速
  • 综合博客服务商(BSP)增长很快,越来越多的服务商开始提供音频、视频博客等功能
  • 利用博客进行排名作弊的站点越来越多,06年出现爆炸性增长,其中小型博客服务商(BSP)尤为突出

百度的空间

Comments

微软全新版面上线

About the new Microsoft.com home page

microsoft

On December 14 we introduced a new home page for Microsoft.com. The new page incorporates months of research, testing, customer feedback, and refinements. We hope the new page makes it easier to find what you’re looking for on Microsoft.com, and that you find new items of interest along the way.

Q:

Why have you suddenly changed the home page? And how is this one better than the previous page?

A:

The changes have actually been in the works for most of the past year. We began testing the new design and navigation system early in 2006, and since July a group of customers has been providing feedback as beta testers. As a result we’re confident the new page will make it easier for you to find what you’re looking for on the site.

The new navigation is aimed at helping you quickly find the most sought-after content on Microsoft.com. We based our design on data of the most common tasks, products, and services that people come to our site for, and then we validated and improved the design during months of customer testing. It’s extremely important to us that you find what you’re looking for on Microsoft.com. That can be a challenge because we serve up hundreds of thousands of pages and address the interests of millions of customers.

Q:

Why did you move the main navigation menu from the left side of the page to the right?

A:

Customers told us it was too hard to find what they were looking for on the home page. So we separated the main navigation menu from the featured announcements on the page. This separation makes it easier for you to either scan the announcements for new items of interest, or dig into the site navigation. Moving the menu to the right side of the page turned out to be the solution that looks best and makes the menu easiest to find.

Q:

Have you finished making changes to your site now?

A:

Nope, what you see on the home page is just the tip of the iceberg in terms of changes we’re making to our Web site. Some of the biggest changes are ones you don’t see. In the background we are beginning to migrate much of Microsoft.com to a new platform based on Microsoft Office SharePoint Services 2007. Using Office SharePoint Services will bring us a broad range of new capabilities that will help you more easily find the information, products, and services you’re looking for across our site. Microsoft.com is the 5th most visited Web site on the Internet, hosting hundreds of thousands of pages of content. Running our site on Office SharePoint Services goes a long way in demonstrating to customers worldwide that the product is reliable and provides an effective solution for managing vast amounts of content.

Comments

垂直搜索

垂直搜索的核心技术实际上就是智能爬虫的技术,也就是说如何将定向或者非定向的网页抓取下来并进行分析后得到格式化数据的技术。那么衡量一个垂直搜索引擎的好坏主要有以下几个标准。

  1. 数据的更新频率
    顾名思义,就是爬虫从目标网站上爬取数据的频率。
  2. 覆盖网站个数
    覆盖尽量多的网站,对提供的信息数量将是一个保证。
  3. 单站有效数据抓取率
    单个目标网站的有效数据,对数据量的多少有直接的影响。衡量一个爬虫的重要标准之一。
  4. 信息抽取完整率和准确率
    此项指标的重要度不言而喻。信息的准确率和完整率直接关系到整个搜索引擎搜索结果的质量。

经过发展现有垂直搜索爬虫分为2种基本模式。

  • 定向爬虫获取信息,配上手工或者自动的模版进行信息匹配,将信息进行格式化分析存储。
    • 优势:
      基于模版的信息提取技术,能提供更加精准的信息。比如价格,房屋面积,时间,职位,公司名等等。
    • 劣势:
      目标网站难以大面积覆盖,因为基于模版匹配的信息提取技术,需要人工的参与配置模版,欲要大面积覆盖各个目标网站,需要大量的人力成本,同样维护模板也需要很大的人力成本。
  • 语义爬虫全网爬取,爬虫根据语义识别,自动进行信息格式化分析,并存储。
    • 优势:
      全网非定向抓取目标网站,有效的保证信息数量。
      不需要人工参与定制和维护模板,有效的保证了自身的人力和维护成本。
    • 劣势:
      相对于第一种模板匹配,根据语义来进行数据抓取,准确率略有下降

搜索引擎研究所

Comments

搜索引擎优化论坛和博客列表

论坛

  • 站长世界


    This site is a service to the web site administrator community. We are here for WebmasterWorld members to discuss the process of doing business on the internet. Running a website takes a great deal of knowledge. The design, coding, maintenance, promotion, marketing, and management of a website is almost an impossible task for one person alone without extensive training. We are here as a forum for the members to share and gain knowledge in operating and promoting a website. Think of us as part of your extended site development and process team.

  • SEW论坛
  • 搜索引擎观察

    最早提供搜索引擎新闻和知识的网站之一,论坛版主也都是搜索引擎业界重量级的人物。

  • DigitalPoint论坛
  • High Rankings SEO Forum

博客

Comments

搜索引擎市场占有率图表

这样的分析数据,你信吗?

搜索引擎市场占有率图表

蓝衫实验室

Comments (2)

每一个伟大的网站背后,都有一个赤裸的女人

其实哪里都一样的,比如当年木子美之于blogcn和blogchina,竹影青瞳之于天涯社区;素质高的西方呢?远的来说,拉链门事件成全了德拉吉报道和blog;近的来说,帕里斯·希尔顿裸照成全了digg.com――所以,麦田说,每一个伟大的网站背后,都有一个赤裸的女人。

张钰事件

Comments

客户创造价值

商业的基础,是赢得消费者,为客户创造价值。反商业并不是拒绝商业,而是将商业先放到一边,专注于自己的产品和用户。中国的互联网公司之所以很难做大,常常不是因为经营者不考虑商业,而是因为他们把赚钱的目标摆在了用户的利益之上,商业得过了头。没有一家成功的大网站,会把他们的首页密密麻麻地塞满广告,会把他们的广告伪装成正常的搜索结果,会用数不清的弹出窗口骚扰自己的用户。

专注服务

Comments

Next entries »