About the author

Username: stanley.cheng
Pen name: stanley
Region: Tianjin, China
Industry: Other


INFORMATION SCIENCE -Stanley's personal BLOG

 

Feet on the ground, starting from zero once again! stanley, 16 April 2005

Posts

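The Post-Information Age: Prospects for Information Personalization (Part 1)

I. "Personalization" means personalization under certain rules and order, even though those rules and that order themselves need continual refinement and adjustment

Information resources are so vast and tangled that sorting out their relationships is already no easy task. Many years ago information scientists successfully predicted the information explosion and proposed theories to contain it. Yet to this day clever people keep rolling out, wave after wave, information models in different formats and structures. Seamless linking of heterogeneous databases has become the first thorny problem; search engine robots still crawl in the same old way; and word segmentation built on term-frequency analysis helplessly creates still more obstacles on the way to the target source of information. While people happily immerse themselves in new things such as blogs and wikis, we sadly find that search engines go on doing what they have always done: retrieving a pile of material that you then have to weed or pick through by hand, by eye, or by instinct. In this sense, 100,000 pages retrieved from hundreds of millions of web pages are just as much an embarrassment one has to face. Let me ask quietly: has our retrieval technology really raced far ahead of theory? In such a chaotic world, so-called "information personalization" remains a distant dream.

Turn back to the manual era. Once libraries had taken shape and begun collecting books on a large scale, the first thing they faced was building a system of knowledge classification. For readers, that is, information users, the help a library needs to provide comes at several levels:

1. Providing the materials (books) readers need
2. Providing information channels (ways to obtain books)
3. Providing a book classification scheme and teaching readers the idea behind classifying book information resources

At this level, I personally think that information users can obtain their different "target products" in a "unified" way, which is a shining manual-era portrait of the idea of "automation". So before users use information, they must first learn the rules by which information is organized and preserved; once this is achieved, a great deal of labor and material resources is saved. It can be called a successful application of knowledge management in ancient times and, before information exploded, an excellent pairing of information resource integration with personalization for information users.

Turning back to the present and even the future: faced with a mass of disordered information resources, especially now that the network has become the main way to obtain them, integrating information resources is the top priority. Next comes establishing and implementing rules for obtaining information. Finally, once the ideas of organizing, classifying, storing, and transmitting information have become common rules, people need to use those rules to carry out information queries suited to themselves and ultimately obtain the resources they need. Only that can be called personalized information service.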

-to be continued

- Author: stanley, Tuesday, May 10, 2005, 17:28 | Replies (1) | Trackbacks (0)

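The Post-Information Age: Prospects for Information Personalization (Prologue)

In his 1995 book Being Digital, Negroponte put forward a very avant-garde concept: the post-information age. The information age is represented by computer technology and distinguished from the industrial age of energy consumption. Once "information" entered human society as a resource and began to yield steady benefits, "information personalization" became the hallmark of the "post-information age".

According to Negroponte, the "information age" is the age of computers. Although it still shows traces and features of economies of scale, it is no longer bound by time and space as the industrial age was: people can produce anywhere at any time, and products (not only information) can circulate around the globe in an instant. In the "post-information age", the mass-media audience is often just one person. All products (information included, of course) are highly personalized. In the "post-information age" people can work, study, and live in any time and place. In fact, whether in the information age or the post-information age, information, having been upgraded to a resource, is inevitably tied to the shadow of globalization.

This post, however, is not about information and globalization but about information personalization.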
-to be continued

- Author: stanley, Tuesday, May 10, 2005, 17:22 | Replies (1) | Trackbacks (0)

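Notes from a Trip North: Peking University's Department of Information Management in Its Courtyard

My first visit to Peking University (PKU) began with a blunder: after getting off the bus at Zhongguanyuan, I walked for ages without finding the East Gate. I wandered around for a long while before getting onto campus, then circled many times before finding the library, one of the three landmarks (the pagoda, the lake, and the library) that PKU people sum up with the pun "yi ta hu tu", where I had arranged to meet Li Wu. I had waited only a moment when I saw him waving from far off. Having just been through PKU's brutal doctoral entrance exam, he still looked a little worn out; then again, Beijing really is excessively big, and getting around by bus is painful. Dusk was already showing in the distance, and I began to feel my visit was a bit of an imposition, making him rush about like this.

We met, shook hands, and walked to the dormitory area; after sitting for a moment, the two of us set off for a stroll around the lush campus. PKU's scenery really is fine, worthy of a garden-style campus. Visitors came in an endless stream, people speaking every dialect and wandering everywhere with camcorders in hand. The air was much better than on the streets outside and smelled green everywhere. On a weekend Weiming Lake was very quiet, with people sitting in twos and threes under the drooping willows around the shore; with a light breeze blowing, it was truly pleasant.

Li Wu told me about PKU's buildings and their history as we went, and walking along the tree-lined paths at dusk I took in the mood of the century-old PKU ideal, "freedom of thought, all-embracing tolerance". While enjoying the scenery we also talked about PKU's Department of Information Management. For one thing, before I left for Beijing, Youyuan had asked me several times to be sure to pass on his regards, and had almost decided to come with me; for another, a senior schoolmate of mine doing his doctorate in the department happened to be away on business, so it fell to Li Wu to "take charge" of hosting me. I took up a lot of his time.

The Department of Information Management sits in Courtyard No. 6, full of old-world charm. Around it are the courtyards of the Chinese and History departments, which also shows where the department stands in PKU's disciplinary system. According to Li Wu: first, in terms of program structure, library science and information science are not clearly separated; the PKU school leans toward the humanities, whereas the Chinese Academy of Sciences system has more projects and puts more weight on practice. Second, few of the department's graduates go into library work; most choose other professions. Third, we had a good talk about Li Wu's own field, library science. I should state first that I was not trained in library science and do not know much about it. Whether library science needs to be a science at all is a question still open to doubt; we talked about it for a while, though of course I was mostly a listener. I had often thought about this question before, but the debate has long failed to reach a consensus, with each side insisting it is right.

As an exchange between friends, we touched on several points:

First, compared with large libraries such as the Beijing Library, the National Library, the Tsinghua University Library, and the Shanghai Library, the many small and medium-sized libraries are after all the main force of the knowledge-dissemination pyramid, and under present conditions it is impossible for every library and every library worker to do research. As for most library work, the mere fact that few of PKU's library science graduates work in the profession can be seen as a failure of traditional library education. On the other hand, the fact that so many library science graduates no longer work in libraries can also be seen as the skills, ideas, and methods of traditional library science spreading widely into other fields. But this raises another question: who will do the traditional library work? Li Wu's answer is that library science could shift from a "discipline and scientific system" toward "vocational training". I rather agree. Personally I think the methods and ideas of traditional library science can be abstracted out, since they do not depend on the existence of any particular library. As methods, they can be taught as vocational training for library practice and can also be applied in other areas of society. Put bluntly, what today's library education teaches should not be aimed only at library science students but at everyone, as part of a basic education. (In my own field, information science, I often discuss "Information Science as a Method" with Youyuan, but always inconclusively.)

Second, whether in library science or information science, the current research frontiers, such as ontology, digital libraries and metadata, human-computer interaction, retrieval technology, and knowledge management, all rest on technology. Dr. K also wrote in his blog that libraries lack software people. My understanding is this: on one hand, traditional library education cannot produce people capable of software work; on the other, competition and the division of labor mean that those who can do software work would rather compete in better-paid industries. Hence the first dilemma. If technical training in library and information science is strengthened, many people feel the field will lose its essence, so progress stalls; hence the second dilemma. In fact these topics have been discussed quite enough already, and this trip to PKU was just a matter of reviewing the old to learn the new. So I figure I may as well be an Ah Q for the new era; at least I can keep myself amused.

Time flies. As I write this post today, Li Wu has sat the second-round doctoral exam; I have not contacted him yet and do not know how it went, so let me wish him here a smooth path to four more years at PKU. After returning to Tianjin I was busy with other things and did not write up the PKU trip; Youyuan has been pressing me for days, but I felt my writing was too clumsy and kept putting it off. My head is still rather muddled and has not recovered from the noise of the capital's steel-forged traffic, and my blog posts are always rather messy anyway, so I hope Youyuan will forgive me. Thanks again to Li Wu for his warm hospitality; I look forward to meeting again soon.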

- Author: stanley.cheng, Friday, April 22, 2005, 21:45 | Replies (6) | Trackbacks (0)

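Notes from a Trip North: Catkins Filling the Sky

This was my third time in Beijing. It was spring, the willows were shedding their catkins, and the wind was up, so catkins filled the sky like a heavy snowfall. The swish of traffic rushing back and forth never stopped, a sign of the capital's fast pace of life.

Getting to Beijing was easy enough. A classmate met me at the station, then we went to eat and strolled around a bit. The flying catkins reminded me of autumn in the south. The streets of my city used to be lined with plane trees, and every autumn the wind would blow little bits of fluff off their seed balls, which hurt when they got in your eyes. Now that camphor trees have replaced them, only the fragrance remains, and a walk on a summer evening after rain is a relaxing pleasure. Beijing does not have the moist, fresh air of the south; the air is too dry, the dust too heavy, and above all the city is so big that nothing is convenient. This time, though, my classmate had arranged a place to stay, unlike the time I passed through Beijing at Spring Festival. That time I hung around Tiananmen all night, shivering with cold and consoling myself: I'm being really keen, staying up all night specially to watch the flag-raising. I waited for hours, half frozen, and the flag-raising itself took three minutes. Later, when I told a classmate about that solo experience in Beijing and he asked whether I had felt solemn and stirred, all I could honestly say was: it was so cold!

Changing from the subway to a bus, I rode along Xueyuan Road past the University of Political Science and Law, Beihang, the Language Institute, and many other universities. It is a district dense with campuses and certainly looked lively, but for lack of time I did not go into any of them. Shanghai's two university clusters are mainly in the southwest and in the northeast around Wujiaochang. One summer I spent two months at Fudan; every evening after dinner my buddies and I liked to wander around Wujiaochang in shorts and flip-flops, cutting through the lanes of Fudan or Tongji and dropping into the little roadside bookshops. Going north from Fudan's east gate you reach the University of Finance and Economics, which, like Shanghai Normal University, is said to be Shanghai's "wholesale market" for pretty girls; but the lane leading to Handan Road was always busy with traffic, and I particularly disliked the dust it kicked up. Compared with the sky-wide red dust I saw in Beijing, though, I found myself missing the days of breathing fresh air in Shanghai...

I was staying far from the test center. I scrambled out of bed in the morning and caught a bus toward Zhongguancun; the traffic was badly jammed. I had not checked out the test site beforehand and did not know Beijing, and I thought I had left plenty of time. Yet after getting off I walked from Zhongguancun all the way to Suzhou Street, went back and forth twice, and asked directions countless times before working out that the place was hidden deep in a small alley. I wasted a lot of walking and energy before the exam, and arrived sweaty and nervous. To my surprise, I ran into a Beihang student I had met on the train home at Spring Festival. We had had a good chat then and swapped phone numbers, and now I bumped into him again outside the test room. He was coming out with his girlfriend after the exam; I spotted him at once, we chatted briefly and agreed to get together properly next time I came to Beijing. The pleasure of meeting a friend far from home helped me settle my nerves and marvel at how small the world is, so the tense, heavy mood of the crowd waiting in the lounge did not get to me.

A friendly tip: if you are taking an exam, scouting the test site in advance really matters; don't be overconfident like me and end up spinning in circles when you can't find your way. Also keep your emotions steady and don't scare yourself. Above all, don't pay too much attention to the others in the lounge, many of whom are still cramming from books before going in, which easily sets off collective nerves. Before the computer-based test formally starts there are some background questions, which you may answer or skip; just follow the on-screen prompts. Apart from the two kinds of ID, you need not bring anything into the room: watch, registration card, confirmation number, paper, and pens are all unnecessary. The whole exam is recorded on video and may not exceed two hours, and any non-ID item found to have been brought in will be confiscated; others have learned this the hard way.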


 

- Author: stanley.cheng, Wednesday, April 20, 2005, 14:05 | Replies (1) | Trackbacks (0)

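Mood Notes: Are Failure and Helplessness Sister and Brother?

At noon I was chatting with a buddy far away and said that I felt rather helpless about the year 2004 just past. He consoled me: actually, helplessness is the uncle of success.

That reminded me of the proverb that failure is the mother of success. Then it dawned on me: failure and helplessness must be sister and brother..

I remember a line of Tang poetry: "A thousand sails pass by the sunken boat; ten thousand trees bloom beyond the withered one." Going home for the New Year ought to be a happy thing, yet for some reason I cannot feel cheerful when I think of the departed 2004. Still, the past cannot be chewed over again and again like wax; sighing is all very well, but one has to look "forward"! After a noon of earth-shaking racket, I "saw off" my roommate, who was taking his girlfriend home for the New Year, as far as the dormitory gate and "viciously" wished the two of them bad luck all the way. What I got in return was a sweet smile from his girlfriend and her verdict that I was "without precedent and without successor", which was more than I could take. Having decided that today would be for fun, I climbed back online to grumble a little and hang on in this patch of "pure land" that still seems to be my own.

I have certainly never seen anyone stay cheerful after failing. So my buddy had a point: failure and helplessness really are sister and brother.

After some private correspondence, Youyuan told me that he intends to visit his blog often and carry the blogging spirit forward. I admire such a heaven-shaking feat. I do not know what his internet access at home is like, but he has kept blogging day after day for weeks, which impresses me greatly. So I support Youyuan in spirit; I certainly could not do it myself, which makes blogging helplessness number one. Next, my parents are taking my little nephew to Shanghai for the holiday, so once home I will have to think seriously about the vital matter of where to scrounge meals over the winter break, and also deal with a bunch of buddies with whom I agreed months ago to "compare notes on alcohol tasting". But I hold my drink badly, and the thought of those endless drinking sessions gives me a headache. Incidentally, Youyuan is said to hold his drink very well: he can drink his fill and still stagger back to the dormitory without ever wandering into the washroom by mistake. By comparison I am ashamed; even one small cup of baijiu can leave me out cold under the table for hours. So my capacity for drink is no longer an embarrassment but helplessness number two.

It is not that I cannot get online at home; I just dislike internet cafes and prefer to sit quietly in my own little room, putting on a thoughtful look to scare the neighbors' kids. There is no telephone in the room. When I really need to get online I can connect through my mobile phone's infrared port, but China Mobile's charges are a bit too steep. I tried it once: I had barely connected over infrared and logged in to QQ when the traffic had already passed 200k. A few hours of chatting like that would bankrupt me. Being able to get online and yet, for various reasons, not getting online is helplessness number three. Infrared internet access is simply not something a poor man like me can afford to play with, so I can only laugh it off.

Because I badly need the whole thirty days of the Spring Festival break for my own projects, I have had to offend some old friends. By text message and QQ I told them one by one that I would not be coming out over Spring Festival and would be shut in at home reflecting on my errors: karaoke, tea, get-togethers, count me out. I spent a good deal of time on this and got a lot of uncomprehending looks in return, so I had to explain to each of them in turn to win their understanding. That is helplessness number four.

Enough writing; too much of it tires the hands! And it is time to eat.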

 

- Author: stanley.cheng, Monday, January 17, 2005, 17:26 | Replies (2) | Trackbacks (0)

Notice: Gmail invitations have run out

There are no more Gmail invitations available.

Please DO NOT send request emails to me anymore. I will post an update on my blog if Google gives me more invitations to invite people to join Gmail.

Recently, I've sent more than 10 invitations to applicants from countries such as China, Canada, Turkey, Indonesia, the UK, and so on.

Thanks.

That is all I can do. I tried my best to spread Gmail to the people around me. It's time to say STOP.

:)

By the way, I have to go home for the winter holidays. I wish you and your family a happy New Year and a happy Spring Festival, the coming traditional Chinese holiday.

May GOD bless China.


- Author: stanley.cheng, Monday, January 17, 2005, 15:02 | Replies (0) | Trackbacks (0)

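Tianjin Notes: Clothing, Food, Housing, and Transport

Probably because of the northern climate, I, who grew up in the humid air of the south, found it hard to adjust when I first arrived.

Still, some things I saw stuck in my mind, and since I am about to pack up and go home for the New Year, I have run through everything I can remember once more. With nothing to do in the dormitory anyway, let me ramble on aimlessly.

They say the four basic needs of life are clothing, food, housing, and transport. Before starting on those, I have to say something about the city I live in: Tianjin!

When I first came to Tianjin over a year ago, I arrived just as dawn was breaking. The first odd thing I met was at the station exit: a big scale that looked as if it were for weighing people turned out to be for weighing luggage. Anything over the nationally prescribed 20 kilograms was charged extra freight, and unluckily I was one of the few caught red-handed. The second came when I hurried out of the station to find a taxi and was amazed to see that in this grand municipality directly under the central government, the taxis on the streets were the yellow minivans that had vanished from other cities. The third: after a long ride in the taxi, I asked the driver with great curiosity, "When do we get to the city?" He answered coldly, without expression: "Brother, you're already in the city." Me: "※%¥#-(※%".

Incidentally, information science is not the only thing with its "twelve -izations". I later heard of Tianjin's "four -izations": the mayor has no culture (a pun, since the Chinese word for culture ends in the same character), the city is ruralized, and the roads have turned into markets. The fourth I really cannot remember. There is even a joke made at Tianjin's expense. It goes that bin Laden did not originally plan to bomb the World Trade Center in New York, and first took a plane for a tour of China. Over Shanghai, Comrade Laden marveled at the city's prosperity and decided to invest there for profit, so Shanghai was spared. The plane flew on to Beijing, where Laden saw the grand Tiananmen Square and the portrait of Chairman Mao, along with many mosques, the exquisite Forbidden City, and the cultural heritage, and decided to come back to Beijing as a tourist. Then the plane reached the sky over Tianjin, and the pilot asked Laden anxiously: do we bomb it or not? Laden answered with great impatience: damn it, hasn't this place been bombed already?

Back to clothing, food, housing, and transport. I rarely eat out, so I have no real experience of Tianjin's local snacks. But the food in the university canteen is bad beyond all precedent and, to my surprise, it is sweet. Everyone knows Shanghai cuisine is sweet; I have no trouble with Shanghai food, yet Tianjin's sweetish dishes have scared me off. Every dish must have ketchup in it, and even the noodles are not spared. It all comes out a yellowish mess that not only looks bad but tastes awful. I just find it strange that everyone else eats with such relish. Take the tomato soup: it is really just a pot of boiling water with a few eggs beaten in and a handful of ketchup stirred through, and there is your delicious "soup". In the south nobody would drink such a soup even if it were free, yet my Tianjin classmates actually pay for it. I am baffled, and can only keep a respectful distance!

I won't say much about clothing; I just feel Tianjin people are not as particular about dress as Shanghai people. I suspect that has a lot to do with Tianjin having relatively few pretty girls. A few years ago in Shanghai I spent a summer vacation with a classmate at Fudan and made a special trip to the nearby University of Finance and Economics, and only then did I learn what "beauties like clouds" means. I have never seen such a sight in Tianjin. It is probably a matter of feng shui!

After all this rambling, I should tell a few other anecdotes about the university.

For example, in the canteen you pay in cash rather than by card. The person taking the money grabs the food with the same hand and passes it to the buyer, and neither bats an eyelid; only I, watching dumbstruck from the side, cannot bear the calm with which germs are scattered onto the food, and have to flee. That no tableware is handed out is one thing, but a good many people eat out of flimsy plastic bags, whether the food is warm or scalding, without seeing anything wrong in it. The university banned shared tableware to prevent a return of SARS, yet lets the canteen serve food in plainly toxic non-food-grade plastic bags, which leaves one not knowing whether to laugh or cry. Before leaving I had to wash, and the bathhouse was unlike any I had ever seen: some showered in their underpants, some brushed their teeth while showering, and brushed away contentedly as if no one else were there. To that I must say "respect": the spirit of these students, fighting for every second and treasuring water like gold, is one to study resolutely. I have been to a fair number of places but had never heard of such things, so I can only marvel at my own shallowness...

This is really just a running account, something to talk about with people back home. Truly, each place shapes its own people! With a few hearty laughs, I put down my pen.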

 

- Author: stanley.cheng, Sunday, January 16, 2005, 22:48 | Replies (4) | Trackbacks (0)

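Mood Notes: I Haven't Been an Angry Youth for Years

[Time] January 14, 2005

[Place] Tianjin West Railway Station

[Midnight mood] Fleeting years like water in that one instant

[Abstract] On this cold night, a crowd of shabbily dressed people pressed against the tall glass windows of a run-down railway station. Their longing eyes showed helplessness and determination. What everyone in the long line had in common was a wad of cash in hand and a stack of student IDs. All any of them wanted was a ticket home.

[Keywords of 2004] unpaid wages for migrant workers; lottery fraud; Olympic "glory"; Super League match-fixing; the housing bubble; blogs; students buying tickets

-stanley


Tianjin, an old city of ten million people, has marked the 600th anniversary of its founding as a garrison town. It sits alone in the north of China, and its sparse handful of tall buildings gives it a desolation quite out of keeping with its status as a municipality. I hear many northern cities are like this, apart from Beijing. How run-down the railway station is may be more than most people can imagine. At last I had to go and queue for a ticket in person. I went to the ticket office early, right after dinner, and it was already crowded, though only at the student window, of course. At the other ten or so windows there was not a soul. There were just several dozen to a hundred students, queuing noisily. Word was that at midnight tickets would go on advance sale for trains five days later, that each train was limited to 10 to 20 student tickets, and that the whole West Station sold only 190 student tickets a day.

After asking around, I learned that some people at the front had already been queuing for 10 hours, and there were still a full 6 hours until sales began. To be precise, those at the back had essentially no hope of a ticket, yet they stubbornly stayed in line. A girl who looked like a second- or third-year student cut into the line near the front, and in the end, with the crowd furious, she was pulled out by the police with tears in her eyes. You could see she was fighting hard to keep the tears from spilling over, while all the onlookers were indifferent, myself included!

A year ago, at the start of 2004, unpaid wages for migrant workers were the biggest news before the holiday. We kept hearing of people jumping off buildings because they could not get their pay. The government could not take it any longer; there were news reports and officials stepping in, though I do not know whether it was ever resolved.

The BMW case in Xi'an ended thanks to the toughness of the young man surnamed Liu: the money owed was paid, those who belonged in jail went there, and everyone was happy.

The dozens of Olympic gold medals were quite "worth it" too, though I have no interest in knowing exactly how many. I am just sick of the glory replayed all day on CCTV, while the big-name hosts with their luxury cars and mansions show little enthusiasm for reporting the ordinary lives of children out of school and laid-off workers. I find them damn tiresome.

Super League football goes without saying. I seem to remember people at the university once using student IDs to buy cheap match tickets; now I doubt anyone would go even if the tickets were free. BitTorrent on the dormitory broadband, on the other hand, is hugely popular: everyone downloads recordings of the five major leagues, and the network gets slower and slower.

Housing prices have been creeping up too, and then the inside story of the rise broke: developers build on bank loans, sell on at high prices, and shift the risk to the banks. No matter, our government is rich! I hear that with one stroke of the pen it will inject 30 billion US dollars into the Industrial and Commercial Bank of China to offset bad debts. So that's fine: we have joined the WTO, there is competition, and the bank has struck it rich! Housing in Beijing and Shanghai is clearly beyond ordinary people, especially poor students like us. A senior schoolmate signed with a famous search engine company for 4k a month; going a year without food or drink, he could probably buy only a small toilet outside Beijing's Third Ring Road. And yet plenty of people seem to envy him: wow, four thousand...

Blogs have caught on; everyone has one, rather like pagers back in the day, or like... let's leave that for now. I am still holding my position and watching.

Tickets still would not go on sale until midnight. I was privately glad that I had come fairly early; there was no sparing a thought for those behind. If you are rich, fine! The touts circling nearby can get you any ticket as long as you pay, never mind a train ticket, even a rocket ticket. But we are poor students; if we were not poor, why would we be students? Who says only migrant workers are poor these days? Students are pretty poor too. Who says migrant workers have a hard time buying tickets? Only when you are there yourself do you see that some have it worse.

After six hours in line I finally got my ticket. Those at the front, who had queued for nearly 20 hours, also scattered like birds and beasts, leaving only the hundreds behind them still queuing innocently for a slim hope. Perhaps they would have to come again tomorrow, and even the day after. For that bit of longing for reunion, after standing alone for ten or even dozens of hours, the moment you get the ticket is truly thrilling, and you bound out for joy.. Softly I leave, as softly as I came; I take away not a single cloud, only a train ticket, and leave nothing behind for you!

Really, it is just that I am going home and may not be able to blog for quite a while. Unlike "Shangaoshuichang", whose excuse is frequent power cuts at home, my home has power but nowhere to get online, so this chicken-rib of a blog will have to rest for a while. Simply writing "too busy to blog", as I used to, would be a bit too glib and would draw criticism from all sides. So I take this chance to ramble a little and wish everyone an early happy New Year. But this is absolutely not grumbling, because: "Brother, I haven't been an angry youth for a long time!!"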

 

 

- Author: stanley.cheng, Saturday, January 15, 2005, 14:26 | Replies (6) | Trackbacks (0)

Helpful Terminology

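I never studied English for the field, so I looked up explanations of some specialist terms. Many of these terms can also be found in the WM dictionary, though not necessarily explained at a professional level.

(Reposted) transferred from http://www.fact-index.com/i/in/information_science_glossary_of_terms.html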


An abstract is a brief set of statements that summarizes, classifies, evaluates, or describes the important points of a text, particularly a journal article. An abstract is typically found on the first page of a scholarly article. Because an abstract summarizes an article, it is very useful for either browsing or keyword searching.

An annotation (noun) is an explanatory or critical note or commentary. Annotation (verb) is the process of adding an explanatory or critical note or commentary to a text. Reference lists are often annotated with comments about what each resource covered and how useful it was.

An appendix is a group of supplementary material appended to a text. It is usually related to the material in the main part of the text but not so closely related to it that it should be put into the main text. Put background information and supporting facts in the appendices. An example of a file that should be put in an appendix is a file of detailed charts and graphs of recent research closely related to the paper's main topic.

An archive is a place in which historical documents and other records are preserved. Usually operated by large organizations, archives may or may not be open to the public. The University of Toronto, for example, has an archive that requires a five-story building and contains several climate-controlled vaults. A virtual archive is similar except the documents have no physical presence and seldom have historical value.

An author is an originator of a creative work, particularly a writer of a text. Searching by author can be an effective form of information gathering.

A bibliography is a list of writings related to a specific subject, writings by a specific author, or writings used in producing a specific text.

A bibliographic database is a computer-based list of library resources. Typically each record contains the call number, author, title, publishing information, and other card catalog information.

Boolean logic is the algebraic system, developed by George Boole, that is applied to Boolean expressions that contain Boolean operators such as AND, OR, NOT AND, and XOR (exclusive OR). This binary algebraic system is used primarily in switching circuits and database searches. Boolean operators are not to be confused with proximity operators such as NEAR.
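
As a rough illustration of how Boolean operators work in a database search, here is a minimal Python sketch. It assumes a toy inverted index (the terms, record ids, and function names are all made up for this example) and treats each operator as a set operation; real search systems are far more elaborate.

```python
# Toy inverted index: each term maps to the set of record ids that contain it.
# Terms, ids, and helper names are hypothetical, for illustration only.
INDEX = {
    "library": {1, 2, 4},
    "science": {2, 3, 4},
    "history": {1, 5},
}
ALL_RECORDS = {1, 2, 3, 4, 5}


def postings(term):
    """Return the set of record ids indexed under a term (empty if unknown)."""
    return INDEX.get(term.lower(), set())


def op_and(a, b):
    """Records matching both operands."""
    return a & b


def op_or(a, b):
    """Records matching at least one operand."""
    return a | b


def op_not(a):
    """Records in the collection that do not match the operand."""
    return ALL_RECORDS - a


def op_xor(a, b):
    """Records matching exactly one of the two operands."""
    return a ^ b


library = postings("library")
science = postings("science")
print(sorted(op_and(library, science)))      # both terms
print(sorted(op_or(library, science)))       # either term
print(sorted(op_xor(library, science)))      # exactly one term
print(sorted(op_not(postings("history"))))   # everything without "history"
```

A proximity operator such as NEAR could not be expressed this way, since it needs word positions inside each record, not just membership.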

To browse is to inspect something casually, particularly to use an internet browser to casually inspect Web pages. This involves following links from page to page (also called surfing) rather than searching directly. The main difference between browsing and searching is that with browsing you have very little advance knowledge of what will be on the next page.

A call number is an identification marker used in libraries to categorize and locate books and other resources. Each resource is assigned a combination of letters and numbers which correspond with a location in the library. For example the call number for the Publication Manual of the American Psychological Association is BF76.7 .P83 2001.

A catalog is a complete and systematically organized enumeration of items, particularly the complete enumeration of a library's resources on a set of paper cards (card catalog) or in an electronic database (bibliographic database).

A citation is the quoting or mentioning of a source. All works used in preparing a paper should be cited.

A citation search is a search, by name, of all references to an individual. Some databases have a specific citation search option, otherwise you use a full-text search. For an example of a database that has a specific citation search option go to the University of Michigan Library Database.

Controlled vocabulary refers to limiting one's searches to the exact subject headings used by the Library of Congress. An example would be "History - Bibliography etc". Some indexes, like Wilson Indexes, have their own system of headings and hence their own controlled vocabulary.

Copyright is the legal right granted to a copyright holder for the exclusive sale, distribution or reproduction of a creative work. It is a form of intellectual property that prevents others from using a creative work without consent of the owner. For example, Thomas Mann holds the copyright on the book The Oxford Guide to Library Research.

A cross reference database is a collection of records that have one or more fields that reference other related records. These connections (for example between "marketing" and "promotion") make browsing very productive and allow related-items searches.

A descriptor is an index term used to identify a record in a database. It can consist of a word, phrase, or alphanumerical term. It can describe the content of the record or be an arbitrary code. When a descriptor is descriptive, it can be an effective search parameter.

A dissertation is a written treatise or thesis, usually lengthy, usually authored by a doctoral candidate, usually directed by a faculty advisor, and usually intended to advance the state of the art in a given discipline. There can be an oral component to the process, in which the dissertation must be defended in front of critical judges. Dissertation searches are valuable because of their currency.

Document delivery refers to the transfer of a database record, or other information resource, to the end user. It can involve direct internet or email transfers, CD delivery via mail, paper delivery via mail, or delivery via interlibrary loan.

An edition is a version of a published text, or all the instances of a published text issued at a given time. An example would be the 2nd edition (2001).

Entry (verb) is the inclusion of a record in a database, or the inclusion of a field in a record. An entry (noun) is a record so entered. An entry word is the headword in a dictionary, encyclopedia, or glossary. This author's entry of this entry results in an entry word of "entry".

A field is an element of a database record. It contains one type of information and has a unique address. All or most other records in the database have a similar field. An example is the field "name".

A free-text search(1) is a simple word or character search, usually with very few Boolean, proximity, or scope limiting options. It is simple and fast. A free-text search(2) is a search in which all the entries are freed from their original format of presentation. Text that originated in a journal article looks much the same as text that originated in a glossary or chat room. A free-text search(3) refers to the deliberate limiting of the scope of the search parameters to include only records that are available free of charge.

A full text database is a collection of records containing complete versions of the original source, rather than just bibliographies, abstracts, or abridgements. An example is WilsonSelectPlus. A related concept is that of a full text search which searches only sources that are complete, and ignores those records that are mere abstracts or descriptors.

An interlibrary loan is when patrons of a library arrange for resources from other libraries to be sent to their library. This resource sharing system is being promoted by the International Federation of Library Associations.

A journal is a periodical publication that presents articles in a specific subject area. They are usually scholarly. An example is The Journal of Marketing Research.

Keyword searching is the searching of a database using a significant word from the title, abstract, or descriptor of a record as a point of reference to the article's overall content. This type of search is usually the best type to start with.

A literature search is the search of traditional resources including books, catalogs, encyclopedias, dictionaries, atlases, bibliographies, handbooks, manuals, periodicals, indexes, yearbooks, gazetteers, directories, chronologies, almanacs, and guides. It is useful because it opens up a vast world of pre-1995 information that is not available online.

A menu is a list of options from which a computer user can choose. This saves the user from having to memorize a set of commands. It also reduces the decision to the basic information required (note the etymology: the French word menu means small or detailed).

A monograph is a written document (i.e., graph) on a single subject (i.e., mono). It is usually scholarly in nature and of approximately book length. Monographs are valuable information sources because of their depth in a limited subject area.

Operators are symbols that represent operations. In computer science there are binary and unary operators depending on the number of elements or records an operator acts on. In database searching there are Boolean and Proximity operators. Boolean operators are a subclass of logical operators (Logical operators are binary operators that manipulate data at the bit level.). A Boolean operator manipulates the binary value "true/false".

An online catalog is a record of the holdings of an institution (usually a library), or group of institutions, that can be found on the internet. An example is UCSD's "Roger".

A periodical index is an alphabetized listing of works that are published at regular intervals of more than one day.

Plagiarism is passing off the work of others as your own, without giving credit to the original sources.

A primary source is the originator of a primary record. A primary record is a resource created by the same people that initially experienced or used it. They create the records for their own purposes, records that often remain unpublished. Sometimes they witness an event, sometimes they are involved in an event, and sometimes the record is directly created by the event.

A review article is a periodical article that describes, analyzes, and criticizes a book, journal article, movie, or other creative or academic work.

A scholarly article is an article, usually in a scholarly journal, that deals with academic subject matter at an advanced level.

A record is an individual entry in a database and simultaneously a collection of related data fields. Each field, although related, is of a different type from all the other fields in the record (and this is what differentiates a record from an array).

A search strategy is a generalized set of techniques used in the process of determining what information you currently have, determining what information you need, and determining how to get it. Some possible strategies include: controlled vocabulary searches, specific entry searches, browsing, general scanning, broad to narrow searches, adjacent item browsing, subject tracings searches, keyword searches, citation searches, literature searches, cross reference searches, and chat room questions and other direct people contact searches.

A subject heading is the name of the category that a record is included under. For example, the record "natural frequency of vibration" might be found under the subject heading of "Acoustics", and acoustics, in turn, might be found under the subject heading "Physics".

A subject directory is a hierarchical grouping of related subject headings. The tree structure shows relationships between subject headings. They can be found either inside a database or separate from a database.
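
To make the record/field/array distinction concrete, here is a small Python sketch. The `BibRecord` class and its field names are assumptions chosen for illustration (they loosely follow the bibliographic fields mentioned in this glossary), not any real catalog schema.

```python
from dataclasses import dataclass, fields


@dataclass
class BibRecord:
    """A hypothetical bibliographic record: named fields that may differ in type."""
    call_number: str
    title: str
    year: int


# One record: related fields, each holding a different kind of information.
record = BibRecord(
    call_number="BF76.7 .P83 2001",
    title="Publication Manual of the American Psychological Association",
    year=2001,
)

# An array, by contrast, holds many values of the same kind, addressed by position.
publication_years = [1999, 2001, 2004]

# List each field of the record with its declared type and its value.
for f in fields(record):
    print(f.name, f.type.__name__, getattr(record, f.name))
```

Each field has a name (its "address") and its own type, whereas the array elements are interchangeable values of one type.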
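
The tree structure of a subject directory can be sketched as nested dictionaries. In the Python example below, the directory contents ("Optics", "History", "Bibliography") and the `find_path` helper are hypothetical; only the Physics/Acoustics pairing comes from the subject heading entry above.

```python
# A hypothetical subject directory: each heading maps to its narrower headings.
DIRECTORY = {
    "Physics": {
        "Acoustics": {},
        "Optics": {},
    },
    "History": {
        "Bibliography": {},
    },
}


def find_path(tree, heading, trail=()):
    """Return the chain of headings from the top of the tree down to `heading`,
    or None if the heading does not appear in the directory."""
    for name, narrower in tree.items():
        here = trail + (name,)
        if name == heading:
            return list(here)
        found = find_path(narrower, heading, here)
        if found is not None:
            return found
    return None


print(find_path(DIRECTORY, "Acoustics"))
print(find_path(DIRECTORY, "Chemistry"))
```

Walking the tree from the top mirrors a broad-to-narrow search: each step down moves to a more specific heading.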

Generally a thesaurus is a book of synonyms, often also containing antonyms. An example is Roget's Thesaurus. In database searching, a thesaurus strategy is to use multiple iterations to search for related words and generate results. The database will often suggest synonyms and related words to try.

A thesis statement is a one or two sentence description of the main point you intend to make in the paper or dissertation. It is usually found very near the beginning of the work. An example is "In this paper I will provide evidence that...".

Truncation is the shortening of a search word, field, or record. In the case of truncating a search word, this is a strategy used to search among multiple variants or spellings of a word. The asterisk (*) is generally used as a wildcard to replace a letter or letters. An example is invest* which will pick up instances of invest, investor, investments, investigations, etc. In some databases the asterisk must be accompanied by a number that defines the number of characters that can be truncated.
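
A truncated search word can be emulated with a regular expression. The Python sketch below is only an approximation under stated assumptions: it handles a single trailing asterisk, matches whole words case-insensitively, and uses made-up function names and a made-up word list; actual databases each define their own wildcard rules.

```python
import re


def truncation_to_regex(query):
    """Turn a query such as 'invest*' into a compiled pattern.
    Only a trailing asterisk is treated as a wildcard in this sketch."""
    stem = re.escape(query.rstrip("*"))
    if query.endswith("*"):
        # The stem followed by any number of further word characters.
        return re.compile(rf"^{stem}\w*$", re.IGNORECASE)
    # No wildcard: the word must match exactly.
    return re.compile(rf"^{stem}$", re.IGNORECASE)


def search(query, words):
    """Return the words that match the (possibly truncated) query."""
    pattern = truncation_to_regex(query)
    return [w for w in words if pattern.match(w)]


WORDS = ["invest", "investor", "investments", "investigations", "inverse", "reinvest"]
print(search("invest*", WORDS))
print(search("invest", WORDS))
```

Note that "investigations" is matched too, which shows the usual cost of truncation: a shorter stem raises recall but can pull in unrelated words.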

- Author: stanley.cheng, Tuesday, January 11, 2005, 22:23 | Replies (1) | Trackbacks (257)

An Immature Study on Searching Behavior

These are some points drawn from my immature reflections on search behavior.

                               by stanley


1. Search for specifics
By this I mean that when we carry out a search, we care about nothing but the relevance of the results, which should satisfy the information user's needs.

2. Search for characteristics of information inside and outside          
(unfinished)

3. Search for types
Here I want to explain that the type of information refers not only to the information carrier but also to the modality of the information. For instance, a web page and a digitized book can both be transmitted over the internet, yet they are completely different types of information. Google's advanced search offers several options, such as selecting the file type, the language, and so on: the results may be in English or Chinese, or in PDF, DOC, or picture formats. Confining your search strategy to a smaller range should be a helpful way to reach the information you want.
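
The idea of confining a search to a smaller range by type can be sketched as a simple filter. In the Python example below, the result list, its field names, and the `narrow` function are all hypothetical; it only mimics the effect of choosing a file format and a language in an advanced search form.

```python
# Hypothetical search results, each tagged with a file format and a language.
RESULTS = [
    {"title": "Document A", "format": "pdf", "language": "en"},
    {"title": "Document B", "format": "html", "language": "zh"},
    {"title": "Document C", "format": "pdf", "language": "zh"},
    {"title": "Document D", "format": "doc", "language": "en"},
]


def narrow(results, **criteria):
    """Keep only the results whose fields equal every given criterion."""
    return [
        r for r in results
        if all(r.get(field) == value for field, value in criteria.items())
    ]


# Each added criterion shrinks the result set further.
print([r["title"] for r in narrow(RESULTS, format="pdf")])
print([r["title"] for r in narrow(RESULTS, format="pdf", language="zh")])
```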


4. Search for entries
At this level, I regard an entry as a channel into a database made up of different information. A website is a database; as a simple model, if there are thousands of websites, what we call a 'website' becomes an entry. By the same token, a blog is an entry too.


5. Search for relations
Obviously, from the user's standpoint, there are countless relations among different pieces of information. A hyperlink is just one such connection; a citation is a typical way to mark the relation between two articles. Moreover, from earlier work on the theory of six degrees of separation, we can easily see that there are many relationships among information and users. For example, a centralized blog such as oldhuai.blogchina.com or www.qiantu.org is not only a powerful place to distribute articles but also a useful way for users to find related topics and other people engaged in the same subject. In other words, it brings users closer together and gives them easier access to a certain domain. A theory consisting only of hyperlink analysis and citation analysis is not sufficient for in-depth research. I would like to pursue this subject further.
 

ANOTHER special topic:  Approaching-search behavior        


I have wanted to study search behavior for a long time. Suppose a person does not know the target explicitly; he has to do an approaching search. I wonder how this can be done: how does he plan such a task, and how do the steps work? These questions puzzle me all the time. In this case, the person can pick some outside characteristics as preliminary search keywords. Then, based on the results, he will be able to find relations closer to the target, and the clue or clues will point more directly and visibly to the final target. However, this is an analysis of a single user. We cannot simply conclude that all users behave this way. With a large number of users, a complicated network of behaviors will appear. The five points above, and possibly more, are not independent; the five levels and user behaviors form a whole whose relationships we cannot cut apart. I realize that we could analyze and model the phenomenon with mathematical methods. It is one of my favorite topics. By the way, these viewpoints are immature, so please don't pay them much attention. There is only one reason I post this here: I am practicing my written English. LOL. And I have to comment on my own views: TOO SIMPLE, SOMETIMES NAIVE. --- excerpt from a famous person's words :)

- Author: stanley.cheng, Tuesday, January 11, 2005, 22:01 | Replies (0) | Trackbacks (0)
