| Ce 的个人资料piggyzone照片日志列表 | 帮助 |
|
|
10月2日 应该多几个这样的教授!下面是浙江大学一个教授的演讲:
在中国这个发展中国家,你能建10所世界一流大学,那美国有多少所?日本有多少所?现在的实际状况是:世界上前200所大学,中国一所都排不进!在亚洲能排上几所?我到国外去看了以后,感到要将浙大建成世界一流大学就像共产主义理想。” “以前说‘无知无畏’,现在却是‘无知才无畏’,许多企业把浙江省技术监督局、科委的人请来吃一顿饭、喝一点酒,他就给你签个字,再把我们这些教授胁迫到那里去,给你盖个章,然后就是‘填补国内外空白’、‘国际先进水平’,写论文则是‘国际领先水平的研究成果’、‘首次科学发现’等等,这都是目前非常严重的问题!作为一个大学教授,我深深地为此担忧!这不是我们的责任,是我们的领导无知,是他们倡导了这个主流。我知道在座的处长或老总日子很难过,因为你们不写这样的报表,就拿不到钱,项目就得不到批准。教授也同样如此,天天写报告,而不是在实验室静下心来好好搞研究,这是很严重的!” “我们国家的现实和发展就是这样:凡是依赖不成的,我们自己都能搞得像模像样,比如二弹一星;凡是能够引进的,就都搞不成……现在很多合资企业就这样,就是卖点东西,而没有去考虑这些深层次的东西。殊不知,这就是社会的恶性循环!” “我认为:语言、计算机就是工具。中国的外语教授讲英语还不如美国卖菜的农民!怎么看待这个问题?日本博士、德国教授说不出英语的多得是!我们怎么能说一个人不会说英语就是文盲呢?语言就是一个工具!你没有那个环境,他怎么能讲这个语言呢?……如果我是教育部长,我要改革二件事:第一,取消六级考试,你一个研究生连中文一级都不及格,你英文考六级干什么呢?看看研究生写得论文,自己的民族文化都没有学好,天天考英语──打勾、托福——打勾、GRE——打勾,英文考出很高的分,可哪个写的英文论文在我面前过得了关呢?过不了关!这样培养出来的人能干什么?自己搞的专业一点都没学好!……说不会计算机就是文盲,这又是一个误区!我现在是教授,我顾不上搞计算机!” “你看我,从高中开始学英语,大学学,硕士学,博士学,花了我多少精力!你说中国人怎么做得出高科技的研究成果?我这几天就教训我手下的几位女学生,问她们在干什么,看不到人影,一天到晚考这样、考那样的,到美国去干什么?在国内要干的事多着呢!你整天考英语,美国人连报个名都要收你们的钱,日本人也是如此,中国学生到日本去要交手续费,到日本留学是为日本人打工,好不容易挣点钱交了学费,读完博士在日本的公司就职,当劳动力,挣了一笔钱后要回国了就买了日本家电,把钱全给了日本人。你们都没有注意这件事,这里面都是经济问题。这就是素质教育到底是什么。” “中小学的教育就是听话,老师管干部,干部管同学,孩子们都学会了成年化的处世方式,这是害人啊!这样强迫性地做了一些好事后,没有把做好事与做人准则结合起来去培养,而只学会了拍马屁、讨老师喜欢、说成人话。上次电视上就曾经播出,一个小孩得了个奖,主诗人问他最愿意说什么,他说:‘我最愿意跟江爷爷说:我向你报告!’‘江爷爷’是谁?还不是老师教的!孩子们在中小学活得很累,到了大学就没人管了,所以就要玩、就要谈恋爱。” “我们有很多同学成绩好,却什么都做不了。在我们大学像我这种程度的人,招博士生是从来不看成绩的,成绩算什么!现在我从事的这个领域在中国有叁个杰出的人才,当初在读研究生时都补考过,而成绩考得好的几个人却都跑到美国去卖中药了,这说明了什么问题?作老板的可不能这样啊! ……人才的梯队一定要合理,而不要认为教授就是万能的、博士就是万能的。中国的教育体系就是让每一个老百姓都充满希望和理想,教育孩子们要树立远大的理想。实际上,人的能力是不一样的,扫地能扫好,也应该受到尊重;打扫厕所能打扫干净,也应该受到尊重,不能动不动就要高学历。我要提醒的是:在国外可不是这样,美国、日本的博士就很难找到工作,为什么?因为老板心疼钱,招了博士要给他高工资,而他能做什么用呢?这是个具体问题。” “科技到底该干什么?高科技到底该干什么?如果我是科技部长,该玩的就玩,就像陈景润,他就是玩!陈景润如果是处在今天的中国,他绝对是要去讨饭的,因为他不会去搞产业化,他的英语也不好,他说话都不流利,中文都讲不好,按现在“标准”,他是个文盲,还谈什么教授! 日本人就是喜欢美国人,我跟日本人说:你们这个民族爱谁,谁就要向你们扔原子弹。日本人就是喜欢黑人也不喜欢中国人。……我特别对我们的女教授、女同学说:在日本人面前一句日文都不要讲,会也不要讲,日本人一听说你讲英文,特别是看到中国女孩讲英文,腿都要发软,这是真的!” “中国人为什么这些年都往外跑,最重要的是要让国民自己爱自己国家……如果我是杭州的市长,我绝对不是狭隘的民族自尊心──如果杭州有什么灾难,我就首先把杭州的老百姓安排在香格里拉,让外国人在外面排队(掌声)!这样,你才会让你的国民爱自己的国家!一个日本的农民跑到峨嵋山去玩,骨头摔断了,你就用中国空军的直升飞机去救他,而在日本大学一名中国留学生在宿舍里死了7天才被发现;名古屋大学的一对中国博士夫妇和孩子误食有毒磨菇,孩子和母亲死了,父亲则是重症肝炎,在名古屋大学医学院的门诊室等了12个小时,也没有一个日本教授来看望!而你们为什么还要这么友好,以为自己很大度,实际上是被人家耻笑,笑你的无知!笑你们这个民族!我们不能这样!我们的领导人跑到国外去访问,看到有几个人在欢迎他们,就感到挺有面子;而外国来了个什么人物,都是警车开道,这究竟是怎么回事?这让我们中国人感到是自豪还是悲哀?所有这些,对教育工作来讲,都是深层次的问题。所以我经常讲,我作为一位自然科学工作者,我教育我的学生,首先是学会做人,没有这些,你学了再高分,外语再好都是花架子,你不是一个完整的人!” “一个观念或是一个问题:是不是技术越新越好?今天谈的就都是提醒大家的……技术并不是越新越好,技术要有储备。日本的企业现在卖的东西大都是10年或15年前的技术,好东西他不拿出来,他要等到现有的技术把成本收回并获得尽可能高额的利润以后才会拿出来。其次,我们的对手什么时候推出新东西时,我们才会出手。不要以为你今天好不容易搞了生产线,明天又有新的了,你的钱还没赚到就换新的,有什么用?我这次到日本刚好谈到悬浮列车──即使何先生在我也要说,这是中国人又在玩高新科技。悬浮列车目前在理论上都还不成熟。日本现在最完善,最经济的就是新干线!从经济和市场的概念来讲,越先进的东西,风险越大,有可能得到的回报就越少!” 8月5日 每周报告 BAYESIAN DECISION THEORY (1)Bayes formula: P(x|y) = P(y|x)P(x) / åxÎX P(y|x)P(x) = P(y|x)P(x) / P(y)
Bayes decision rule: To minimize the overall risk(R=ò R(a(x)|x) p(x)dx ), compute the conditional risk R(ai|x) = åλ(ai|wj)P(wj|x) for each i, and then select the action ai for which R(ai|x) is minimum. The resulting minimum overall risk is called the Bayes risk, R*, and is the best performance that can be achieved. (λ(ai|wj) is loss function )
Tow category classification Let λij = λ(ai|wj) We decide w1 if (λ21 – λ11) P(x|w1)P(w1) > (λ12 – λ22) P(x|w2)P(w2) Assume λ21 > λ11 To decide w1 if P(x|w1) / P(x|w2) > (λ12 – λ22)P(w2) / (λ21 – λ11)P(w1) Thus the Bayes decision rule can be interpreted as calling for deciding w1 if the likelihood ratio (P(x|w1) / P(x|w2)) exceeds a threshold value that is independent of x.
Minimum-error-rate classification If true state of nature is wj, decision is correct if i=j. Zero-one loss: when i=j λ(ai|wj)=0 i¹j λ(ai|wj)=1 then when i¹j R(ai|x) = åλ(ai|wj)P(wj|x)= R(ai|x) = åP(wj|x) =1- P(wi|x) P(wi|x) is the conditional probability that action ai is correct. For minimum error rate: Decide wi if P(wi|x) > P(wj|x)
Classifiers, discriminant functions, and decision surfaces
The multicategory case The classifier is said to assign a feature vector x to class wi if gi(x) > gj(x) g(x) is discriminant function A network representation of a classifier: with d inputs and c g(x)s, then it determines which of the discriminant values is the maximum, and categorizes the input pattern accordingly.
Bayes classifier: gi(x) = -R(ai|x) gi(x) is maximum when conditional risk is minimum Minimum-error-rate classifier: gi(x) = P(wi|x) gi(x) is maximum when P(wi|x) is minimum gi(x) can be replaced by f(g(x)) when f is a monotonically increasing function, the result is unchanged. The effect of any decision rule is to divide the feature space in to c decision regions, R1,…, Rc. If gi(x) > gj(x) for all i¹j, then x is in Ri.
The tow-category case Dichotomizer (tow-category classifier) use single discriminant function g(x) = g1(x) – g2(x) Decide w1 if g(x)>0; otherwise decide w2 For minimum-error-rate classification: g(x)= P(w1|x) - P(wi|x) or g(x)= ln(P(x|w1) / P(x|w2)) + ln (P(w1)/ P(w2)
The normal density
Multivariate density p(x) = 1 / (2π)d/2|S|1/2 exp[-1/2(x-m)t S -1(x-m)] p(x) ~ N(m, S). x is d-component column vector, m is the d-component mean vector(m=òxp(x)dx), S is the d-by-d covariance matrix(S=ò (x-m) (m- x)t p(x)dx) σ ij = e[(xi-mi)(xj-mj)] Linear combination: p(x) ~ N(m, S), y=Atx, then p(y) ~ N(Atm, AtSA) The multivariate normal density is completely specified by the elements of the mean vector m and the independent elements of the covariance matrix S. (m determine the centre and S determine the shape)
Discriminant functions for normal density
If P(x|wi) ~ N(mi, Si) Discriminant function gi(x) = -1/2(x-mi)t Si -1(x-mi) – d/2 ln2π – 1/2ln|Si| + lnP(wi)
Case 1: Si = σ2I gi(x) = -|| x-mi ||2 /2 σ2 + lnP(wi) || x-mi ||2 = (x-mi)t(x-mi) which is Euclidean norm
gi(x) = -[xtx - 2mitx + mitmi]/2 σ2 + lnP(wi) xtx is the same for all i, making it an ignorable additive constant, so we get the equivalent linear discriminant function : gi(x) = witx + wi0 wi= mi/ σ2 wi0 = -mitmi /2 σ2 + lnP(wi) which is called threshold for the ith category A classifier uses linear discriminant function is called a linear machine.
Because the discriminants are linear the resulting decision boundary are hyperplanes, defined by gi(x) = gj(x). In this case, wt(x-x0) = 0; Where w = mi - mj x0 = (mi + mj) - σ2/||mi - mj ||2 ln P(wi)/ P(wj) (mi - mj) If P(wi) = P(wj), x0 is halfway between the means, and the hyperplane is the perpendicular bisector of the line between the means. If P(wi) = P(wj), x0 shifts away from the more likely mean.
If P(wi) are the same for all c classes the lnP(wi) term can be ignored. So the decision rule is measure the Euclidean distance || x-mi ||, assign x to the nearest mean. Such classifier is called a minimum distance classifier.
每周报告 BAYESIAN DECISION THEORY (2)Case 2: Si = S
gi(x) = -1/2(x-mi)t Si -1(x-mi) + lnP(wi)
If P(wi) are the same for all c classes the lnP(wi) term can be ignored. So the decision rule is measure the Mahalanobis distance (x-mi)t Si -1(x-mi), assign x to the nearest mean. Ignore the quadratic term xtS-1x, we get the linear discriminant function again: gi(x) = witx + wi0 wi = S-1mi wi0 = -1/2mit Si -1mi + lnP(wi)
The resulting decision boundaries are hyperplanes, same as case 1, but are generally not orthogonal to the line between the means.
Case 3: Si = arbitrary
gi(x) = xtWix + witx + wi0 Wi = -1/2Si -1 wi = Si -1mi wi0 = -1/2mitSi -1mi – 1/2ln|Si| + lnP(wi)
In the tow category case, the decision surfaces are hyperquadrics. Even in one dimensional, the decision regions need not be connected.
Bayes decision theory- discrete features
x can assume only one of m discrete values v1…,vm. So the probability density function P(x|wj) becomes singular. Then Bayes formula involves probabilities rather than probability density. P(wj|x) = P(x|wj) P(wj) / P(x) where P(x) = åw P(x|wj) P(wj) The conditional risk is unchanged in the discrete case, and the Bayes rule remains the same: select the action ai for which R(ai|x) is minimum.
Independent binary features
For tow category problem Let x = (x1,…, xd), xi are either 0 or 1, and conditionally independent. pi = Pr [xi = 1 |w1], so 1- pi = Pr [xi = 0 |w1] qi = Pr [xi = 1 |w2], so 1- qi = Pr [xi = 0 |w2] For conditionally independence, P(x|w1) = Pi pixi(1- pi )1- xi P(x|w2) = Pi qixi(1- qi )1- xi For The tow-category case: g(x) = ln(P(x|w1) / P(x|w2)) + ln (P(w1)/ P(w2) Then we get g(x) = åi wi xi + w0 wi =ln [pi(1- qi )/ qi(1- pi )] w0 = åi ln (1- pi )/ qi(1- qi ) + ln (P(w1)/ P(w2) We decide w1 if g(x)>0, otherwise decide w2
The magnitude of wi indicates the revalence of a “yes”(1) answer for xi If pi = qi, xi gives no information, wi =0 If pi > qi, 1- qi > 1- pi, wi >0, and wi gets larger as pi gets larger. If pi > qi, wi < 0 and |wi| gets larger as pi gets larger
P(wj) appear only through the threshold w0 increasing P(w1) increases w0 biases the decision in favour of w1.
8月3日 University of WollongongThe University of Wollongong’s status as one of Australia’s top research and teaching institutions has been confirmed today (August 3) with five-star ratings across key categories in the nation’s authoritative guide to choice of university. The 2006 Australian Good Universities Guide is now being launched in newsagents and bookshops in Australia and overseas, and the University of Wollongong has achieved an impressive maximum five stars in six of the key categories in the independent Guide. Only the top 20 per cent of universities can be awarded a five-star rating in any one category. 7月29日 Image acquisition, quantization and perception——成果汇报
Elements of visual perception
Image formation in the eye: The lens of eye is just like an optical lens, but is more flexible. When we see an object, we get a retinal image primarily in the area of the fovea. Perception then takes place by the relative excitation of light receptors, which transform radiant energy into electrical impulses that are ultimately decoded by the brain.
Brightness adaptation The range of light intensity levels to which the human visual system can adapt is enormous. But the visual system cannot operate over such a range simultaneously. Rather, it accomplishes this large variation by change in its overall sensitivity. So the total range of distinct intensity levels it can discriminate simultaneously is rather small when compared with the total adaptation range. For any given set of conditions, the current sensitivity level of the visual system is called the brightness adaptation level.
At any specific adaptation level, brightness discrimination is poor at low levels of illumination, and it improves significantly as background illumination increases.
Light and the electromagnetic spectrum
Spectrum (wavelength from short to long, energy from high to low): Gamma rays, Hard X rays, Soft X rays, Ultraviolet, Visible spectrum, Infrared, Microwaves, Radio waves.
λ=c/ν; E=h ν;
A body that reflects light and is relatively balanced in all visible wavelengths appears white to the observer. However, a body that favors reflectance in a limited range of the visible spectrum exhibits some shades of colour.
The term gray level generally is used to describe monochromatic intensity because it ranges black to grays, and finally to white.
Image sensing and acquisition
We use sensors to transform illumination energy into digital image. The idea is simple: Incoming energy transformed into a voltage by the combination of input electrical power and sensor material that is responsive to the particular type of energy being detected. The output voltage waveform is the response of the sensor, and a digital quantity is obtained from each sensor by digitizing its response.
There are three principal sensor arrangements:
Using a single sensor: Images are generated by a single sensor combined with mechanical motion.
Using sensor strips: The strip provides imaging elements in one direction. Motion perpendicular to the strip provides imaging in the other direction.
Using sensor arrays: The arrays are tow dimensional. A complete image can be obtained by focusing the energy pattern onto the surface of the array.
Simple image formation model We denote image by two-dimensional functions of the form f(x, y). The value of f at spatial coordinates (x, y) is a positive scalar quantity. f(x, y)=i(x, y) r(x, y) i(x, y) is illumination components, 0 < i(x, y)< ∞. r(x, y) is reflectance components, 0 < r(x, y)< 1.
We call the intensity of a monochrome image at any coordinates (x0, y0) the gray level (l) of the image at that point. That is: l=f (x0, y0) lÎ[0, L-1] l = 0 is considered black and L-1 is considered white (L=2k). All intermediate values are shades of gray varying from black to white.
Image sampling and quantization
To create a digital image, we need to convert the continuous sensed data into digital form. This involves two processes: sampling and quantization. Digitizing the coordinate values is called sampling (divide the image). Digitizing the amplitude values is called quantization (divide the gray level value).
The quality of a digital image is determined to a large degree by the number of samples and discrete gray levels used in sampling and quantization. When using a single sensor to get images, practical limits are established by imperfections in the optical devices. And when a sensing strip or a sensing array is used for image acquisition, the number of sensors establishes the limits of sampling.
Expressing sampling and quantization in formal mathematical terms: Let Z and R demote the set of real integers and the set of real numbers, respectively. The sampling process can be viewed as partitioning the xy plane into a grid, with the coordinates of the centrer of each grid being a pair of elements from the Cartesian product Z2,which is the set of all ordered pairs of elements (zi,zj),with zi and zj being integers from Z. Hence, f(x, y) is a digital image if f(x, y) are integers from Z2 and f is a function that assigns a gray-level value to each distinct pair of coordinates (x, y). This functional assignment is the quantization process. If the gray level also are integers, Z replace R, and a digital image then becomes a 2-D function whose coordinates and amplitude values are integers.
Representing digital image Assume that an image f(x, y) is sampled so that the resulting digital image has M rows and N columns. So the values of the coordinates (x, y) now become discrete quantities. We use integer values for these discrete coordinates. Thus, the values of the coordinates at the origin are (x, y) = (0, 0). The next coordinate values along the first row of the image are represented as (x, y) = (0, 1). So we can use an M*N matrix to represent the image. Each element of this matrix array is called pixel, which represent a pair of coordinates and the correspondent gray-level value.
The number of gray levels typically is an integer power of 2: L=2k The number b is the bits required to store a digitized image. b=M*N*k When M=n, b=N2k. When an image can have 2k gray levels, it is common practice to refer to the image as a “k bit image”.
Spatial and gray-level resolution Spatial resolution is the smallest discernible detail in an image. It is common to refer to an L-level digital image of size M*N as having a spatial resolution of M*N pixel and a gray-level resolution of L. Basically, decreases in k and n will reduce the quality of an image.
Zooming and shrinking digital images Zooming requires tow steps: the creation of two pixel location, and the assignment of gray-level to those new locations. First, lay an imaginary new grid over the original image. Then, we look for the closest pixel in the original image and assign its gray level to the new pixel in the grid. Finally, expand it to the specified size to obtain the zoomed image.
A more sophisticated way of accomplishing gray-level assignment is bilinear interpolation using the four nearest neighbours of a point. Let (x’, y’) denote the coordinates of a point in the zoomed image, and let v (x’, y’) denote the gray-level assigned to it. The assigned gray level is given by: v (x’, y’)=ax’+by’+cx’y’+d The four coefficients are determined from the four equations in four unknowns that can be written using the four nearest neighbours of point (x’, y’).
Image shrinking is done in a similar manner as zooming. The equivalent process of pixel replication is now column deletion.
Some basic relationships between pixels
For a pixel p(x, y): 4-neighbors of p, N4(p): (x+1, y), (x-1, y), (x, y+1), (x, y-1) (horizontal and vertical neighbours) Diagonal neighbours of p, Nd(p): (x+1, y+1), (x+1, y-1), (x-1, y+1), (x-1, y-1) 8-neighbors of p, N8(p): N4(p)+ Nd(p)
Let V be the set of gray-level values used to define adjacency 4-adjacency: two pixels p and q with values from V are 4-adjacent if q is in the set N4(p). 8-adjacency: two pixels p and q with values from V are 4-adjacent if q is in the set N8(p).
5月17日 出国?今天我的签证通过了。心情没有多么好。在论坛上发签证信息的时候读到了一篇前辈的文章,如下: 半夜2点了,我刚刚在办公室准备完下周一的presentation。最近3个星期可能是我在澳洲工作以来最困难的时光。三个星期来接连丢掉了两个很大的政府项目,面对下午开会时同事们颓丧的气氛,真让人没法不难过,虽然marketing的人负有主要责任,但我真没准备第一次当project leader就如此惨淡收场。我们这次依然最多胜率五成,我们的对手背靠印度第二大的软件巨头Infosys, 我们在印度的合作伙伴只有infosys规模的四分之一,甚至我们在西澳地矿局中的内线最近也刚retire, 但是我们没有理由怯懦,因为我记得《阿甘正传 》中那位慈祥的母亲临终前对儿子的嘱咐:“You have to do the best with what god gave you.” 5月11日 十大禁片 No.1010. 《我唾弃你的坟墓》i spit on your grave (1978年首映) 导演: meir zarchi 放映时间: 100 min / argentina:95 min / australia:90 min / uk:96 min (cut) 出品国家: usa 不知道为什么这部片子可以入选,我对它的评价是一无是处。不吓人,不符合逻辑,几乎没有情节,对白都很少。一共5个演员,表现都很业余,特别是那女的从头到尾表情差不多。不要说触目惊心了,我都可以一边吃饭,一边写作业一边看。可能是因为出品在70年代,大家承受力有限?21世纪的小电影都比这个厉害。 电影说一单身女作家去一荒凉但挺美的小地方休假顺便写作,当地加油时被4只色狼盯上。4人用20多分钟把她强奸。这女的事后一琢磨没报警,用剩下的时间使不同方法逐个干掉那4个男的,捍卫了女权。如果说凡是色情或暴力或无聊的电影就得禁的话,这部电影被禁的十足真金童叟无欺。 不好的当然不推荐看,但烂成这样的电影还是有见识一下的必要。十大禁片,提供刻盘服务,DVD10块一张,不杀熟,给打97折。
十大禁片 序断腿的时候没事儿干,看电影,最多一天看6部。看多了不免就有审美疲劳,就找刺激的看,刺激程度逐渐升级,于是费尽心机搜罗世界十大禁片。看完以后就觉得当导演太爽了,变态都有那么多人陪着一块儿变,还有人给钱让变着法儿的变,自己爽完了还能郁闷别人,所有的想法都能虚拟的实现,所有的欲望都能几近真实的满足。我也要当导演!我的第一部片子就叫《战策真帅!》。 十大禁片,提供刻盘服务,DVD10块一张,有意留言,兄弟姐妹们97折,呵呵。 3月21日 卧龙岗 卧龙区卧龙岗乡位于南阳市城区西南郊,紧临诸葛亮躬耕地汉昭烈皇帝刘备三顾茅庐处武侯祠, |
|
|