`
gstarwd
  • 浏览: 1488444 次
  • 性别: Icon_minigender_1
  • 来自: 杭州
社区版块
存档分类
最新评论

Richard MacManus:10个语义应用实例

阅读更多

 把语义应用(semantic applications)视为10月在旧金山web 2.0峰会上的亮点之一,并且列表了10个语义应用的实例,但需要说明的是,这并不是一个top 10之类的榜单。

1、什么是语义应用?

语义应用的一个核心元素是试图识别文本和其它信息的意义(determine the meaning),并且创建其与用户之间的连接( create connections)。 Nova Spivack (下文要介绍的Twine的发起者)同样也将数据的可移植性(data portability)与可连接性(Connectibilty)作为语义应用的关键特征。

Alex Iskold在一篇名为的文章中,列举了实现语义应用的两种方法:

自下而上(Bottom up):即通过将语义标识(semantical annotations)即元数据(meta-data)置入数据中;

自上而下(Top down):即依靠分析现有信息;自上而下的最终解决策略是建立一个能完全以人类的方式理解文本与信息的自然语言处理程序(natural language processor)。

 

下面就放榜:

Freebase
Freebase aims to "open up the silos of data and the connections between them", according to founder Danny Hillis at the Web 2.0 Summit. Freebase is a database that has all kinds of data in it and an API. Because it's an open database, anyone can enter new data in Freebase. An example page in the Freebase db looks pretty similar to a Wikipedia page. When you enter new data, the app can make suggestions about content. The topics in Freebase are organized by type, and you can connect pages with links, semantic tagging. So in summary, Freebase is all about shared data and what you can do with it.

Powerset
Powerset (see our coverage here and here) is a natural language search engine. The system relies on semantic technologies that have only become available in the last few years. It can make "semantic connections", which helps make the semantic database. The idea is that meaning and knowledge gets extracted automatically from Powerset. The product isn't yet public, but it has been riding a wave of publicity over 2007.

 

Twine
Twine claims to be the first mainstream Semantic Web app, although it is still in private beta. See our in-depth review. Twine automatically learns about you and your interests as you populate it with content - a "Semantic Graph". When you put in new data, Twine picks out and tags certain content with semantic tags - e.g. the name of a person. An important point is that Twine creates new semantic and rich data. But it's not all user-generated. They've also done machine learning against Wikipedia to 'learn' about new concepts. And they will eventually tie into services like Freebase. At the Web 2.0 Summit, founder Nova Spivack compared Twine to Google, saying it is a "bottom-up, user generated crawl of the Web".

AdaptiveBlue
AdaptiveBlue are makers of the Firefox plugin, BlueOrganizer. They also recently launched a new version of their SmartLinks product, which allows web site publishers to add semantically charged links to their site. SmartLinks are browser 'in-page overlays' (similar to popups) that add additional contextual information to certain types of links, including links to books, movies, music, stocks, and wine. AdaptiveBlue supports a large list of top web sites, automatically recognizing and augmenting links to those properties.

SmartLinks works by understanding specific types of information (in this case links) and wrapping them with additional data. SmartLinks takes unstructured information and turns it into structured information by understanding a basic item on the web and adding semantics to it.

[Disclosure: AdaptiveBlue founder and CEO Alex Iskold is a regular RWW writer]

Hakia
Hakia is one of the more promising Alt Search Engines around, with a focus on natural language processing methods to try and deliver 'meaningful' search results. Hakia attempts to analyze the concept of a search query, in particular by doing sentence analysis. Most other major search engines, including Google, analyze keywords. The company told us in a March interview that the future of search engines will go beyond keyword analysis - search engines will talk back to you and in effect become your search assistant. One point worth noting here is that, currently, Hakia has limited post-editing/human interaction for the editing of hakia Galleries, but the rest of the engine is 100% computer powered.

Hakia has two main technologies:

1) QDEX Infrastructure (which stands for Query Detection and Extraction) - this does the heavy lifting of analyzing search queries at a sentence level.

2) SemanticRank Algorithm - this is essentially the science they use, made up of ontological semantics that relate concepts to each other.

Talis
Talis is a 40-year old UK software company which has created a semantic web application platform. They are a bit different from the other 9 companies profiled here, as Talis has released a platform and not a single product. The Talis platform is kind of a mix between Web 2.0 and the Semantic Web, in that it enables developers to create apps that allow for sharing, remixing and re-using data. Talis believes that Open Data is a crucial component of the Web, yet there is also a need to license data in order to ensure its openness. Talis has developed its own content license, called the Talis Community License, and recently they funded some legal work around the Open Data Commons License.

According to Dr Paul Miller, Technology Evangelist at Talis, the company's platform emphasizes "the importance of context, role, intention and attention in meaningfully tracking behaviour across the web." To find out more about Talis, check out their regular podcasts - the most recent one features Kaila Colbin (an occassional AltSearchEngines correspondent) and Branton Kenton-Dau of VortexDNA.

UPDATE: Marshall Kirkpatrick published an interview with Dr Miller the day after this post. Check it out here.

TrueKnowledge
Venture funded UK semantic search engine TrueKnowledge unveiled a demo of its private beta earlier this month. It reminded Marshall Kirkpatrick of the still-unlaunched Powerset, but it's also reminiscent of the very real Ask.com "smart answers". TrueKnowledge combines natural language analysis, an internal knowledge base and external databases to offer immediate answers to various questions. Instead of just pointing you to web pages where the search engine believes it can find your answer, it will offer you an explicit answer and explain the reasoning patch by which that answer was arrived at. There's also an interesting looking API at the center of the product. "Direct answers to humans and machine questions" is the company's tagline.

Founder William Tunstall-Pedoe said he's been working on the software for the past 10 years, really putting time into it since coming into initial funding in early 2005.

TripIt
Tripit is an app that manages your travel planning. Emre Sokullu reviewed it when it presented at TechCrunch40 in September. With TripIt, you forward incoming bookings to plans@tripit.com and the system manages the rest. Their patent pending "itinerator" technology is a baby step in the semantic web - it extracts useful infomation from these mails and makes a well structured and organized presentation of your travel plan. It pulls out information from Wikipedia for the places that you visit. It uses microformats - the iCal format, which is well integrated into GCalendar and other calendar software.

The company claimed at TC40 that "instead of dealing with 20 pages of planning, you just print out 3 pages and everything is done for you". Their future plans include a recommendation engine which will tell you where to go and who to meet.

Clear Forest

ClearForest is one of the companies in the top-down camp. We profiled the product in December '06 and at that point ClearForest was applying its core natural language processing technology to facilitate next generation semantic applications. In April 2007 the company was acquired by Reuters. The company has both a Web Service and a Firefox extension that leverages an API to deliver the end-user application.

The Firefox extension is called Gnosis and it enables you to "identify the people, companies, organizations, geographies and products on the page you are viewing." With one click from the menu, a webpage you view via Gnosis is filled with various types of annotations. For example it recognizes Companies, Countries, Industry Terms, Organizations, People, Products and Technologies. Each word that Gnosis recognizes, gets colored according to the category.

 

Also, ClearForest's Semantic Web Service offers a SOAP interface for analyzing text, documents and web pages.

Spock
Spock is a people search engine that got a lot of buzz when it launched. Alex Iskold went so far as to call it "one of the best vertical semantic search engines built so far." According to Alex there are four things that makes their approach special:

The person-centric perspective of a query
Rich set of attributes that characterize people (geography, birthday, occupation, etc.)
Usage of tags as links or relationships between people
Self-correcting mechanism via user feedback loop
As a vertical engine, Spock knows important attributes that people have: name, gender, age, occupation and location just to name a few. Perhaps the most interesting aspect of Spock is its usage of tags - all frequent phrases that Spock extracts via its crawler become tags; and also users can add tags. So Spock leverages a combination of automated tags and people power for tagging.

再次说明,这不是一个top 10榜单,Richard MacManus的原文位于:http://www.readwriteweb.com/archives/10_semantic_apps_to_watch.php。如果你有其它的类似应用需要推荐,你可以在该文的评论中添加。


本文来自CSDN博客,转载请标明出处:http://blog.csdn.net/zhengyun_ustc/articles/1932192.aspx

=======================================================================

 

One of the highlights of October's Web 2.0 Summit in San Francisco was the emergence of 'Semantic Apps' as a force. Note that we're not necessarily talking about the Semantic Web, which is the Tim Berners-Lee W3C led initiative that touts technologies like RDF, OWL and other standards for metadata. Semantic Apps may use those technologies, but not necessarily. This was a point made by the founder of one of the Semantic Apps listed below, Danny Hillis of Freebase (who is as much a tech legend as Berners-Lee).

The purpose of this post is to highlight 10 Semantic Apps. We're not touting this as a 'Top 10', because there is no way to rank these apps at this point - many are still non-public apps, e.g. in private beta. It reflects the nascent status of this sector, even though people like Hillis and Spivack have been working on their apps for years now.

What is a Semantic App?

Firstly let's define "Semantic App". A key element is that the apps below all try to determine the meaning of text and other data, and then create connections for users. Another of the founders mentioned below, Nova Spivack of Twine, noted at the Summit that data portability and connectibility are keys to these new semantic apps - i.e. using the Web as platform.

In September Alex Iskold wrote a great primer on this topic, called Top-Down: A New Approach to the Semantic Web. In that post, Alex Iskold explained that there are two main approaches to Semantic Apps:

1) Bottom Up - involves embedding semantical annotations (meta-data) right into the data.
2) Top down - relies on analyzing existing information; the ultimate top-down solution would be a fully blown natural language processor, which is able to understand text like people do.

Now that we know what Semantic Apps are, let's take a look at some of the current leading (or promising) products...

Freebase

Freebase aims to "open up the silos of data and the connections between them", according to founder Danny Hillis at the Web 2.0 Summit. Freebase is a database that has all kinds of data in it and an API. Because it's an open database, anyone can enter new data in Freebase. An example page in the Freebase db looks pretty similar to a Wikipedia page. When you enter new data, the app can make suggestions about content. The topics in Freebase are organized by type, and you can connect pages with links, semantic tagging. So in summary, Freebase is all about shared data and what you can do with it.

Powerset

Powerset (see our coverage here and here) is a natural language search engine. The system relies on semantic technologies that have only become available in the last few years. It can make "semantic connections", which helps make the semantic database. The idea is that meaning and knowledge gets extracted automatically from Powerset. The product isn't yet public, but it has been riding a wave of publicity over 2007.

Twine

Twine claims to be the first mainstream Semantic Web app, although it is still in private beta. See our in-depth review. Twine automatically learns about you and your interests as you populate it with content - a "Semantic Graph". When you put in new data, Twine picks out and tags certain content with semantic tags - e.g. the name of a person. An important point is that Twine creates new semantic and rich data. But it's not all user-generated. They've also done machine learning against Wikipedia to 'learn' about new concepts. And they will eventually tie into services like Freebase. At the Web 2.0 Summit, founder Nova Spivack compared Twine to Google, saying it is a "bottom-up, user generated crawl of the Web".

AdaptiveBlue

AdaptiveBlue are makers of the Firefox plugin, BlueOrganizer. They also recently launched a new version of their SmartLinks product, which allows web site publishers to add semantically charged links to their site. SmartLinks are browser 'in-page overlays' (similar to popups) that add additional contextual information to certain types of links, including links to books, movies, music, stocks, and wine. AdaptiveBlue supports a large list of top web sites, automatically recognizing and augmenting links to those properties.

SmartLinks works by understanding specific types of information (in this case links) and wrapping them with additional data. SmartLinks takes unstructured information and turns it into structured information by understanding a basic item on the web and adding semantics to it.

[Disclosure: AdaptiveBlue founder and CEO Alex Iskold is a regular RWW writer]

Hakia

Hakia is one of the more promising Alt Search Engines around, with a focus on natural language processing methods to try and deliver 'meaningful' search results. Hakia attempts to analyze the concept of a search query, in particular by doing sentence analysis. Most other major search engines, including Google, analyze keywords. The company told us in a March interview that the future of search engines will go beyond keyword analysis - search engines will talk back to you and in effect become your search assistant. One point worth noting here is that, currently, Hakia has limited post-editing/human interaction for the editing of hakia Galleries, but the rest of the engine is 100% computer powered.

Hakia has two main technologies:

1) QDEX Infrastructure (which stands for Query Detection and Extraction) - this does the heavy lifting of analyzing search queries at a sentence level.

2) SemanticRank Algorithm - this is essentially the science they use, made up of ontological semantics that relate concepts to each other.

Talis

Talis is a 40-year old UK software company which has created a semantic web application platform. They are a bit different from the other 9 companies profiled here, as Talis has released a platform and not a single product. The Talis platform is kind of a mix between Web 2.0 and the Semantic Web, in that it enables developers to create apps that allow for sharing, remixing and re-using data. Talis believes that Open Data is a crucial component of the Web, yet there is also a need to license data in order to ensure its openness. Talis has developed its own content license, called the Talis Community License, and recently they funded some legal work around the Open Data Commons License.

According to Dr Paul Miller, Technology Evangelist at Talis, the company's platform emphasizes "the importance of context, role, intention and attention in meaningfully tracking behaviour across the web." To find out more about Talis, check out their regular podcasts - the most recent one features Kaila Colbin (an occassional AltSearchEngines correspondent) and Branton Kenton-Dau of VortexDNA.

UPDATE: Marshall Kirkpatrick published an interview with Dr Miller the day after this post. Check it out here.

TrueKnowledge

Venture funded UK semantic search engine TrueKnowledge unveiled a demo of its private beta earlier this month. It reminded Marshall Kirkpatrick of the still-unlaunched Powerset, but it's also reminiscent of the very real Ask.com "smart answers". TrueKnowledge combines natural language analysis, an internal knowledge base and external databases to offer immediate answers to various questions. Instead of just pointing you to web pages where the search engine believes it can find your answer, it will offer you an explicit answer and explain the reasoning patch by which that answer was arrived at. There's also an interesting looking API at the center of the product. "Direct answers to humans and machine questions" is the company's tagline.

Founder William Tunstall-Pedoe said he's been working on the software for the past 10 years, really putting time into it since coming into initial funding in early 2005.

TripIt

Tripit is an app that manages your travel planning. Emre Sokullu reviewed it when it presented at TechCrunch40 in September. With TripIt, you forward incoming bookings to plans@tripit.com and the system manages the rest. Their patent pending "itinerator" technology is a baby step in the semantic web - it extracts useful infomation from these mails and makes a well structured and organized presentation of your travel plan. It pulls out information from Wikipedia for the places that you visit. It uses microformats - the iCal format, which is well integrated into GCalendar and other calendar software.

The company claimed at TC40 that "instead of dealing with 20 pages of planning, you just print out 3 pages and everything is done for you". Their future plans include a recommendation engine which will tell you where to go and who to meet.

Clear Forest

ClearForest is one of the companies in the top-down camp. We profiled the product in December '06 and at that point ClearForest was applying its core natural language processing technology to facilitate next generation semantic applications. In April 2007 the company was acquired by Reuters. The company has both a Web Service and a Firefox extension that leverages an API to deliver the end-user application.

The Firefox extension is called Gnosis and it enables you to "identify the people, companies, organizations, geographies and products on the page you are viewing." With one click from the menu, a webpage you view via Gnosis is filled with various types of annotations. For example it recognizes Companies, Countries, Industry Terms, Organizations, People, Products and Technologies. Each word that Gnosis recognizes, gets colored according to the category.

Also, ClearForest's Semantic Web Service offers a SOAP interface for analyzing text, documents and web pages.

Spock

Spock is a people search engine that got a lot of buzz when it launched. Alex Iskold went so far as to call it "one of the best vertical semantic search engines built so far." According to Alex there are four things that makes their approach special:

  • The person-centric perspective of a query
  • Rich set of attributes that characterize people (geography, birthday, occupation, etc.)
  • Usage of tags as links or relationships between people
  • Self-correcting mechanism via user feedback loop

As a vertical engine, Spock knows important attributes that people have: name, gender, age, occupation and location just to name a few. Perhaps the most interesting aspect of Spock is its usage of tags - all frequent phrases that Spock extracts via its crawler become tags; and also users can add tags. So Spock leverages a combination of automated tags and people power for tagging.

Conclusion

What have we missed? ;-) Please use the comments to list other Semantic Apps you know of. It's an exciting sector right now, because Semantic Web and Web 2.0 technologies alike are being used to create new semantic applications. One gets the feeling we're only at the beginning of this trend

分享到:
评论

相关推荐

    Web 2.0 Heros

    7 Richard MacManus: Read/WriteWeb & Web 2.0 Workgroup. . . . . 91 8 TJ Kang:ThinkFree . . . . . . . . . . . . . . . . . . . . . . . . . . . . 101 9 Patrick Crane: LinkedIn. . . . . . . . . . . . . . ....

    ROS基于C++动力学约束的路径规划源码+ppt文件.zip

    ROS基于C++动力学约束的路径规划源码+ppt文件.zip

    ASP.NET BS结构的城市酒店入住信息管理系统的设计

    ASP.NET B/S结构城市酒店入住信息管理系统的设计与实现简介 一、项目背景与意义 随着城市旅游的蓬勃发展,酒店业作为旅游产业链中的重要一环,面临着日益激烈的市场竞争。为了提升酒店的服务质量和管理效率,信息化管理成为酒店业不可或缺的一部分。因此,我们设计并实现了一个基于ASP.NET的B/S(浏览器/服务器)结构城市酒店入住信息管理系统。该系统旨在帮助酒店实现入住信息的快速录入、查询、修改和统计,提升酒店的运营效率和客户体验。 二、系统主要功能 用户管理:系统支持管理员、前台服务员、客户等不同角色的注册、登录和权限管理。通过角色权限的设置,确保系统数据的安全性和完整性。 房间管理:管理员可以添加、编辑、删除房间信息,包括房间类型、价格、状态等。前台服务员可以实时查看房间状态,为客人办理入住和退房手续。 入住信息管理:前台服务员可以录入客人的入住信息,包括姓名、证件号码、联系方式、入住时间和离店时间等。系统支持客人信息的快速查询和修改,方便前台服务员处理各种客户需求。 费用管理:系统根据客人的入住时间和房间价格自动计算费用,并支持多种支付方式。管理员可以设置折扣、优惠券等促销

    基于streamlit的YOLOv8可视化交互界面

    基于streamlit的YOLOv8可视化交互界面

    liba52-0-0.7.5+svn613-lp152.3.2.aarch64.rpm

    liba52-0-0.7.5+svn613-lp152.3.2.aarch64

    基于matlab实现配电网三相潮流计算方法,对几种常用的配电网潮流计算方法进行了对比分析.rar

    基于matlab实现配电网三相潮流计算方法,对几种常用的配电网潮流计算方法进行了对比分析.rar

    123321123323211

    121342141414

    哈希算法(Hash Algorithm)是一种将任意长度的二进制数据映射为较短的、固定长度的二进制值的函数.txt

    哈希算法的特点

    基于ros和stm32f1的小车代码含串口通信+项目说明.zip

    复刻平衡小车,自行添加了转向环和蓝牙控制,蓝牙APP使用的是轮趣科技的。该项目为复刻平衡小车,其中大量代码为b站up主_WNNN的开源代码,加了转向环的代码,添加蓝牙控制功能. 该小车主体由洞洞板焊接,使用的io口焊的时候忘记记了,随缘吧。

    ZEND解密dezender12

    zend解密 dezender12 dezender12是一个专业对用Zend Encoder/SafeGuard, ionCube, SourceGuardian,phpcipher、codelock或SourceCop加密过的PHP文件进行破解的网站, 它主要运用密码分析、解压缩和反编译技术将经编码/加密过的PHP文件还原为可阅读、可执行的PHP源文件。

    基于YOLOv8的多端车流检测系统用于毕设+开源

    客户端环境配置 第一步 配置python环境 下载python(版本:python>=3.8)(建议使用访问Anaconda官网配置虚拟环境,具体步骤如下) 1)访问Anaconda官网:https://www.anaconda.com/products/individual 2)选择相应的操作系统版本并下载对应的安装包(推荐下载64位版本) 3)打开下载的安装包,按照提示进行安装即可 4)创建一个虚拟环境: conda create --name 自命名 python=3.9.16 第二步 下载库 注意:下载库前,如果想要更好的帧数体验请安装cuda版本哦(因为一般默认会安装cpu的版本) pip换源: pip config set global.index-url https://pypi.tuna.tsinghua.edu.cn/simple 切换到项目文件夹下,下载依赖: pip install -r requirements.txt 我自己使用的环境:python3.9+CPU 第三步 运行项目(如果不需要(开启网页端) 或 (对接RTSP))

    Qt+FFmpeg实现音频播放器

    运用Qt框架+FFmpeg音视频解码库实现音频播放器,通过实时解码音频传给设备进行播放,可供学习和参考

    liba2ps1-4.15.5-2.2.s390x.rpm

    liba2ps1-4.15.5-2.2.s390x

    一个基于linux C++的Flv解析器.zip

    一个基于linux C++的Flv解析器.zip

    机械设计电机冲切转子组装一体机sw18可编辑非常好的设计图纸100%好用.zip

    机械设计电机冲切转子组装一体机sw18可编辑非常好的设计图纸100%好用.zip

    智能监控JAR进程:Bash脚本助力运维.zip

    本Bash脚本用于自动化管理Java JAR应用的启动、停止及监控。首先检查JAR进程是否在运行,如在运行则安全终止。随后,使用预设的Java参数启动JAR文件,并将输出和错误日志重定向至日志文件。启动后,脚本持续监控JAR进程状态,确保其在预设时间内成功启动。本脚本提供了灵活的配置和错误处理机制,为Java应用的运维管理带来了便捷与可靠性。

    2024-2030全球及中国太阳能汽油泵行业研究及十五五规划分析报告.docx

    2024-2030全球及中国太阳能汽油泵行业研究及十五五规划分析报告

    Geek Geek Geek

    Geek

    yolov3无人机俯视视角下热红外行人小目标检测权重+数据集

    yolov3无人机俯视视角下热红外行人小目标检测权重, 包含5000多千张YOLO算法无人机俯视视角下热红外行人小目标数据集,数据集目录已经配置好,yolo格式的标签,划分好 train,val, test,并附有data.yaml文件,yolov5、yolov7、yolov8等算法可以直接进行训练模型, 数据集和检测结果参考:https://blog.csdn.net/zhiqingAI/article/details/124230743 数据集配置目录结构data.yaml: nc: 1 names: ['person']

    基于C#的开源音乐播放器MetroPlayer.zip

    基于C#的开源音乐播放器MetroPlayer.zip

Global site tag (gtag.js) - Google Analytics