Tag Cloud
I discovered this web site today : http://www.wordle.net/
It proposes some nices layouts to display tag clouds.
The display is made with a java applet, but I’m sure we could do pretty the same thing using SVG. I used SVG in one of my previous projects 2 or 3 years ago and I really like how simple it was to generate complex vectorial drawings. The only issue was that to display SVG files in a browser we had to use an external plugin : adobe svg viewer. At this time, this plugin was not supporting the last svg specification and was going to be stopped (I suppose it is effectively now).
SVG might have lost some interest now with complex javascript framework like DOJO or EXT, and FLash/Flex, but … well I like SVG, may be I’m a little bit nostalgic. I thing one of the main advantage of SVG is that it’s XML based language. So on the server side it’s quite easy to generate and also to debug (any XML editor to check the syntax or SVG editor to look at the rendering, eg Inkscape). And the task to insert some semantic information into this XML cloud using RDFa or eRDF would be hugely simplify.
To generate a tag cloud, which is mainly a font centered diagram, SVG must do the job quite easily and keeping at the same time the application layers well separated.
Now the last browser generation supports SVG natively and for the others a framework like DOJO or EXT is able to encapsulate it and emulate the rendering using the specific browser capabilities. I should try to implement a composant to render different tag cloud layouts with SVG just for fun. I try to found a suitable algorithm to calculate tags positions but it should not be so hard to have a simple one to have a first drop.
Some links to have a look at
http://arxiv.org/abs/cs.DS/0703109
http://dotnetaddict.dotnetdevelopersjournal.com/tw.htm
http://semanticvoid.com/blog/2006/01/06/tag-cloud-font-distribution-algorithm/
http://poeticcode.wordpress.com/2007/01/27/tag-cloud-algorithmlogicformula/
http://internetducttape.com/2007/02/22/tag-cloud-generator-for-wordpresscom/
http://www.citebase.org/abstract?identifier=oai%3AarXiv.org%3Acs%2F0703109&action=citeshits&citeshits=cites
At least I took time to write a post even if it’s not THE post of the year…
Filed under: Semantic Web | 1 Comment
Tags: rendering algorithm, SVG, tag cloud
links for 2008-03-28
-
Follow the faceted navigation system described in this article, a technology demonstrator based on IBM® Omnifind™ Discovery Edition that exploits the XML capabilities of IBM DB2®, to explore the advantages of faceted navigation, and see how to get the
Filed under: del.icio.us links | Leave a Comment
links for 2008-03-20
-
Galway team, work on faceted search/navigation interface
Filed under: del.icio.us links | Leave a Comment
Semantic food for free
Last year, Reuters acquired text analytics company ClearForest.
They recently launched a new free semantic web service, based on ClearForest technology, named OpenCalais. This service helps to extract from a submitted text (web content for example), entities. And, last but not least, the service returns all these extracted concepts as an RDF graph. So using this service and browsing this graph, you can automatically tag any unstructured content (with RDFa for example), provide enhanced search functionalities based on the semantic (if you have a good knowledge of the used ontology), etc…
See below a short example: I submitted a text found on the web to this service through this web page, then I queried the returned RDF graph using this RDF graph visualization tool and a pretty simple SPARQL like query, to retrieve all what was identified as a “Company”. Well, it could be best if all the found companies were linked by something else than their common type, for example an “acquired” relationship, but it’s already a good start.
Original Plain Text
March 16, 2004 (Computerworld) — Enterprise content management vendor Documentum Inc. has acquired a one-step content integration product line from Xerox Corp. and today unveiled a new “virtual repository” for improved organization of stored data.
In an announcement, the Pleasanton, Calif.-based company said its new Documentum Virtual Repository will allow companies to organize and store a wide range of internal and external information that will be easy to retrieve for use. The repository will allow aggregation for automated and scheduled content collection from multiple sources and will make the information available to others in compatible formats.
The new feature will be available early in the second quarter.
In a related move, Documentum acquired the AskOnce business unit of Xerox for an undisclosed price. AskOnce is a secure enterprise content integration product that searches multiple repositories and data types using a single query. AskOnce relies on a uniform query interface to connect it to existing database, document repository, Internet, corporate intranet or e-mail applications.
Financial details of the transaction weren’t disclosed.
“With the Documentum Virtual Repository solution, companies will be able to control all of their content — internal and external, structured and unstructured — regardless of where it resides,” Dave DeWalt, president of the Documentum division of EMC Corp., said in a statement.
“Most enterprises have limited knowledge of the content scattered throughout their organizations — on employee desktops, internal and external networks, Web sites and portals, or in data archives. There’s a great need in the market for technology that helps companies manage all of this content — especially with the intense public scrutiny of both government agencies and public companies.”
All identified entities
Tagged HTML sample
March 16, 2004 (Computerworld) — Enterprise content management vendor Documentum Inc. has acquired a one-step content integration product line from Xerox Corp. and today unveiled a new “virtual repository” for improved organization of stored data. In an announcement, the Pleasanton, Calif. -based company
Global RDF graph
RDQL (SPARQL like) query : What is identified as a “Company”?
SELECT ?subject ?predicate ?object WHERE
(?subject rdf:type <http://s.opencalais.com/1/type/em/e/Company>)
(?subject ?predicate ?object)
RDF graph / Query result
Filed under: Semantic Web | 3 Comments
Tags: OpenCalais RDF SemanticWeb service SPARQL
links for 2007-12-07
-
Adding advanced RDF services to Oracle Database
Filed under: del.icio.us links | Leave a Comment
links for 2007-12-06
-
Use case with the triple store longwell
And more info about facets computation
Filed under: del.icio.us links | Leave a Comment
links for 2007-12-04
-
In this article we present MultiBeeBrowse (MBB), an accessible faceted navigation solution that solves aforementioned problems in the browsing environment. We present how to improve users’ access to their history of refinements; we discuss how users ca
Filed under: del.icio.us links | Leave a Comment
links for 2007-11-21
-
Faceted browsing [23] is a data exploration technique for large datasets. We have
shown how this technique can be employed for arbitrary semi-structured content.
We have extended the expressiveness of existing faceted browsing techniques
and have devel -
About Semantic Search – Web 3.0 for Drupal | Semantic Search – Faceted Search and Web 3.0 for Drupalcode source available and faceted search : need a closer look!
-
Why use an RDF store when I can build search functionality “just like that” in SQL faster, cheaper, and more efficiently?
-
when we implement RDF stores atop existing RDBMS, we begin to use a row for each new property/attribute that we would like to store about the resource, thus making it sub-optimal for joins and other operations
Filed under: del.icio.us links | Leave a Comment
links for 2007-11-16
-
Applying XQuery and OWL to The World Factbook, Wikipedia and Project Gutenberg
-
Relational database integration with RDF/OWL
Filed under: del.icio.us links | Leave a Comment
links for 2007-11-10
-
Word shape is no longer a viable model of word recognition. The bulk of scientific evidence says that we recognize a word’s component letters, then use that visual information to recognize a word. In addition to perceptual information, we also use conte
Filed under: del.icio.us links | Leave a Comment
links for 2007-11-07
-
Using Semantic Web Pipes you can fetch, mix and process RDF files published on the Web. As the output of a Pipe is an HTTP retrievable RDF model, simple pipes can also work as inputs to more complex Pipes.
Filed under: del.icio.us links | Leave a Comment
links for 2007-10-25
-
We believe that future web applications will retain the Web 2.0 focus on community and usability, while drawing on Semantic Web infrastructure to facilitate mashup-like information sharing.
Filed under: del.icio.us links | Leave a Comment
links for 2007-10-07
-
we have designed and implemented a visualization system for playful end-user exploration and navigation of large-scale online social networks.
Filed under: del.icio.us links | Leave a Comment
links for 2007-10-02
-
A novel query interface that enables users to
construct a rich search query without any prior knowledge
of the underlying schema or data. The interface interacts in real-time
with the users as they type, guiding them through the query
construction.
Filed under: del.icio.us links | Leave a Comment
links for 2007-10-01
-
Quintura Search offers a visual map of tags or hints contextually related to your search query. Adding or subtracting tags from the query using a mouse
click, “One-Click-Search”, allows to specify the context or meaning of the query. -
Download Quintura Search 1.5 : standalone version?
Filed under: del.icio.us links | Leave a Comment
RDF Graph visualization tools
-
RDF Gravity is a tool for visualising RDF/OWL Graphs/ ontologies.
-
Welkin is a graph-based RDF visualizer developed by the Simile project
-
IsaViz is a visual environment for browsing and authoring RDF models represented as graphs
-
XML.com: Styling RDF Graphs with GSS
In this article, we have seen how to make the representation of a relatively complex RDF model easier to understand by declaring simple styling and visibility instructions to be applied to selected resources and properties.
-
RDFAuthor is a tool designed to ease the pain of creating rdf instance data. Authoring is largely a matter of dragging in data and binding it together using a graphical interface.
Filed under: Semantic Web, del.icio.us links | Leave a Comment
Tags: rdf graph visualization tools semanticweb
RDF data
What are the possible RDF data sources?
- triple stores : locals and distant, the last ones can be queried using the SPARQL protocol
- local rdf datasets or data available in RDF : loaded in memory
- intermediate layer doing the mapping between the native/legacy data representation (DB or LDAP for example) and an exposed RDF view.=> D2RQ/D2R , SquirrelRDF, …
- web site containing semantic data : RDFa, microformats, eRDF, … These information can be extracted using GRDDL for example.
- scrapping applications : Solvent
Filed under: Semantic Web | 2 Comments
Tags: SemanticWeb RDF
links for 2007-09-28
-
A loose definition of clustering could be “the process of organizing objects into groups whose members are similar in some way”. A cluster is therefore a collection of objects which are “similar” between them and are “dissimilar” to the object
Filed under: del.icio.us links | Leave a Comment
links for 2007-09-27
-
A useful search engine is more than a search algorithm. This article explains how to create a search query analysis tool, a best bets feature, and a basic controlled vocabulary. We’ll use MySQL for the examples.
Filed under: del.icio.us links | Leave a Comment
Recent Entries
Categories
- del.icio.us links (21)
- Semantic Web (5)



