Linked Data is a recent technological trend in information management that has emerged within the general framework of the Semantic Web. The term refers to a method for exposing, sharing and connecting data on the Web through dereferenceable URIs.
Behind this concept is the ability to transform the information contained in documents or data sources that appear to be structured but whose structure applies only to the content, not to the data. The transformation takes information from a document (which can be read, but whose data cannot be used automatically because it has not been interpreted beforehand) and turns it into a series of tagged data that can be processed automatically.
Humans can read, interpret and analyze the information stored in documents with relative ease, isolating and categorizing the elements contained in the text. The goal is for the document and its contents to go from being mere information to being data, where data means a set of alphanumeric elements that a program, rather than a human, can use.
Obtaining these data is not a trivial task. A comprehensive analysis of the documents using NLP (Natural Language Processing) techniques is needed to extract specific data. Besides this NLP process, a mapping process between the extracted concepts and their links is needed in order to generate the Linked Open Data.
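The two steps described above, extraction and mapping, can be illustrated with a minimal sketch. It is not the project's implementation: a simple regular expression stands in for the NLP analysis, and the namespace http://example.org/, the property name revenueMillionEUR, the sentence pattern and the company names are all assumptions made up for the example. The output is written as N-Triples, one of the standard RDF serializations.

```python
import re
from urllib.parse import quote

# Hypothetical namespace for the example; a real project would use its own URIs.
BASE = "http://example.org/"
XSD_DECIMAL = "http://www.w3.org/2001/XMLSchema#decimal"

# Assumed sentence pattern. A regular expression is only a stand-in for a
# real NLP pipeline, which would handle far more varied phrasing.
PATTERN = re.compile(
    r"([A-Z][A-Za-z]+) reported revenue of (\d+(?:\.\d+)?) million euros"
)


def extract_facts(text):
    """Extraction step: return (company, amount) pairs found in the text."""
    return [(m.group(1), m.group(2)) for m in PATTERN.finditer(text)]


def to_ntriples(facts):
    """Mapping step: turn each extracted pair into one N-Triples statement."""
    lines = []
    for company, amount in facts:
        subject = f"<{BASE}company/{quote(company)}>"
        predicate = f"<{BASE}vocab/revenueMillionEUR>"
        obj = f'"{amount}"^^<{XSD_DECIMAL}>'
        lines.append(f"{subject} {predicate} {obj} .")
    return lines


if __name__ == "__main__":
    sample = (
        "Acme reported revenue of 12.5 million euros. "
        "Globex reported revenue of 30 million euros."
    )
    for triple in to_ntriples(extract_facts(sample)):
        print(triple)
```

Each extracted concept becomes a URI, so that other datasets can link to it. That linking is what turns the tagged data into Linked Data.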
The aim of this project, and its initial hypothesis, is therefore to produce a process that extracts economic data and financial information from documents.
The main objective of the project is the creation of a prototype for the analysis of employability in the Community of Madrid. This prototype has made it possible to validate that information about employability levels can be obtained from a fixed set of data extracted from websites with available job offers. To this end, the information is analyzed so that it can be used, in a suitable format, in the proposed prototype.
The main characteristic of this prototype is that it generates example reports on the different factors of employability and automates their production. The prototype is intended as a proof of concept that verifies the product requirements, the implementation and the testing (in a controlled context) of the key concepts.
Twittiment is a Twitter monitoring tool that arose from the collaboration between two institutes of the Universidad Carlos III de Madrid: the Instituto de Desarrollo Tecnológico y Promoción de la Innovación 'Pedro Juan de Lastanosa' and the Instituto para el Desarrollo Empresarial.
Twittiment is an application that captures all the relevant information about brands, events, a sector, and so on. The system then isolates the sources of insights, identifies areas of interest and detects associations.
The goal of the GODO project is to develop a platform that facilitates the discovery of Web Services. GODO takes advantage of the incorporation of semantics and uses natural language processing and ontological engineering mechanisms.
The approach of the GODO project is based on the use of NLP and ontologies, focusing on the automatic construction of ontologies from text and the detection of ontological elements (concepts, classes, relations, attributes).
Phase II, GODO 2, is a clear advance in service discovery, implementing a platform that supports service description and composition. This platform, in turn, puts special emphasis on the intuitive definition of objectives through natural language, which brings users closer to service management.
The main purpose of the ISDAC project is to define and develop the technology required to address common problems in the management and support of commercial and packaged IS. These needs include deployment and diffusion functionalities, support life-cycle backup, unattended configuration, and novel capacities for integration and interaction with other systems, processes and individuals in the environment.
The objective of the SITIO project is the use of several emerging paradigms (SaaS, Semantic Technologies, Business Process Modeling, Cloud Computing) to build a new SaaS platform enriched with semantic technologies, oriented towards interoperability and cost reduction, with consequent benefits for the software industry. Therefore, a key point of the project is the development of a framework, including models and methods, to facilitate access to business services and Cloud Computing services, from the perspective of a broker.
SITIO also addresses the growing number of business processes available as Web services, which gives rise to another, more challenging problem: the integration of heterogeneous applications. Adding semantics to SaaS business processes leads to an architecture for integrating several information systems, viewed from the perspective of emerging scenarios in which the use of such technologies is beneficial.