Scholarly record
SPREADSHEET DATA EXTRACTION USING SEMANTIC NETWORK
Abstract
Spreadsheets often contain valuable data that are difficult to process due to an unclear structure. For this reason, spreadsheets also called semi-structured data. Table structure recognition and data extraction is an important area of research. Tables may contain statistical reports, schedules, grade books, results of research or product catalogue. Depending on the purpose, tables have a different structure. In this paper, the authors propose an approach for extracting information from "list" type spreadsheets. These tables store information about objects of the same type, each column represents an object property. Price lists are a good example of such tables? type. The main proposed approach idea is the extraction objects from the spreadsheet using the semantic network. The kernel of semantic network graph is based on Wiktionary data and contains senses and semantic relations between them. Every sense has its owns wordforms, theirs morphological characteristics and instances of the sense, where the instance is the object of given sense. For example, "2m" may be the instance of the sense "length". The semantic network is used to describe the object structure, while instances give useful templates for data. The program developed looks in every row in the spreadsheet, match properties to senses and creates objects with given in semantic network structure. The approach was tested for the corpus of price lists, typical for IT distribution area.
Publication Impact Profile
Publication details
References0
Structured references will appear here after the reference import pass. The count is preserved now so the scholarly record is not incomplete.
Citing literature
Number of times cited according to Crossref: 1
View or Download full articleAccess options
SWS access login
Login as SWS Scientific CommitteeLogin as SWS Scientific PartnerLogin as SWS AuthorAuthors and approved SWS contributors will read and export their own linked papers after identity matching by SWS profile, email and SGEM GlobalID.
For librarian assistance: [email protected]
Purchase Instant Access
- Article can be downloaded after successful payment.
- Article may be used according to SWS library access terms.
- Article cannot be redistributed.

