The Spreadsheet Import panel

Content

Importing a simple, tabular spreadsheet
Importing spreadsheets with a complex cell structure
Reusing spreadsheet import tasks


Importing a simple, tabular spreadsheet

A flat tabular collection of data residing in a MS Excel spreadsheet can be imported in Synop Analyzer as a data source as follows:


Importing spreadsheets with a complex cell structure

In the following sections we will explain the features and functions of the spreadsheet import wizard at the example of the MS Excel file doc/sample_data/earnings_sheet.xls. The file contains the monthly earnings sheet for a small company with two locations for the period from January 2006 to March 2009. The figure below shoes a part of this Excel sheet:

screenshot Excel spreadsheet

In the present form, the data are not really suitable for being used by a forecasting and trend analysis tool: in the Excel sheet, meta data information (such as location, date or cost category) is intermixed with number cells, empty space cells, formula cells (such as EBT or Gross Profit II) and auxiliary title or text cells. Furthermore, the sheet contains accountant's corrections at year end (such as the column 13.2008 highlighted in the picture above); these corrections have to be distributed on the 12 months of the preceding year before the corresponding time series can be used for a forecast or trend analysis. We will see that Synop Analyzer supports various preprocessing steps on this input sheet in order to overcome the aforementioned problems.

From the Synop Analyzer main menu, we select FileImport data from spreadsheet. A file chooser dialog opens up.

image file img/tsa_excel_wizard.png not found

We select the file doc/sample_data/earnings_sheet.xls in the file chooser dialog. A new Spreadsheet window opens on the main canvas:

image file img/tsa_excel_wizard.png not found

The upper right part of the window contains several input fields in which we can specify how the spreadsheet is to be used. The lower part of the window shows the effects of these specifications.

The specifications decribed above are automatically reflected by an adapted coloring scheme in the tabular representation of the currently active spreadsheet in the lower part of the screen: spreadsheet cells containing meta data information are displayed with a green background, cells with values to be distributed among other cells have a blue background, cells which are to be ignored are grayed out and 'normal' value cells are displayed with white background.

Finally, we click on the Start Transformation button. An instant later, the pop-up window closes and the transformed flat file earnings_sheet.txt is written into our chosen target directory.

The generated file contains the columns Location, Month, CostCategory and Cost. The new file is suitable for a statistical analysis using the entire set of Synop Analyzer functions. The new file is suitable for being read into Synop Analyzer, and it is automatically opened in the Input Data panel.


Reusing spreadsheet import tasks

If you enter a file name into the input field named Parameter file name in the spreadsheet import opp-up window (the file name should end with .xml), then the specifications that you perform in the pop-up window will be saved to that file automatically when you press the button Start transformation. You can later load these settings by selecting File → Import data from spreadsheet, by changing the file type to parameter file (.xml) in the file chooser dialog and by navigating to the previously stored parameter file.

You can also save and re-load spreadsheet import settings as a part of a larger data import process. To that purpose, leave the spreadsheet import window by pressing Start transformation, optionally perform further data import settings in the left column of the main window and save the settings by selecting File → Save Data Load Task. In this case, your spreadsheet import settings are stored within the resulting parameter file. You can later execute this data load task by selecting File → Open Data Load Task.