How to extract formula from Excel using Alteryx

How to extract formula from Excel using Alteryx

2021.06.11

この記事は公開されてから1年以上経過しています。情報が古い可能性がありますので、ご注意ください。

(日本語はこちらから)

Introduction:

Alteryx is a convenient data-prep tool which can import data from a variety of files. Take the case of Excel, the input can be from a single or multiple files having the target data distributed in one or more excel-sheets. Those files can be located in various directories and can have the same or different schema.

However, these examples mentioned above would just import the data and not the underlying formula which was used to compute the data values. A visual example is shown here:

By default, Alteryx would import the data value “5” and not the underlying formula “=(10+5)/3” which was used to calculate it. Depending on the requirements, sometimes, the user may want to import the formula to understand the logic or for debugging etc.

This post will explain how to extract the formula from excel files.

How to achieve:

Let us consider an excel file with the following data, note that columns A, B and C hold the alpha-numeric values, while column D is a calculation B/C*100

When we read an excel file in Alteryx we just import its data values, so in order to extract the formula, the trick here is to treat it as if it was a zip file. In the input tool configuration, choose “File Format” as Zip (*.zip) and click on the “Table or Query” and select the “worksheets” sub-folder as shown below.

Further configure the input tool so that the delimiters are set to “\n”, “First Row Contains Field Names” is deselected and “Field Length” is set to a higher value to accommodate all the input characters as shown below.

Next, add a sampling tool to just pick up the last row from the data.

Finally to extract the information from the input file, use the XML Parse tool with the following configuration.

The formula can be found in the “n”th outer XML, where “n” is the column number of excel where the formula exists. In this case the formula is present in the 4th column, so the result can be extracted from c_4_OuterXML. Or it can also be found in the row_OuterXML column.

Thus with the help of some workaround, it is possible to extract the formulae from an existing excel file.

Summary

With the help of Alteryx, it is possible to extract the formula which was used to create a calculation in an excel file.

Alteryxの導入なら、クラスメソッドにおまかせください

日本初のAlteryxビジネスパートナーであるクラスメソッドが、Alteryxの導入から活用方法までサポートします。14日間の無料トライアルも実施中ですので、お気軽にご相談ください。

alteryx_960x400


データ分析基盤ならCSAにおまかせください!

クラスメソッドでは、データ分析に関わる皆様の作業コストを削減し、分析アクションを促進・効率化・自動化するためのデータ分析基盤「カスタマーストーリーアナリティクス」を提供・展開しています。興味をお持ち頂けた方は下記バナーをクリックし、詳細をご覧頂けますと幸いです。(各種問い合わせも承っております)

データ分析基盤を最短1ヶ月で導入可能


また、現在クラスメソッドでは毎週無料の相談会を開催しています。データ分析基盤の構築から、BIツール、データ統合など、ビッグデータ分析の導入や活用、技術についてお気軽にご相談ください。

データ分析相談会

この記事をシェアする

データ分析基盤を最短1ヶ月で導入できる

関連記事