Friday, May 20, 2022
HomeBusiness IntelligenceDiscovering Minimal Date and Most Date Throughout All Tables in Energy Question...

Discovering Minimal Date and Most Date Throughout All Tables in Energy Question in Energy BI and Excel


Finding Minimum Date and Maximum Date Across All Tables in Power Query in Power BI and Excel

After we discuss information evaluation in Energy BI, making a Date desk is inevitable. There are completely different strategies to create a Date desk both in DAX or in Energy Question. In DAX you my use both CALENDAR() operate or CALENDARAUTO() operate to create the Date desk. In Energy Question you might use a mixture of Listing.Dates()#date() and #period() features. Both manner, there may be one level that’s all the time difficult and it’s discover out a correct date vary, ranging from a date up to now and ending with a date sooner or later, that covers all related dates throughout the information mannequin. One easy reply is, we will ask the enterprise. The SMEs know what the legitimate date vary is..

Whereas it is a appropriate argument it isn’t all the time the case. Particularly with the Begin Date which is a date up to now. In lots of circumstances the enterprise says:

Lets’s take a look on the information to search out out.

That can be an accurate level, we will all the time a take a look at the information, discover all columns with both Date or DateTime datatypes then kind the information in ascending or descending order to get the outcomes. However what if there a lot of them? Then this course of may be very time consuming.

A lot of you might already thought that we will use CALENDARAUTO() in DAX and we’re good to go. Nicely, that’s not fairly proper. In lots of circumstances there are some Date or DateTime columns that should not be thought-about in our Date dimension. Like Delivery Date or Deceased Date. Extra on this later on this publish.

On this publish I share a chunk of code I wrote for myself. I used to be in a scenario to determine the Begin Date and the Finish Date of the date dimension many occasions, so I believed it would show you how to as properly.

The Energy Question expressions I share on this publish begins with getting all current queries utilizing:

  • #sections intrinsic variable
  • Filtering out the present question identify, which is GetMinMaxAllDates in my pattern, to keep away from getting the next error:

Expression.Error: A cyclic reference was encountered throughout analysis.

Expression.Error: A cyclic reference was encountered during evaluation.

  • Filtering out the queries which can be NOT as kind desk
  • Including a brand new structured column named TableSchema that features the tables’ construction
  • Increasing the TableSchema structured column protecting the Identify and Form columns and renaming the Identify column to Column Identify and the Form column to Datatype
  • Filter the outcomes to maintain solely the columns with both Date or DateTime datatypes
  • Filtering out pointless values from the Column Identify like Delivery Date
  • Including a brand new column named Min Date that will get the minimal worth of the column that seems within the Column Identify column of the desk worth that seems within the Worth column

Hmm! I suppose it’s an excessive amount of mentioning worthcolumn and desk in numerous contexts. I hope I’m not making it much more complicated although.

  • Including one other new column named Max Date much like how we created the Min Date
  • Extracting the minimal worth of the Min Date column
  • Extracting the utmost values of the Max Date column
  • Exhibiting the latter two as an inventory

So if you’re on the lookout for an answer right here is the Energy Question expressions that I take advantage of:

let
    AllQueries = #sections,
    RecordToTable = Document.ToTable(AllQueries[Section1]),
    FilterOutCurrentQuery = Desk.SelectRows(RecordToTable, every [Name] <> "GetMinMaxAllDates" and Sort.Is(Worth.Sort([Value]), kind desk) = true),
    AddTableSchemaColumn = Desk.AddColumn(FilterOutCurrentQuery, "TableSchema", every attempt Desk.Schema([Value]) in any other case null),
    ExpandTableSchema = Desk.Buffer(Desk.ExpandTableColumn(AddTableSchemaColumn, "TableSchema", {"Identify", "Form"}, {"Column Identify", "Datatype"})),
    FilterTypes = Desk.SelectRows(ExpandTableSchema, every ([Datatype] = "datetime" or [Datatype] = "date")),
    AddedMinDateColumn = Desk.AddColumn(FilterTypes, "Min Date", every Date.From(Listing.Min(Desk.Column([Value], [Column Name])))),
    AddedMaxDateColumn = Desk.AddColumn(AddedMinDateColumn, "Max Date", every Date.From(Listing.Max(Desk.Column([Value], [Column Name])))),
    FilterOutUnnecessaryColumns = Desk.SelectRows(AddedMaxDateColumn, every ([Column Name] <> "BirthDate")),
    MinDate = Listing.Min(Listing.Mix({FilterOutUnnecessaryColumns[Min Date], FilterOutUnnecessaryColumns[Max Date]})),
    MaxDate = Listing.Max(Listing.Mix({FilterOutUnnecessaryColumns[Min Date], FilterOutUnnecessaryColumns[Max Date]})),
    MinMaxDates = {"Min Date = " & Textual content.From(MinDate), "Max Date = " & Textual content.From(MaxDate)}
in
        MinMaxDates

You may obtain the above expressions from right here.

The picture under illustrates the outcomes of operating the above code in Energy Question Editor having 11 reality tables and 2 dimension tables. These tables have 17 columns with both Date or DateTime datatypes:

GetMinMaxAllDates Query in Power Query

Be aware: As soon as once more, it is advisable to cross the present question identify within the expressions above. In my case the present question identify is GetMinMaxAllDates as proven within the picture under:

Filtering out the current Query Name

Earlier on this publish I discussed that in lots of circumstances we do NOT need all Date or DateTime columns to be coated by the Date desk. A very good instance for it’s Delivery Date and Deceased Date. If we don’t be aware that then we will create numerous irrelevant dates in our Date desk like what we get because the Min Date within the above picture which is 10/02/1916. As you possibly can within the picture above there’s a FilterOutUnnecessaryColumns step. We click on on that step to filter the pointless values from the Column Identify column as proven within the picture under:

Filtering out Birth Date

Click on on the final step which is MinMaxDates to see the brand new values as proven within the picture under:

New Min Date after fingering out the Birth Date column

By operating the above question you get the legitimate date vary, so now you can create a Date desk with any methodology of alternative, both in Energy Question or DAX utilizing the above date vary. Keep in mind, creating the Date desk is totally separate course of. This question is barely serving to us discovering minimal and most legitimate dates throughout all tables loaded into the Energy Question Editor.

Issues

  • The above tables altogether have 40M rows and the GetMinMaxAllDates question ran in roughly 10 sec on my machine which isn’t dangerous in any respect. Nonetheless, in bigger tables it might take extra to provide the outcomes
  • You could have some queries already loaded into the Energy BI Editor
  • This methodology additionally works in Direct Question mode, however you anticipate the question to take extra time to get the outcomes
  • The above question retrieves the min date and max date throughout all tables. While you create a Date desk, bear in mind that the Date column ought to begin from the 1st Jan of the min date going all the way in which as much as the thirty first Dec of the max date
  • This methodology works in Energy BI Desktop RS
  • This methodology is NOT supported in Energy BI Dataflows

Take pleasure in your Relationship!

RELATED ARTICLES

LEAVE A REPLY

Please enter your comment!
Please enter your name here

Most Popular

Recent Comments