In theory I love Power Query. I've run training etc with examples of ugly data that's a few hundred rows and it all works like dream. In the real world, when I have an excel with 500k rows, using it is agony.
I have no problem with lengthy processing time, I can run that over lunch or overnight, but the actual delay to just create the steps makes me want to tear my hair out. My workflow now is often to go into the actual source data and make a much reduced copy (ie cut 500k rows down to 100), create the query, and then change the source back to my actual file. Or equivalently, put only two files into a folder and create a 'from folder' query and then dump all the files into the folder.
Am I doing something wrong here? I can't wait 10 minutes between creating steps. Should I really need to doctor my data to make queries. I ca get access to Alteryx, should I be looking into that?
I have no problem with lengthy processing time, I can run that over lunch or overnight, but the actual delay to just create the steps makes me want to tear my hair out. My workflow now is often to go into the actual source data and make a much reduced copy (ie cut 500k rows down to 100), create the query, and then change the source back to my actual file. Or equivalently, put only two files into a folder and create a 'from folder' query and then dump all the files into the folder.
Am I doing something wrong here? I can't wait 10 minutes between creating steps. Should I really need to doctor my data to make queries. I ca get access to Alteryx, should I be looking into that?