You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
To visualise the missingness, I made a heat map ordered by hierarchical similarity. However, when I visualised it I had two problems. The first was that the size of the dataframe was so large that ggplot started choking up. The second problem was that when visualised the heatmap could be difficult to understand as individual points were too small to see. I solved both problems by aggregating areas of pixels of size xy to an average value between 0 and 1 and plotting the resulting resized matrix on a scale of red-white-blue. Because the data had already been hierarchically clustered there is not a huge amount of disturbance. This change allows very large datasets to be visualised and that they are still interpretable.
downsample the data
If you just use the first 1,000 rows / 100 columns, then you get a warning stating that this is not all of the data.
Animation / scrolling
"Roll" the data through visdat as if it were an animation, like scrolling down the data (suggested by @krlmlr)
Use transparency to indicate missingness
@krlmlr 's idea for making each column have an alpha level that corresponds to the missingness - so high transparency means lots of missing data.
The text was updated successfully, but these errors were encountered: