You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
As per Ariel Nunez's suggestion, for bucketed time series data like the daily mailing list activity counts, there should be machinery for doing a component analysis of the activity.
This means turning a time stamp into a feature vector. The featurization frameowrk should be flexible enough to support both periodical features (such as 'Mondays'), progression over time (i.e. days from origin), and landmark events (such as release dates).
Use PCA or a similar algorithm to get a sense of the contribution of each feature to the total activity. Document this process in an I Python notebook, and include comments on the limitations of the linear model.
This discussion was converted from issue #35 on December 06, 2021 21:57.
Heading
Bold
Italic
Quote
Code
Link
Numbered list
Unordered list
Task list
Attach files
Mention
Reference
Menu
reacted with thumbs up emoji reacted with thumbs down emoji reacted with laugh emoji reacted with hooray emoji reacted with confused emoji reacted with heart emoji reacted with rocket emoji reacted with eyes emoji
-
As per Ariel Nunez's suggestion, for bucketed time series data like the daily mailing list activity counts, there should be machinery for doing a component analysis of the activity.
This means turning a time stamp into a feature vector. The featurization frameowrk should be flexible enough to support both periodical features (such as 'Mondays'), progression over time (i.e. days from origin), and landmark events (such as release dates).
Use PCA or a similar algorithm to get a sense of the contribution of each feature to the total activity. Document this process in an I Python notebook, and include comments on the limitations of the linear model.
Beta Was this translation helpful? Give feedback.
All reactions