Hi all, I have a total of 9 million rows in my CommCare case. Out of the total 9 millions only the recently collected 3 millions records are needed for analysis and data collection is ongoing.
As I need only the data after '2024-12-01' onwards, I plan to use the --since parameter and the whole command seems as below
commcare-export --output-format sql --output mssql+pyodbc://localhost/MyDB?driver=ODBC+Driver+17+for+SQL+Server --project myCommcareWorkspace --query myExcel.xlsx --missing-value NULL --batch-size 1000 --since 2024-12-01 --username myUserName --password myPassword
I tried it and it pulls all 3 millions data successfully. However, as data collection is on progress whenever I re-run the above script it doesn't bring only the newly added one rather everything from '2024-12-01'.
Is there a way to use the --since parameter for the first time to pull all recent data after 2024-12-01 up until now and then for the subsequent data pulling continue with incremental refresh?