
Improve response parsing: no ptypes, faster datetimes#513

Open
karawoo wants to merge 5 commits into main from kara-parsing

Conversation


@karawoo (Collaborator) commented Mar 5, 2026

Intent

Fixes #483

Approach

Removes the connectapi_ptypes dictionary and vctrs-based type coercion system (ensure_columns, ensure_column, vec_cast). Instead, each getter function declares its own datetime_cols and applies lightweight post-parse coercion via coerce_datetime(), which handles RFC 3339 strings, epoch seconds, POSIXct pass-through, and all-NA columns.
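The coercion described above can be sketched roughly as follows. This is an illustrative shape only, not the actual implementation; everything except the coerce_datetime() name is an assumption:

```r
# Sketch of the per-getter datetime coercion (illustrative, not the real code).
coerce_datetime <- function(df, datetime_cols) {
  for (col in intersect(datetime_cols, names(df))) {
    x <- df[[col]]
    if (inherits(x, "POSIXct")) {
      next                                     # already parsed: pass through
    } else if (all(is.na(x))) {
      # all-NA column: retype without attempting to parse
      df[[col]] <- .POSIXct(rep(NA_real_, length(x)), tz = "UTC")
    } else if (is.numeric(x)) {
      # epoch seconds
      df[[col]] <- as.POSIXct(x, origin = "1970-01-01", tz = "UTC")
    } else {
      # RFC 3339 strings
      df[[col]] <- parse_connect_rfc3339(x)
    }
  }
  df
}
```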

Parsing pipeline:

  • Connect$request() now uses jsonlite::fromJSON() instead of httr::content(as = "parsed"), giving us control over jsonlite's simplification behavior
  • parse_connect_rfc3339() uses a vectorized substr-based parser (faster than strptime on large vectors)
  • page_cursor() fetches subsequent pages with simplify=TRUE so jsonlite builds data frames in C, then combines with vctrs::vec_rbind()
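The substr-based parsing idea can be sketched like this (a simplified sketch, not the package's parse_connect_rfc3339(); fractional seconds and non-UTC offsets are omitted). Extracting fixed-width fields with substr() and assembling the result with ISOdatetime() avoids strptime()'s per-element format handling on large vectors:

```r
# Simplified vectorized RFC 3339 parser for UTC timestamps like
# "2026-03-05T20:13:00Z" (illustrative sketch only).
parse_rfc3339_utc <- function(x) {
  ISOdatetime(
    as.integer(substr(x, 1, 4)),    # year
    as.integer(substr(x, 6, 7)),    # month
    as.integer(substr(x, 9, 10)),   # day
    as.integer(substr(x, 12, 13)),  # hour
    as.integer(substr(x, 15, 16)),  # minute
    as.numeric(substr(x, 18, 19)),  # second
    tz = "UTC"
  )
}
```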

I started this work while trying to improve performance of get_usage_static() (see profiling on #501 (comment)). With this branch, get_usage_static() is ~20% faster than on main for 175k records.

Checklist

  • Does this change update NEWS.md (referencing the connected issue if necessary)?
  • Does this change need documentation? Have you run devtools::document()?

@karawoo karawoo requested a review from jonkeane March 5, 2026 20:13
@karawoo karawoo mentioned this pull request Mar 5, 2026
Comment on lines +860 to +862
# Keep only the columns relevant to job termination; the API response
# includes extra fields (e.g. payload, guid) on error that vary by outcome.
keep <- c("app_id", "app_guid", "job_key", "job_id", "result", "code", "error")
Collaborator


Is it a problem that we get variable data at this point?

Collaborator Author


I believe that the variable data is for error cases (i.e. trying to terminate a job that is not currently active, which returns a 409 but doesn't raise an R error) vs. successful requests. I guess it's debatable whether we should be raising an R error more eagerly, but I think the behavior of accommodating different field names is consistent with main.

The one difference here is that main will always return the columns of keep, whereas on this branch, if any columns from keep are missing across all responses, they'd be omitted. I'll update to make it more consistent with main.
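One way to guarantee all keep columns come back regardless of the response shape (an illustrative sketch, not the exact fix) is to add any missing ones as NA before subsetting:

```r
# Ensure every column in `keep` is present, matching main's behavior
# (illustrative sketch; `df` is the combined response data frame).
missing <- setdiff(keep, names(df))
df[missing] <- NA
df <- df[keep]
```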

Collaborator


I was actually thinking a bit in the opposite direction: if we sometimes get different field names, that's probably totally OK. This will matter more (or really, become possible) once these objects aren't data frames that get passed around; then the normalization of "here are the columns you're getting" can happen at the as.data.frame() point. We don't need to do this here, it just smelled a little funny to me.

Collaborator Author


Gotcha, yeah. If we were returning a list then I agree it'd make more sense not to worry about the columns -- the responses are what they are, and that may include different fields.



Development

Successfully merging this pull request may close these issues.

Remove or drastically overhaul type parsing even when producing data frames
