There is very little marginal complexity in the implementation that is due to bare/legacy CTV. On the other hand, there is significant complexity in an additional fork deployment - both in terms of added code and process.
What is the concrete reason that CTV as OP_NOP4
is a sticking point for you?