The Detection and Optimization team in Perception is responsible for designing neural network model architectures that act as the detection backbones for Zoox's sensor data. We work on incorporating state of the art language alignment strategies into the sensor modality backbones. This team helps Zoox advance faster into new geofences by making the models more generalizable and adaptable to new circumstances.