Using OpenAI’s Realtime API with server_vad mode

I’m trying to set up the Realtime API, but I’m a little confused about how the audio-to-audio events are supposed to work in server_vad mode.

Currently I do the following:

  1. set up the Realtime client:
     const client = new RealtimeClient({ apiKey: process.env.OPENAI_API_KEY });
  2. set the client session:
     client.updateSession({
       instructions: "be nice and helpful",
       input_audio_transcription: { model: 'whisper-1' },
       turn_detection: { type: "server_vad" },
     });
  3. listen for events:
     client.on('realtime.event', (event) => {
       console.log("Realtime Event: ", event);
     });
  4. connect:
     await client.connect();
  5. send some audio:
     client.appendInputAudio(data);
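For reference, the data passed to appendInputAudio in step 5 is assumed here to be 16-bit PCM (mono, 24 kHz) supplied as an Int16Array, which is my understanding of what the reference client expects by default. A sketch of converting Float32 samples (e.g. from the Web Audio API) into that shape — floatTo16BitPCM is a hypothetical helper name, not part of the library:

```javascript
// Hypothetical helper: convert Float32 audio samples in [-1, 1]
// into a 16-bit PCM Int16Array (assumption: the Realtime API's
// default input format of pcm16, mono, 24 kHz).
function floatTo16BitPCM(float32Array) {
  const int16 = new Int16Array(float32Array.length);
  for (let i = 0; i < float32Array.length; i++) {
    // Clamp to [-1, 1], then scale to the signed 16-bit range.
    const s = Math.max(-1, Math.min(1, float32Array[i]));
    int16[i] = s < 0 ? s * 0x8000 : s * 0x7fff;
  }
  return int16;
}

// e.g. client.appendInputAudio(floatTo16BitPCM(samples));
```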

However, I never receive any events other than the input_audio_buffer.append event. Is there another step I need to take to ensure I receive a response?
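In case it’s relevant, my understanding is that server_vad turn detection also accepts tuning fields in the session config (threshold, prefix_padding_ms, silence_duration_ms, per the Realtime API session schema). A sketch of a fuller session update — the numeric values here are only illustrative:

```javascript
// Illustrative server_vad tuning; field names from the Realtime API
// session schema, values are example numbers, not recommendations.
client.updateSession({
  turn_detection: {
    type: 'server_vad',
    threshold: 0.5,           // speech-detection activation threshold
    prefix_padding_ms: 300,   // audio kept from before detected speech
    silence_duration_ms: 500, // silence required to end the turn
  },
});
```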