Just a guess here, but I wouldn't be surprised if it's used to better spy on your Messenger audio conversations. They already listen in and pick up keywords to populate your FB ad stream.
If I can reconstruct your conversation (through other metadata) without listening to the sound of your voices, have I not listened to your conversation?