I agree with you that -soundex- may not be appropriate, given that it
assumes English pronunciation, but it may still be a reasonable option to
try given how similar the strings you provided as examples are (despite
being French). It may or may not work, though.
Soundex was created to encode foreign names in the Latin alphabet while
remaining robust to transliteration problems. Other than its lack of
support for accented characters, it isn't especially English-oriented.
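
To get a feel for how Soundex treats accented French names, here is a
minimal Python sketch of the common American Soundex rules. It is only an
illustration: the accent-stripping step is my own addition, the names
used at the end are hypothetical examples (not the strings from this
thread), and the codes are not guaranteed to match what Stata's
-soundex- returns in every case.

```python
import unicodedata

# Letter-to-digit table used by American Soundex.
# Vowels, Y, H and W have no code.
GROUPS = {"1": "BFPV", "2": "CGJKQSXZ", "3": "DT", "4": "L", "5": "MN", "6": "R"}
CODES = {ch: digit for digit, letters in GROUPS.items() for ch in letters}


def strip_accents(text):
    """Drop combining marks so that 'è' becomes 'e' (an added preprocessing step)."""
    decomposed = unicodedata.normalize("NFKD", text)
    return "".join(ch for ch in decomposed if not unicodedata.combining(ch))


def soundex(name):
    """Return a 4-character Soundex code, or '' if the name has no letters."""
    letters = [ch for ch in strip_accents(name).upper() if "A" <= ch <= "Z"]
    if not letters:
        return ""
    result = letters[0]
    prev = CODES.get(letters[0], "")
    for ch in letters[1:]:
        if ch in "HW":
            continue  # H and W do not separate letters with the same code
        code = CODES.get(ch, "")
        if code and code != prev:
            result += code
        prev = code  # a vowel resets prev, so a repeated code is kept
        if len(result) == 4:
            break
    return result.ljust(4, "0")


# Hypothetical French spellings that one might want to match.
for name in ["Lefèvre", "Lefebvre", "Lefevre"]:
    print(name, soundex(name))
```

All three spellings above receive the same code, which is the kind of
behavior that makes Soundex worth a try even on non-English names. Whether
it groups your actual strings sensibly is something only a test on your
data will tell.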