Reinforcement Learning And Planning For Preference Balancing Tasks