A parallel computing model describes how a system is designed and implemented to perform multiple tasks simultaneously: a larger problem is divided into smaller sub-problems that can be solved concurrently, and their results are combined. These models are used to improve the performance and efficiency of computing systems, typically by shortening the time a given problem takes to solve.
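As a minimal sketch of the divide-and-solve-concurrently idea, the Python example below splits a list into chunks, hands each chunk to a worker, and combines the partial results. The function names `chunk` and `parallel_sum_of_squares`, the default of four workers, and the sum-of-squares task itself are illustrative assumptions, not part of any particular model. It uses the standard library's `ThreadPoolExecutor` for simplicity; in CPython, threads do not speed up CPU-bound work, so a process pool would likely be the better fit for a real numeric workload.

```python
from concurrent.futures import ThreadPoolExecutor


def chunk(data, parts):
    """Split data into at most `parts` contiguous slices of near-equal size."""
    size = max(1, -(-len(data) // parts))  # ceiling division, at least 1
    return [data[i:i + size] for i in range(0, len(data), size)]


def parallel_sum_of_squares(data, workers=4):
    """Sum x*x over data, treating each chunk as an independent sub-problem."""
    chunks = chunk(data, workers)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        # Each chunk is solved concurrently; map preserves chunk order.
        partials = pool.map(lambda c: sum(x * x for x in c), chunks)
        # Combine the partial results into the final answer.
        return sum(partials)
```

Calling `parallel_sum_of_squares(list(range(10)))` gives the same result as the sequential `sum(x * x for x in range(10))`; only the way the work is scheduled differs.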