German Credit data set 1: Description. A numerical version of the Statlog German Credit Data data set. Here, the task is to clasify customers as good (1) or bad (2), depending on 20 features about them and their bancary accounts. In this problem, the use of an additional cost matrix is suggested, because it is worse to class a customer as good when they are bad (cost 5), than it is to class a customer as bad when they are good (cost 1). 2: Type. Classification 3: Origin. Real world 4: Instances. 1000 5: Features. 20 6: Classes. 2 7: Missing values. No 8: Header. @relation german @attribute StatusAccount {A11, A12, A14, A13} @attribute DurationMonth integer [4, 72] @attribute CreditHistory {A34, A32, A33, A30, A31} @attribute Purpose {A43, A46, A42, A40, A41, A49, A44, A45, A410, A48} @attribute CreditAmount integer [250, 18424] @attribute SavingsAccount {A65, A61, A63, A64, A62} @attribute EmploymentSince {A75, A73, A74, A71, A72} @attribute InstallmentRate integer [1, 4] @attribute StatusAndSex {A93, A92, A91, A94} @attribute Guarantors {A101, A103, A102} @attribute ResidenceSince integer [1, 4] @attribute Property {A121, A122, A124, A123} @attribute Age integer [19, 75] @attribute InstallmentPlans {A143, A141, A142} @attribute Housing {A152, A153, A151} @attribute NCredits integer [1, 4] @attribute Job {A173, A172, A174, A171} @attribute NPeopleMain integer [1, 2] @attribute Telephone {A192, A191} @attribute ForeignWorker {A201, A202} @attribute Customer {1,2} @inputs StatusAccount, DurationMonth, CreditHistory, Purpose, CreditAmount, SavingsAccount, EmploymentSince, InstallmentRate, StatusAndSex, Guarantors, ResidenceSince, Property, Age, InstallmentPlans, Housing, NCredits, Job, NPeopleMain, Telephone, ForeignWorker @outputs Customer