symfonyserviceentities

Sf2 : using a service inside an entity


i know this has been asked over and over again, i read the topics, but it's always focused on specific cases and i generally try to understand why its not best practise to use a service inside an entity.

Given a very simple service :

Class Age
{
  private $date1;
  private $date2;
  private $format;

  const ym = "%y years and %m month"
  const ...


  // some DateTime()->diff() methods, checking, formating the entry formats, returning different period formats for eg.
  }

and a simple entity :

Class People
{
  private $firstname;
  private $lastname;
  private $birthday;

  }

From a controller, i want to do :

$som1 = new People('Paul', 'Smith', '1970-01-01');
$som1->getAge();

Off course i can rewrite the getAge() function inside my entity, its not long, but im very lazy and as i've already written all the possible datetime->diff() i need in the above service, i dont understand why i shouldnt use'em...

NB : my question isnt about how to inject the container in my entity, i can understand why this doesnt make sense, but more what wld be the best practise to avoid to rewrite the same function in different entities.

Inheritance seems to be a bad "good idea" as i could use the getAge() inside a class BlogArticle and i doubt that this BlogArticle Class should be inheriting from the same class as a People class...

Hope i was clear, but not sure...


Solution

  • One major confusion for many coders is to think that doctrine entities "are" the model. That is a mistake.

    Injecting services into your doctrine entities is a symptom of "trying to do more things than storing data" into your entities. When you see that "anti-pattern" most probably you are violating the "Single Responsibility" principle in SOLID programming.

    Symfony is not an MVC framework, it is a VC framework only. Lacks the M part. Doctrine entities (I'll call them entities from now on, see clarification at the end) are a "data persistence layer", not a "model layer". Symfony has lots of things for views, web controllers, command controllers... but has no help for domain modelling ( http://en.wikipedia.org/wiki/Domain_model ) - even the persistence layer is Doctrine, not Symfony.

    Overcoming the problem in SF2

    When you "need" services in a data-layer, trigger an antipattern alert. Storage should be only a "put here - get from there" system. Nothing else.

    To overcome this problem, you should inject the services into a "logic layer" (Model) and separate it from "pure storage" (data-persistence layer). Following the single responsibility principle, put the logics in one side, put the getters and setters to MySQL in another.

    The solution is to create the missing Model layer, not present in Symfony2, and make it to give "the logic" of the domain objects, completely separated and decoupled from the layer of data-persistence which knows "how to store" the model into a MySQL database with doctrine, or to a redis, or simply to a text file.

    All those storage systems should be interchangeable and your Model should still expose the very same public methods with absolutely no change to the consumer.

    Here's how you do it:

    Step 1: Separate the model from the data-persistence

    To do so, in your bundle, you can create another directory named Model at the bundle-root level (besides tests, DependencyInjection and so), as in this example of a game.

    Model should be a separated layer different from the persistence layer

    Particularly for your question

    In your example, you could have:

    Entity/People.php
    Model/People.php
    

    Step 2: Use factories

    Then, you must remember that the consumers of the model, should never ever create model objects using "new". They should use a factory instead, that will setup the model objects properly (will bind to the proper data-storage layer). The only exception is in unit-testing (we'll see it later). But apart from unitary tests, grab this with fire in your brain, and tattoo it with a laser in your retina: Never do a 'new' in a controller or a command. Use the factories instead ;)

    To do so, you create a service that acts as the "getter" of your model. You create the getter as a factory accessible thru a service. See the image:

    Use a service as a factory to get your model

    You can see a BoardManager.php there. It is the factory. It acts as the main getter for anything related to boards. In this case, the BoardManager has methods like the following:

    public function createBoardFromScratch( $width, $height )
    public function loadBoardFromJson( $document )
    public function loadBoardFromTemplate( $boardTemplate )
    public function cloneBoard( $referenceBoard )
    

    Particularly for your question

    In your example, you would have a PeopleManager in the model, able to get the people objects as you need.

    Also in the Model, you should use the proper singular-plural names, as this is decoupled from your data-persistence layer. Seems you are currently using People to represent a single Person - this can be because you are currently (wrongly) matching the model to the database table name.

    So, involved model classes will be:

    PeopleManager -> the factory
    People -> A collection of persons.
    Person -> A single person.
    

    For example (pseudocode! using C++ notation to indicate the return type):

    PeopleManager
    {
        // Examples of getting single objects:
        Person getPersonById( $personId ); -> Load it from somewhere (mysql, redis, mongo, file...)
        Person ClonePerson( $referencePerson ); -> Maybe you need or not, depending on the nature the your problem that your program solves.
        Person CreatePersonFromScratch( $name, $lastName, $birthDate ); -> returns a properly initialized person.
    
        // Examples of getting collections of objects:
        People getPeopleByTown( $townId ); -> returns a collection of people that lives in the given town.
    }
    
    People implements ArrayObject
    {
        // You could overload assignment, so you can throw an exception if any non-person object is added, so you can always rely on that People contains only Person objects.
    }
    
    Person
    {
        private $firstname;
        private $lastname;
        private $birthday;
    }
    

    So, continuing with your example, when you do...

    // **Never ever** do a new from a controller!!!
    $som1 = new People('Paul', 'Smith', '1970-01-01');
    $som1->getAge();
    

    ...you now can mutate to:

    // Use factory services instead:
    $peopleManager = $this->get( 'myproject.people.manager' );
    $som1 = $peopleManager->createPersonFromScratch( 'Paul', 'Smith', '1970-01-01' );
    $som1->getAge();
    

    The PeopleManager will do the newfor you.

    At this point, your variable $som1 of type Person, as it was created by the factory, can be pre-populated with the necessary mechanics to store and save to the persistence layer.

    The myproject.people.manager will be defined in your services.yml and will have access to the doctrine either directly, either via a 'myproject.persistence.manager` layer or whatever.

    Note: This injection of the persistence layer via the manager, has several side effects, that would side track from "how to make the model have access to services". See steps 4 and 5 for that.

    Step 3: Inject the services you need via the factory.

    Now you can inject any services you need into the people.manager

    You, if your model object needs to access that service, you have now 2 choices:

    In this example, we provide the PeopleManager with the service to be consumed by the model. When the people manager is requested a new model object, it injects the service needed to it in the new sentence, so the model object can access the external service directly.

    // Example of injecting the low-level service.
    class PeopleManager
    {
        private $externalService = null;
    
        class PeopleManager( ServiceType $externalService )
        {
            $this->externalService = $externalService;
        }
    
        public function CreatePersonFromScratch()
        {
            $externalService = $this->externalService;
            $p = new Person( $externalService );
        }
    }
    
    class Person
    {
        private $externalService = null;
    
        class Person( ServiceType $externalService )
        {
            $this->externalService = $externalService;
        }
    
        public function ConsumeTheService()
        {
            $this->externalService->nativeCall();  // Use the external API.
        }
    }
    
    // Using it.
    $peopleManager = $this->get( 'myproject.people.manager' );
    $person = $peopleManager->createPersonFromScratch();
    $person->consumeTheService()
    

    In this example, we provide the PeopleManager with the service to be consumed by the model. Nevertheless, when the people manager is requested a new model object, it injects itself to the object created, so the model object can access the external service via the manager, which then hides the API, so if ever the external service changes the API, the manager can do the proper conversions for all the consumers in the model.

    // Second example. Using the manager as a proxy.
    class PeopleManager
    {
        private $externalService = null;
    
        class PeopleManager( ServiceType $externalService )
        {
            $this->externalService = $externalService;
        }
    
        public function createPersonFromScratch()
        {
            $externalService = $this->externalService;
            $p = new Person( $externalService);
        }
    
        public function wrapperCall()
        {
             return $this->externalService->nativeCall();
        }
    }
    
    class Person
    {
        private $peopleManager = null;
    
        class Person( PeopleManager $peopleManager )
        {
            $this->peopleManager = $peopleManager ;
        }
    
        public function ConsumeTheService()
        {
            $this->peopleManager->wrapperCall(); // Use the manager to call the external API.
        }
    }
    
    // Using it.
    $peopleManager = $this->get( 'myproject.people.manager' );
    $person = $peopleManager->createPersonFromScratch();
    $person->ConsumeTheService()
    

    Step 4: Throw events for everything

    At this point, you can use any service in any model. Seems all is done.

    Nevertheless, when you implement it, you will find problems at decoupling the model with the entity, if you want a truly SOLID pattern. This also applies to decoupling this model from other parts of the model.

    The problem clearly arises at places like "when to do a flush()" or "when to decide if something must be saved or left to be saved later" (specially in long-living PHP processes), as well as the problematic changes in case the doctrine changes its API and things like this.

    But is also true when you want to test a Person without testing its House, but the House must "monitor" if the Person changes its name to change the name in the mailbox. This is specially try for long-living processes.

    The solution to this is to use the observer pattern ( http://en.wikipedia.org/wiki/Observer_pattern ) so your model objects throw events nearly for anything and an observer decides to cache data to RAM, to fill data or to store data to the disk.

    This strongly enhances the solid/closed principle. You should never change your model if the thing you change is not domain-related. For example adding a new way of storing to a new type of database, should require zero edition on your model classes.

    You can see an example of this in the following image. In it, I highlight a bundle named "TurnBasedBundle" that is like the core functionality for every game that is turn-based, despite if it has a board or not. You can see that the bundle only has Model and Tests.

    Every game has a ruleset, players, and during the game, the players express the desires of what they want to do.

    In the Game object, the instantiators will add the ruleset (poker? chess? tic-tac-toe?). Caution: what if the ruleset I want to load does not exist?

    When initializing, someone (maybe the /start controller) will add players. Caution: what if the game is 2-players and I add three?

    And during the game the controller that receives the players movements will add desires (for example, if playing chess, "the player wants to move queen to this tile" -which may be a valid, or not-.

    In the picture you can see those 3 actions under control thanks to the events.

    Example of throwing events from the model for virtually everything that can happen

    1. Someone calls the $game->addPlayer( $player ) method.
    2. As soon as we enter the addPlayer() function, the PRE event is raised.
    3. The observers then can catch this event to decide if a player can be added or not.
    4. All PRE events should come with a cancel passed by reference. So if someone decides this is a game for 2 players and you try to add a 3rd one, the $cancel will be set to true.
    5. Then you are again inside the addPlayer function. You can check if someone wanted to cancel the operation.
    6. Do the operation if allowed (ie: mutate the $this-> state).
    7. After the state has been changed, raise a POST event to indicate the observers that the operation has been completed.

    In the picture you see three, but of course it has a lot lot more. As a rule of thumb, you will have nearly 2 events per setter, 2 events per method that can modify the state of the model and 1 event for each "unavoidable" action. So if you have 10 methods on a class that operate on it, you can expect to have about 15 or 20 events.

    You can easily see this in the typical simple text box of any graphyc library of any operating system: Typical events will be: gotFocus, lostFocus, keyPress, keyDown, keyUp, mouseDown, mouseMove, etc...

    Particularly, in your example

    The Person will have something like preChangeAge, postChangeAge, preChangeName, postChangeName, preChangeLastName, postChangeLastName, in case you have setters for each of them.

    For long-living actions like "person, do walk for 10 seconds" you maybe have 3: preStartWalking, postStartWalking, postStopWalking (in case a stop of 10 seconds cannot be programatically prevented).

    If you want to simplify, you can have two single preChanged( $what, & $cancel ) and postChanged( $what ) events for everything.

    If you never prevent your changes to happen, you can even just have one single event changed() for all and any change to your model. Then your entity will just "copy" the model properties in the entity properties at every change. This is OK for simple classes and projects or for structures you are not going to publish for third-party consumers, and saves some coding. If the model class becomes a core class to your project, spending a bit of time adding all the events list will save you time in the future.

    Step 5: Catch the events from the data layer.

    It is at this point that your data-layer bundle enters in action!!!

    Make your data layer an observer of your model. When the model Changes its internal state then make your Entity to "copy" that state into the entity state.

    In this case, the MVC acts as expected: The Controller, operates on the Model. The consequences of this are still hidden from the controller (as the controller should not have access to Doctrine). The model "broadcasts" the operation made, so anyone interested knows, which in turn triggers that the data-layer knows about the model change.

    Particularly, in your project

    The Model/Person object will have been created by the PeopleManager. When creating it, the PeopleManager, which is a service, and therefore can have other services injected, can have the ObjectStorageManager subsystem handy. So the PeopleManager can get the Entity/People that you reference in your question and add the Entity/People as an observer to Model/Person.

    In the Entity/People mainly you substitute all the setters by event catchers.

    You read your code like this: When the Model/Person changes its LastName, the Entity/People will be notified and will copy the data into its internal structure.

    Most probably, you are tempted to inject the entity inside the model, so instead of throwing an event, you call the setters of the Entity.

    But with that approach, you 'break' the Open-Closed principle. So if at any given point you want to migrate to MongoDb, you need to "change" your "entities" by "documents" in your model. With the observer-pattern, this change occurs outside the model, who never knows the nature of the observer beyond that is its a PersonObserver.

    Step 6: Unit test everything

    Finally, you want to unit test your software. As this pattern I have explained overcomes the anti-pattern that you discovered, you can (and you should) unit-test the logics of your model independently of how that is stored.

    Following this pattern, helps you to go towards the SOLID principles, so each "unit of code" is independent on the others. This will allow you to create unit-tests that will test the "logics" of your Model without writing to the database, as it will inject a fake data-storage layer as a test-double.

    Let me use the game example again. I show you in the image the Game test. Assume all games can last several days and the starting datetime is stored in the database. We in the example currently test only if getStartDate() returns a dateTime object.

    enter image description here

    There are some arrows in it, that represent the flow.

    In this example, from the two injecting strategies I told you, I choose the first one: To inject into the Game model object the services it needs (in this case a BoardManager, PieceManager and ObjectStorageManager) and not to inject the GameManager itself.

    1. First, you invoke phpunit that will call look for the Tests directory, recursively in all the directories, finding classes named XxxTest. Then will desire to invoke all the methods named textSomething().
    2. But before calling it, for each test method it calls the setup().
    3. In the setup we will create some test-doubles to avoid "real access" to the database when testing, while correctly testing the logics in our model. In this case a double of my own data layer manager, ObjectStorageManager.
    4. It is assigned to a temporary variable for clarity...
    5. ...that is stored in the GameTest instance...
    6. ...for later use in the test itself.
    7. The $sut (system under test) variable is then created with a new command, not via a manager. Do you remember that I said that tests were an exception? If you use the manager (you still can) here it is not a unit-test, it's an integration test because tests two classes: the manager and the game. In the new command we fake all the dependencies that the model has (like a board manager, and like a piece manager). I am hardcoding GameId = 1 here. This relates to data-persistance, see below.
    8. We then may call the system under test (a simple Game model object) to test its internals.

    I am hardcoding "Game id = 1" in the new. In this case we are only testing that the returned type is a DateTime object. But in case we want to test also that the date that it gets is the proper one, we can "tune" the ObjectStorageManager (data-persistance layer) mock to return whatever we want in the internal call, so we could test that for example when I request the date to the data-layer for game=1 the date is 1st-jun-2014 and for game=2 the date is 2nd-jun-2014. Then in the testGetStartDate I would create 2 new instances, with Ids 1 and 2 and check the content of the result.

    Particularly, in your project

    You will have a Test/Model/PersonTest unit test that will be able to play with the logics of the person, and in case of needing a person from the database, you will fake it thru the mock.

    In case you want to test the storing of the person to the database, it is enough that you unit-test that the event is thrown, no matter who listens to it. You can create a fake listener, attach to the event, and when the postChangeAge happens mark a flag and do nothing (no real database storage). Then you assert that the flag is set.

    In short:

    1. Do not confuse logics and data-persistance. Create a Model that has nothing to do with entities, and put all the logics in it.
    2. Do never use new to get your models from any consumer. Use factory services instead. Special attention to avoid news in controllers and commands. Exception: The unit-test is the only consumer that can use a new.
    3. Inject the services you need in the Model via the factory, which in turn receives it from the services.yml configuration file.
    4. Throw events for everything. When I say everything, means everything. Just imagine you observe the model. What would you like to know? Add an event for it.
    5. Catch the events from controllers, views, commands and from other parts of the model, but, specially, catch them in the data-storage layer, so you can "copy" the object to the disk without being intrusive to the model.
    6. Unit test your logics without depending on any real database. Attach the real database storage system in production and attach a dummy implementation for your tests.

    Seems a lot of work. But it is not. It is a matter of getting used to it. Just think about the "objects" you need, create them and make the data-layer be "monitors" of your objects. Then your objects are free to run, decoupled. If you create the model from a factory, inject any needed service in the model to the model, and leave the data alone.

    Edit apr/2016 - Separating Domain from Persistance

    All occurences of the word entity in this answer are referring to the "doctrine entities" which is what causes confusion to the majority of coders, between the model layer and the persistance layer which should be always different.

    Edit jun/2019 - CQRS+ES analogy

    Ancients already used persistant history methods to recod things (for example placing marks on a stone to register transactions).

    Since a decade long the CQRS+ES approach (Command Query Responsability Segregation + Event Sourcing) in programming has been growing in popularity, bringing that idea of "the history is immutable" to the programs we code and today many coders think of separating the command side vs the query side. If you don't know what I'm talking about, no worries, just skip the next paragraphs.

    The growing popularity of CQRS+ES in the last 3 or 4 years makes me think to consider a comment here and how it relates to what I answered here 5 years ago:

    This answer was thought as 1 single model, not a write-model and a read-model. But I'm happy to see many overlapping ideas.

    Think of the PRE events, I mention here, as the "commands and the write-model". Think of the POST events as the "Event Sourcing part going towards the read-model".

    In CQRS you can easily find that "commands can be accepted or not" in function of the internal state. Usually one implements them throwing exceptions but there are other alternatives there, like answering if the command was accepted or not.

    For example, in a "Train" I can "set it to X speed". But if the state is that the train is in a rail that cannot go further 80Km/h, then setting it to 200 should be rejected.

    This is ANALOGOUS to the cancel boolean passed by reference where an entity could just "reject" something PRIOR to its state change.

    Instead the POST events do not carry the "cancel" event and are thrown AFTER the state change happened. This is why you could not cancel them: They talk about the "state change that actually occurred" and therefore it cannot be cancelled: It aleady happened.

    So...

    In my answer of 2014, the "pre" events match with the "Command acceptance" of the CQRS+ES systems (the command can be accepted or rejected), and the "post" events match the "Domain events" of the CQRS+ES systems (it just informs that the change actually already happened, do whatever you want with that information).